chrisjcc
/

utdg-maskableppo-policy

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions

chrisjcc commited on 7 days ago

Commit

09b634e

·

verified ·

1 Parent(s): 0208e0c

Upload final trained model

Files changed (3) hide show

README.md +5 -5
models/model_policy.zip +2 -2
models/model_policy_v0.3.5.zip +3 -0

README.md CHANGED Viewed

@@ -32,9 +32,9 @@ model-index:
             value: TBD
 pipeline_tag: reinforcement-learning
 metadata:
-  utc_timestamp: 2025-12-20T00:46:04.434122
   env_name: UTDGEnv-v0
-  model_file: model_policy.zip
   total_timesteps: 0
   learning_rate: 0.0003
   n_steps: 2048
@@ -248,7 +248,7 @@ from sb3_contrib import MaskablePPO
 # Download the model from Hugging Face Hub
 model_path = hf_hub_download(
     repo_id="chrisjcc/utdg-maskableppo-policy",
-    filename="model_policy.zip"
 )
 # Load the trained model
@@ -356,7 +356,7 @@ The model was trained using the MaskablePPO algorithm, which extends standard PP
 | File | Description |
 |------|-------------|
-| `model_policy.zip` | Trained MaskablePPO model checkpoint (SB3 format) |
 | `README.md` | This model card with full documentation |
 | `config.yaml` | Hydra configuration snapshot (if included) |
@@ -399,4 +399,4 @@ If you use this model in your research, please cite:
 ---
-*Generated on 2025-12-20T00:46:04.434122 UTC*

             value: TBD
 pipeline_tag: reinforcement-learning
 metadata:
+  utc_timestamp: 2025-12-27T18:33:32.663706
   env_name: UTDGEnv-v0
+  model_file: model_policy_v0.3.5.zip
   total_timesteps: 0
   learning_rate: 0.0003
   n_steps: 2048
 # Download the model from Hugging Face Hub
 model_path = hf_hub_download(
     repo_id="chrisjcc/utdg-maskableppo-policy",
+    filename="model_policy_v0.3.5.zip"
 )
 # Load the trained model
 | File | Description |
 |------|-------------|
+| `model_policy_v0.3.5.zip` | Trained MaskablePPO model checkpoint (SB3 format) |
 | `README.md` | This model card with full documentation |
 | `config.yaml` | Hydra configuration snapshot (if included) |
 ---
+*Generated on 2025-12-27T18:33:32.663706 UTC*

models/model_policy.zip CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aa351c45d0143cc3810b2ec75f95952611cf37c542ffb54fe4c1fcf1201bcbc3
-size 650323

 version https://git-lfs.github.com/spec/v1
+oid sha256:f34808d6a93d57b8f796064f0afb4e346da4fa9c0f0f926031df6cb0c52d0beb
+size 650374

models/model_policy_v0.3.5.zip ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f34808d6a93d57b8f796064f0afb4e346da4fa9c0f0f926031df6cb0c52d0beb
+size 650374