Upload final trained model
Browse files- README.md +5 -5
- models/model_policy.zip +2 -2
- models/model_policy_v0.3.5.zip +3 -0
README.md
CHANGED
|
@@ -32,9 +32,9 @@ model-index:
|
|
| 32 |
value: TBD
|
| 33 |
pipeline_tag: reinforcement-learning
|
| 34 |
metadata:
|
| 35 |
-
utc_timestamp: 2025-12-
|
| 36 |
env_name: UTDGEnv-v0
|
| 37 |
-
model_file:
|
| 38 |
total_timesteps: 0
|
| 39 |
learning_rate: 0.0003
|
| 40 |
n_steps: 2048
|
|
@@ -248,7 +248,7 @@ from sb3_contrib import MaskablePPO
|
|
| 248 |
# Download the model from Hugging Face Hub
|
| 249 |
model_path = hf_hub_download(
|
| 250 |
repo_id="chrisjcc/utdg-maskableppo-policy",
|
| 251 |
-
filename="
|
| 252 |
)
|
| 253 |
|
| 254 |
# Load the trained model
|
|
@@ -356,7 +356,7 @@ The model was trained using the MaskablePPO algorithm, which extends standard PP
|
|
| 356 |
|
| 357 |
| File | Description |
|
| 358 |
|------|-------------|
|
| 359 |
-
| `
|
| 360 |
| `README.md` | This model card with full documentation |
|
| 361 |
| `config.yaml` | Hydra configuration snapshot (if included) |
|
| 362 |
|
|
@@ -399,4 +399,4 @@ If you use this model in your research, please cite:
|
|
| 399 |
|
| 400 |
---
|
| 401 |
|
| 402 |
-
*Generated on 2025-12-
|
|
|
|
| 32 |
value: TBD
|
| 33 |
pipeline_tag: reinforcement-learning
|
| 34 |
metadata:
|
| 35 |
+
utc_timestamp: 2025-12-27T18:33:32.663706
|
| 36 |
env_name: UTDGEnv-v0
|
| 37 |
+
model_file: model_policy_v0.3.5.zip
|
| 38 |
total_timesteps: 0
|
| 39 |
learning_rate: 0.0003
|
| 40 |
n_steps: 2048
|
|
|
|
| 248 |
# Download the model from Hugging Face Hub
|
| 249 |
model_path = hf_hub_download(
|
| 250 |
repo_id="chrisjcc/utdg-maskableppo-policy",
|
| 251 |
+
filename="model_policy_v0.3.5.zip"
|
| 252 |
)
|
| 253 |
|
| 254 |
# Load the trained model
|
|
|
|
| 356 |
|
| 357 |
| File | Description |
|
| 358 |
|------|-------------|
|
| 359 |
+
| `model_policy_v0.3.5.zip` | Trained MaskablePPO model checkpoint (SB3 format) |
|
| 360 |
| `README.md` | This model card with full documentation |
|
| 361 |
| `config.yaml` | Hydra configuration snapshot (if included) |
|
| 362 |
|
|
|
|
| 399 |
|
| 400 |
---
|
| 401 |
|
| 402 |
+
*Generated on 2025-12-27T18:33:32.663706 UTC*
|
models/model_policy.zip
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f34808d6a93d57b8f796064f0afb4e346da4fa9c0f0f926031df6cb0c52d0beb
|
| 3 |
+
size 650374
|
models/model_policy_v0.3.5.zip
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f34808d6a93d57b8f796064f0afb4e346da4fa9c0f0f926031df6cb0c52d0beb
|
| 3 |
+
size 650374
|