Update README.md
Browse files
README.md
CHANGED
@@ -38,4 +38,17 @@ model-index:
|
|
38 |
value: 0.3521461673080921 +/- 0.28661129618806847
|
39 |
name: "Reward for Moving Resources to Self"
|
40 |
verified: true
|
41 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
38 |
value: 0.3521461673080921 +/- 0.28661129618806847
|
39 |
name: "Reward for Moving Resources to Self"
|
40 |
verified: true
|
41 |
+
---
|
42 |
+
|
43 |
+
This model serves as the baseline for the **Wildfire Resource Management** environment, trained and tested on task <code>2</code> with difficulty <code>5</code> using the Proximal Policy Optimization (PPO) algorithm.<br><br>
|
44 |
+
|
45 |
+
Environment: **Wildfire Resource Management**<br>
|
46 |
+
Task: <code>2</code><br>
|
47 |
+
Difficulty: <code>5</code><br>
|
48 |
+
Algorithm: <code>PPO</code><br>
|
49 |
+
Episode Length: <code>500</code><br>
|
50 |
+
Training <code>max_steps</code>: <code>450000</code><br>
|
51 |
+
Testing <code>max_steps</code>: <code>45000</code><br><br>
|
52 |
+
|
53 |
+
Train & Test [Scripts](https://github.com/hivex-research/hivex)<br>
|
54 |
+
Download the [Environment](https://github.com/hivex-research/hivex-environments)
|