This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 3
with difficulty 5
using the Proximal Policy Optimization (PPO) algorithm.
Environment: Drone-Based Reforestation
Task: 3
Difficulty: 5
Algorithm: PPO
Episode Length: 2000
Training max_steps
: 1200000
Testing max_steps
: 300000
Train & Test Scripts
Download the Environment
Evaluation results
- Cumulative Distance Reward on hivex-drone-based-reforestationself-reported1.317925215959549 +/- 0.28260177110908363
- Cumulative Distance Until Tree Drop on hivex-drone-based-reforestationself-reported48.28620391845703 +/- 7.283860263327832
- Cumulative Distance to Existing Trees on hivex-drone-based-reforestationself-reported64.57429847717285 +/- 5.444324231140867
- Cumulative Normalized Distance Until Tree Drop on hivex-drone-based-reforestationself-reported0.13179252222180365 +/- 0.02826017752675318
- Cumulative Tree Drop Reward on hivex-drone-based-reforestationself-reported4.009531931877136 +/- 0.661158168654566
- Out of Energy Count on hivex-drone-based-reforestationself-reported0.03808173710480332 +/- 0.021781055433560147
- Recharge Energy Count on hivex-drone-based-reforestationself-reported10.746735401153565 +/- 0.6862137746749559
- Tree Drop Count on hivex-drone-based-reforestationself-reported0.9473729825019837 +/- 0.03268839810742225
- Cumulative Reward on hivex-drone-based-reforestationself-reported101.15483459472657 +/- 3.824644657818079