This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 3 with difficulty 8 using the Proximal Policy Optimization (PPO) algorithm.
Environment: Drone-Based Reforestation
Task: 3
Difficulty: 8
Algorithm: PPO
Episode Length: 2000
Training max_steps: 1200000
Testing max_steps: 300000
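To put the step budgets in relation to the episode length, the arithmetic can be sketched as below. This is only a rough upper bound under two assumptions that the card does not state: that max_steps counts the steps of a single agent, and that every episode runs for the full 2000 steps. The function name `full_episodes` is hypothetical and used only for illustration.

```python
# Values copied from the configuration listed above.
EPISODE_LENGTH = 2000
TRAIN_MAX_STEPS = 1_200_000
TEST_MAX_STEPS = 300_000


def full_episodes(max_steps: int, episode_length: int) -> int:
    """Return how many full-length episodes fit into a step budget.

    Assumption (not stated in the card): max_steps counts steps of a
    single agent and no episode terminates early.
    """
    return max_steps // episode_length


train_episodes = full_episodes(TRAIN_MAX_STEPS, EPISODE_LENGTH)
test_episodes = full_episodes(TEST_MAX_STEPS, EPISODE_LENGTH)
print(f"training: at most {train_episodes} full episodes")
print(f"testing:  at most {test_episodes} full episodes")
```

If the step counter is shared across several agents or parallel environments, the number of episodes per agent would be correspondingly lower.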
Train & Test Scripts
Download the Environment
Evaluation results
- Cumulative Distance Reward on hivex-drone-based-reforestation (self-reported): 1.3624776875972748 +/- 0.28554580887628567
- Cumulative Distance Until Tree Drop on hivex-drone-based-reforestation (self-reported): 48.52491760253906 +/- 6.240952470711805
- Cumulative Distance to Existing Trees on hivex-drone-based-reforestation (self-reported): 64.33281471252441 +/- 6.835068347254503
- Cumulative Normalized Distance Until Tree Drop on hivex-drone-based-reforestation (self-reported): 0.13624776691198348 +/- 0.02855457901250283
- Cumulative Tree Drop Reward on hivex-drone-based-reforestation (self-reported): 4.091673822402954 +/- 0.9169991715461574
- Out of Energy Count on hivex-drone-based-reforestation (self-reported): 0.03841008508577943 +/- 0.017628847741743853
- Recharge Energy Count on hivex-drone-based-reforestation (self-reported): 10.952794055938721 +/- 0.6585423813912649
- Tree Drop Count on hivex-drone-based-reforestation (self-reported): 0.9443181335926056 +/- 0.027217821741009757
- Cumulative Reward on hivex-drone-based-reforestation (self-reported): 101.03555679321289 +/- 3.1604495163459303
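For comparing these results against another run, it can help to load the reported values into a small structure. The following is a minimal sketch, not part of the original training or evaluation scripts. It assumes a simplified `name: value +/- spread` line format chosen for this example, and it makes no assumption about what kind of spread the `+/-` figure represents, since the card does not say. The names `RESULTS`, `PATTERN`, and `parse_results` are hypothetical.

```python
import re

# Three of the self-reported results above, values unchanged.
# The "name: value +/- spread" layout is an assumption of this sketch.
RESULTS = """\
Cumulative Reward: 101.03555679321289 +/- 3.1604495163459303
Tree Drop Count: 0.9443181335926056 +/- 0.027217821741009757
Out of Energy Count: 0.03841008508577943 +/- 0.017628847741743853
"""

PATTERN = re.compile(
    r"^(?P<name>.+?):\s*(?P<value>[-\d.eE]+)\s*\+/-\s*(?P<spread>[-\d.eE]+)$"
)


def parse_results(text):
    """Map each metric name to a (value, spread) tuple of floats."""
    metrics = {}
    for line in text.splitlines():
        match = PATTERN.match(line.strip())
        if match:  # silently skip lines that do not fit the format
            metrics[match["name"]] = (
                float(match["value"]),
                float(match["spread"]),
            )
    return metrics


metrics = parse_results(RESULTS)
for name, (value, spread) in metrics.items():
    # Relative spread makes metrics on different scales comparable.
    print(f"{name}: {value:.3f} +/- {spread:.3f} ({spread / value:.1%} of value)")
```

Reporting the spread relative to the value is one way to see that the rarely occurring Out of Energy Count varies far more, proportionally, than the Cumulative Reward.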