This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 3 with difficulty 9 using the Proximal Policy Optimization (PPO) algorithm.
Environment: Drone-Based Reforestation
Task: 3
Difficulty: 9
Algorithm: PPO
Episode Length: 2000
Training max_steps: 1200000
Testing max_steps: 300000
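As a rough sanity check on the step budgets, the settings listed here can be collected in a small Python dictionary and converted into episode counts. This is only a sketch: the dictionary keys and the helper `full_length_episodes` are illustrative names, and it assumes that `max_steps` counts environment steps for a single agent and that every episode runs for the full 2000 steps. The model card does not state how steps are counted.

```python
# Settings copied from the model card; the key names are illustrative, not an official schema.
config = {
    "environment": "Drone-Based Reforestation",
    "task": 3,
    "difficulty": 9,
    "algorithm": "PPO",
    "episode_length": 2000,
    "train_max_steps": 1_200_000,
    "test_max_steps": 300_000,
}


def full_length_episodes(max_steps: int, episode_length: int) -> int:
    """Number of full-length episodes that fit into a step budget.

    Assumption: steps are counted per agent and no episode terminates early.
    """
    return max_steps // episode_length


train_episodes = full_length_episodes(config["train_max_steps"], config["episode_length"])
test_episodes = full_length_episodes(config["test_max_steps"], config["episode_length"])
print(f"training: {train_episodes} episodes, testing: {test_episodes} episodes")
```

Under these assumptions the training budget covers 600 full-length episodes and the testing budget covers 150. If steps are summed across several agents, or if episodes can end early, the real counts will differ.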
Train & Test Scripts
Download the Environment
Evaluation results
- Cumulative Distance Reward on hivex-drone-based-reforestation (self-reported): 1.3542225050926209 +/- 0.23333899783579354
- Cumulative Distance Until Tree Drop on hivex-drone-based-reforestation (self-reported): 49.953554458618164 +/- 6.4635271249103035
- Cumulative Distance to Existing Trees on hivex-drone-based-reforestation (self-reported): 65.1268350982666 +/- 6.275221217555885
- Cumulative Normalized Distance Until Tree Drop on hivex-drone-based-reforestation (self-reported): 0.13542225003242492 +/- 0.02333390047639469
- Cumulative Tree Drop Reward on hivex-drone-based-reforestation (self-reported): 4.086031370162964 +/- 0.8049852876065032
- Out of Energy Count on hivex-drone-based-reforestation (self-reported): 0.03118088087067008 +/- 0.020989988346072946
- Recharge Energy Count on hivex-drone-based-reforestation (self-reported): 11.123015098571777 +/- 0.6124465630966653
- Tree Drop Count on hivex-drone-based-reforestation (self-reported): 0.9591382348537445 +/- 0.028923095592019384
- Cumulative Reward on hivex-drone-based-reforestation (self-reported): 102.52203262329101 +/- 3.3865961577425283
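Each result is reported as a central value followed by "+/-" and a spread. A minimal sketch for turning such a string into numbers when comparing against this baseline is shown below. The helper name `parse_mean_spread` is hypothetical, and the card does not say whether the spread is a standard deviation or a standard error, so the code treats it only as a generic interval half-width.

```python
def parse_mean_spread(text: str) -> tuple[float, float]:
    """Split a 'value +/- spread' string into two floats."""
    mean_str, spread_str = text.split("+/-")
    # float() ignores the surrounding whitespace left over from the split.
    return float(mean_str), float(spread_str)


# Cumulative Reward as reported in the list above.
mean, spread = parse_mean_spread("102.52203262329101 +/- 3.3865961577425283")
low, high = mean - spread, mean + spread
print(f"cumulative reward: {mean:.2f} (range {low:.2f} to {high:.2f})")
```

A score from another run that falls inside this range is hard to distinguish from the baseline without knowing how the spread was computed.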