This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 3
with difficulty 2
using the Proximal Policy Optimization (PPO) algorithm.
Environment: Drone-Based Reforestation
Task: 3
Difficulty: 2
Algorithm: PPO
Episode Length: 2000
Training max_steps
: 1200000
Testing max_steps
: 300000
Train & Test Scripts
Download the Environment
Evaluation results
- Cumulative Distance Reward on hivex-drone-based-reforestationself-reported1.2730848133563994 +/- 0.31384063871152373
- Cumulative Distance Until Tree Drop on hivex-drone-based-reforestationself-reported46.571126289367676 +/- 6.711804112230181
- Cumulative Distance to Existing Trees on hivex-drone-based-reforestationself-reported62.378562469482425 +/- 4.8385232941913126
- Cumulative Normalized Distance Until Tree Drop on hivex-drone-based-reforestationself-reported0.12730848103761672 +/- 0.03138406355454151
- Cumulative Tree Drop Reward on hivex-drone-based-reforestationself-reported4.010982251167297 +/- 0.6601700266326962
- Out of Energy Count on hivex-drone-based-reforestationself-reported0.05754003098234534 +/- 0.03282736545941193
- Recharge Energy Count on hivex-drone-based-reforestationself-reported11.035588264465332 +/- 0.725159645414964
- Tree Drop Count on hivex-drone-based-reforestationself-reported0.9222618734836578 +/- 0.04192513553044461
- Cumulative Reward on hivex-drone-based-reforestationself-reported98.53370529174805 +/- 5.198916602319761