This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 3 with difficulty 7 using the Proximal Policy Optimization (PPO) algorithm.
Environment: Drone-Based Reforestation
Task: 3
Difficulty: 7
Algorithm: PPO
Episode Length: 2000
Training max_steps: 1200000
Testing max_steps: 300000
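As a rough sanity check on the settings above, the step budgets can be related to the episode length. The sketch below is illustrative only: the dictionary keys and the helper `full_episodes` are hypothetical names, and it assumes every step belongs to exactly one full-length episode, which may not match how the training framework tallies steps across agents.

```python
# Settings copied from the list above; the key names are hypothetical.
config = {
    "episode_length": 2000,
    "train_max_steps": 1_200_000,
    "test_max_steps": 300_000,
}


def full_episodes(max_steps: int, episode_length: int) -> int:
    """Number of full-length episodes that fit in a step budget.

    Assumption: each step counts toward exactly one episode.
    """
    return max_steps // episode_length


train_episodes = full_episodes(config["train_max_steps"], config["episode_length"])
test_episodes = full_episodes(config["test_max_steps"], config["episode_length"])
train_test_ratio = config["train_max_steps"] / config["test_max_steps"]

print(train_episodes, test_episodes, train_test_ratio)  # 600 150 4.0
```

Under that assumption, the training budget covers at most 600 full episodes and the testing budget at most 150, a 4:1 split.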
Train & Test Scripts
Download the Environment
Evaluation results
- Cumulative Distance Reward on hivex-drone-based-reforestation (self-reported): 1.3364013075828551 +/- 0.28799869386679394
- Cumulative Distance Until Tree Drop on hivex-drone-based-reforestation (self-reported): 49.32145935058594 +/- 5.925578414273251
- Cumulative Distance to Existing Trees on hivex-drone-based-reforestation (self-reported): 65.15668449401855 +/- 5.863480554164979
- Cumulative Normalized Distance Until Tree Drop on hivex-drone-based-reforestation (self-reported): 0.1336401303112507 +/- 0.028799868452171418
- Cumulative Tree Drop Reward on hivex-drone-based-reforestation (self-reported): 3.9665349340438842 +/- 0.7359914045841582
- Out of Energy Count on hivex-drone-based-reforestation (self-reported): 0.03556458073668182 +/- 0.01864171812248584
- Recharge Energy Count on hivex-drone-based-reforestation (self-reported): 11.073680667877197 +/- 0.6686583978261852
- Tree Drop Count on hivex-drone-based-reforestation (self-reported): 0.9511495363712311 +/- 0.027165762086272822
- Cumulative Reward on hivex-drone-based-reforestation (self-reported): 101.46505523681641 +/- 3.079002322895952