Edit model card

This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 3 with difficulty 5 using the Proximal Policy Optimization (PPO) algorithm.

Environment: Drone-Based Reforestation
Task: 3
Difficulty: 5
Algorithm: PPO
Episode Length: 2000
Training max_steps: 1200000
Testing max_steps: 300000

Train & Test Scripts
Download the Environment

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning

Evaluation results

Cumulative Distance Reward on hivex-drone-based-reforestation
self-reported

1.317925215959549 +/- 0.28260177110908363
Cumulative Distance Until Tree Drop on hivex-drone-based-reforestation
self-reported

48.28620391845703 +/- 7.283860263327832
Cumulative Distance to Existing Trees on hivex-drone-based-reforestation
self-reported

64.57429847717285 +/- 5.444324231140867
Cumulative Normalized Distance Until Tree Drop on hivex-drone-based-reforestation
self-reported

0.13179252222180365 +/- 0.02826017752675318
Cumulative Tree Drop Reward on hivex-drone-based-reforestation
self-reported

4.009531931877136 +/- 0.661158168654566
Out of Energy Count on hivex-drone-based-reforestation
self-reported

0.03808173710480332 +/- 0.021781055433560147
Recharge Energy Count on hivex-drone-based-reforestation
self-reported

10.746735401153565 +/- 0.6862137746749559
Tree Drop Count on hivex-drone-based-reforestation
self-reported

0.9473729825019837 +/- 0.03268839810742225
Cumulative Reward on hivex-drone-based-reforestation
self-reported

101.15483459472657 +/- 3.824644657818079

View on Papers With Code