Edit model card

This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 3 with difficulty 4 using the Proximal Policy Optimization (PPO) algorithm.

Environment: Drone-Based Reforestation
Task: 3
Difficulty: 4
Algorithm: PPO
Episode Length: 2000
Training max_steps: 1200000
Testing max_steps: 300000

Train & Test Scripts
Download the Environment

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning

Evaluation results

Cumulative Distance Reward on hivex-drone-based-reforestation
self-reported

1.3088074851036071 +/- 0.2150446017537585
Cumulative Distance Until Tree Drop on hivex-drone-based-reforestation
self-reported

48.39169616699219 +/- 5.7290635352219415
Cumulative Distance to Existing Trees on hivex-drone-based-reforestation
self-reported

64.92751655578613 +/- 4.971308759365728
Cumulative Normalized Distance Until Tree Drop on hivex-drone-based-reforestation
self-reported

0.13088074818253517 +/- 0.021504460692074904
Cumulative Tree Drop Reward on hivex-drone-based-reforestation
self-reported

3.975467157363892 +/- 0.5825542394332213
Out of Energy Count on hivex-drone-based-reforestation
self-reported

0.04072725491598248 +/- 0.02455264640086014
Recharge Energy Count on hivex-drone-based-reforestation
self-reported

10.695373344421387 +/- 0.5710744663818678
Tree Drop Count on hivex-drone-based-reforestation
self-reported

0.9469023418426513 +/- 0.033324031370099066
Cumulative Reward on hivex-drone-based-reforestation
self-reported

101.1289744567871 +/- 3.8212373566735938

View on Papers With Code