philippds committed
Commit c6d0442
Parent: 0b6555c

Update README.md

Files changed (1)
  1. README.md +13 -1
README.md CHANGED
@@ -29,4 +29,16 @@ model-index:
      value: 142.19907608032227 +/- 19.368785745326573
      name: "Local Reward"
      verified: true
- ---
+ ---
+
+ This model serves as the baseline for the **Ocean Plastic Collection** environment, trained and tested on task <code>1</code> using the Proximal Policy Optimization (PPO) algorithm.<br>
+ <br>
+ Environment: **Ocean Plastic Collection**<br>
+ Task: <code>1</code><br>
+ Algorithm: <code>PPO</code><br>
+ Episode Length: <code>5000</code><br>
+ Training <code>max_steps</code>: <code>3000000</code><br>
+ Testing <code>max_steps</code>: <code>150000</code><br>
+ <br>
+ Train & Test [Scripts](https://github.com/hivex-research/hivex)<br>
+ Download the [Environment](https://github.com/hivex-research/hivex-environments)
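
For reference, a minimal sketch of how the baseline checkpoint described by this commit might be fetched from the Hugging Face Hub using `huggingface_hub`. The `repo_id` shown is a placeholder assumption, not taken from this commit page; substitute the actual repository name.

```python
# Minimal sketch: download this baseline's files from the Hugging Face Hub.
# NOTE: the repo_id below is a placeholder, not the real repository name.
from huggingface_hub import snapshot_download

local_path = snapshot_download(
    repo_id="philippds/OceanPlasticCollection-task1-PPO-baseline",  # placeholder id
    local_dir="ocean_plastic_collection_baseline",
)
print(f"Baseline files downloaded to: {local_path}")
```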