Update README.md
Browse filesAdded link to paper
README.md
CHANGED
@@ -25,13 +25,28 @@ model-index:
|
|
25 |
This is a trained model of a **A2C** agent playing **PandaReachDense-v2**
|
26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
27 |
|
28 |
-
|
29 |
-
|
|
|
30 |
|
|
|
31 |
|
32 |
-
|
33 |
-
|
34 |
-
|
35 |
|
36 |
-
...
|
37 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
25 |
This is a trained model of a **A2C** agent playing **PandaReachDense-v2**
|
26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
27 |
|
28 |
+
The RL Zoo is a training framework for Stable Baselines3
|
29 |
+
reinforcement learning agents,
|
30 |
+
with hyperparameter optimization and pre-trained agents included.
|
31 |
|
32 |
+
## Usage (with SB3 RL Zoo)
|
33 |
|
34 |
+
RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo<br/>
|
35 |
+
SB3: https://github.com/DLR-RM/stable-baselines3<br/>
|
36 |
+
SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
|
37 |
|
|
|
38 |
```
|
39 |
+
# Download model and save it into the logs/ folder
|
40 |
+
python -m rl_zoo3.load_from_hub --algo tqc --env PandaReachDense-v2 -orga sb3 -f logs/
|
41 |
+
python enjoy.py --algo a2c --env PandaReachDense-v2 -f logs/
|
42 |
+
```
|
43 |
+
|
44 |
+
## Training (with the RL Zoo)
|
45 |
+
```
|
46 |
+
python train.py --algo a2c --env PandaReachDense-v2 -f logs/
|
47 |
+
# Upload the model and generate video (when possible)
|
48 |
+
python -m rl_zoo3.push_to_hub --algo a2c --env PandaReachDense-v2 -f logs/ -orga sb3
|
49 |
+
```
|
50 |
+
|
51 |
+
|
52 |
+
Panda Gym environments: [arxiv.org/abs/2106.13687](https://arxiv.org/abs/2106.13687)
|