alexbalandi's picture
Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter.
3120398
raw
history blame contribute delete
226 Bytes
- OS: Linux-6.2.2-arch1-g14-1-x86_64-with-glibc2.37 # 5 SMP PREEMPT_DYNAMIC Sat, 04 Mar 2023 20:30:14 +0000
- Python: 3.10.9
- Stable-Baselines3: 1.7.0
- PyTorch: 1.13.1+cu117
- GPU Enabled: True
- Numpy: 1.24.2
- Gym: 0.21.0