alexbalandi/ppo-LunarLander-v2-4milsteps-200-envs

Upload trained PPO LunarLander-v2 agent, using 1 million more steps and a looser variance hyperparameter.

3120398 almost 2 years ago

226 Bytes

	- OS: Linux-6.2.2-arch1-g14-1-x86_64-with-glibc2.37 # 5 SMP PREEMPT_DYNAMIC Sat, 04 Mar 2023 20:30:14 +0000
	- Python: 3.10.9
	- Stable-Baselines3: 1.7.0
	- PyTorch: 1.13.1+cu117
	- GPU Enabled: True
	- Numpy: 1.24.2
	- Gym: 0.21.0
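
The version list above can be turned into something machine-readable when trying to recreate a similar environment. The following is a minimal sketch, not part of Stable-Baselines3 or this repository: `parse_system_info` and `pip_pins` are hypothetical helper names, and the mapping from the labels above to pip package names is an assumption.

```python
# Hypothetical helpers for reading the system info listing above.
# Nothing here is provided by Stable-Baselines3 itself.

SYSTEM_INFO = """\
- OS: Linux-6.2.2-arch1-g14-1-x86_64-with-glibc2.37 # 5 SMP PREEMPT_DYNAMIC Sat, 04 Mar 2023 20:30:14 +0000
- Python: 3.10.9
- Stable-Baselines3: 1.7.0
- PyTorch: 1.13.1+cu117
- GPU Enabled: True
- Numpy: 1.24.2
- Gym: 0.21.0
"""


def parse_system_info(text):
    """Parse '- Key: Value' lines into a dict of strings."""
    info = {}
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("- "):
            continue  # skip anything that is not a list entry
        # Split on the first ': ' only, so the timestamp in the OS line survives.
        key, sep, value = line[2:].partition(": ")
        if sep:
            info[key] = value
    return info


def pip_pins(info):
    """Build 'package==version' strings for the Python packages in the listing.

    The label-to-package mapping below is an assumption made for this sketch.
    """
    names = {
        "Stable-Baselines3": "stable-baselines3",
        "PyTorch": "torch",
        "Numpy": "numpy",
        "Gym": "gym",
    }
    return [f"{pkg}=={info[key]}" for key, pkg in names.items() if key in info]


RECORDED = parse_system_info(SYSTEM_INFO)

if __name__ == "__main__":
    for pin in pip_pins(RECORDED):
        print(pin)
```

The `+cu117` suffix on the PyTorch version is a local version label for a CUDA 11.7 build, so that pin would likely need PyTorch's own wheel index rather than the default package index. Whether these exact versions still install cleanly together is not something the listing can confirm.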