ppo-LunarLander-v2 / results.json
agarcia
PPO with 3e6 iterations
d7efe22
raw
history blame
164 Bytes
{"mean_reward": 276.6844665838818, "std_reward": 23.994863457176383, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-29T21:25:44.457497"}