OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20 • 34
Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning Paper • 2311.03736 • Published Nov 7, 2023 • 9
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform Paper • 2310.00036 • Published Sep 29, 2023 • 2
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 242
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms Paper • 2111.08819 • Published Nov 16, 2021 • 2
Secrets of RLHF in Large Language Models Part I: PPO Paper • 2307.04964 • Published Jul 11, 2023 • 28