FLM-101B: An Open LLM and How to Train It with $100K Budget Paper • 2309.03852 • Published Sep 7, 2023 • 43
Efficient RLHF: Reducing the Memory Usage of PPO Paper • 2309.00754 • Published Sep 1, 2023 • 13