Training a Helpful and Harmless Assistant - a vcvcleo Collection

Training a Helpful and Harmless Assistant

Updated Sep 13, 2023

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Paper • 2204.05862 • Published Apr 12, 2022