-
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 75 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper • 2309.00267 • Published • 47 -
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 40 -
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Paper • 2212.09689 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:2308.06259
-
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 40 -
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation
Paper • 2308.03793 • Published • 10 -
From Sparse to Soft Mixtures of Experts
Paper • 2308.00951 • Published • 20 -
Revisiting DETR Pre-training for Object Detection
Paper • 2308.01300 • Published • 9
-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 75 -
One Wide Feedforward is All You Need
Paper • 2309.01826 • Published • 31 -
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 40 -
Shepherd: A Critic for Language Model Generation
Paper • 2308.04592 • Published • 29