Papers to read - General Papers I want to read, at some point. Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation Paper • 2108.12409 • Published Aug 27, 2021 • 5 YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 65 MIMIC-IT: Multi-Modal In-Context Instruction Tuning Paper • 2306.05425 • Published Jun 8, 2023 • 11 Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 43
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation Paper • 2108.12409 • Published Aug 27, 2021 • 5
YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 65
MIMIC-IT: Multi-Modal In-Context Instruction Tuning Paper • 2306.05425 • Published Jun 8, 2023 • 11
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 43
Papers to read - Reinforcement Learning Papers I want to read, at some point. Focused on Reinforcement Learning papers. Deep reinforcement learning from human preferences Paper • 1706.03741 • Published Jun 12, 2017 • 3 Training language models to follow instructions with human feedback Paper • 2203.02155 • Published Mar 4, 2022 • 16 Direct Preference-based Policy Optimization without Reward Modeling Paper • 2301.12842 • Published Jan 30, 2023 Woodpecker: Hallucination Correction for Multimodal Large Language Models Paper • 2310.16045 • Published Oct 24, 2023 • 14
Deep reinforcement learning from human preferences Paper • 1706.03741 • Published Jun 12, 2017 • 3
Training language models to follow instructions with human feedback Paper • 2203.02155 • Published Mar 4, 2022 • 16
Direct Preference-based Policy Optimization without Reward Modeling Paper • 2301.12842 • Published Jan 30, 2023
Woodpecker: Hallucination Correction for Multimodal Large Language Models Paper • 2310.16045 • Published Oct 24, 2023 • 14