DPO datasets for DE Collection A collection of DPO datasets for the DE language. β’ 6 items β’ Updated Apr 15 β’ 1
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method Paper β’ 2402.17193 β’ Published Feb 27 β’ 23
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens Paper β’ 2401.17377 β’ Published Jan 30 β’ 32
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper β’ 2311.00430 β’ Published Nov 1, 2023 β’ 53
ModuleFormer: Learning Modular Large Language Models From Uncurated Data Paper β’ 2306.04640 β’ Published Jun 7, 2023 β’ 7
Self-Alignment with Instruction Backtranslation Paper β’ 2308.06259 β’ Published Aug 11, 2023 β’ 38
Retentive Network: A Successor to Transformer for Large Language Models Paper β’ 2307.08621 β’ Published Jul 17, 2023 β’ 168
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper β’ 2307.01952 β’ Published Jul 4, 2023 β’ 75
TART: A plug-and-play Transformer module for task-agnostic reasoning Paper β’ 2306.07536 β’ Published Jun 13, 2023 β’ 10