Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Paper • 2410.22304 • Published 30 days ago • 15
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models Paper • 2403.07384 • Published Mar 12 • 1
CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning Paper • 2303.03323 • Published Mar 6, 2023 • 1
Unsupervised Learning of Neural Networks to Explain Neural Networks Paper • 1805.07468 • Published May 18, 2018
Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality Paper • 2310.06982 • Published Oct 10, 2023
Robust Learning with Progressive Data Expansion Against Spurious Correlation Paper • 2306.04949 • Published Jun 8, 2023
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models Paper • 2403.07384 • Published Mar 12 • 1
AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies Paper • 2407.17436 • Published Jul 11
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper • 2410.11096 • Published Oct 14 • 12
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper • 2410.11096 • Published Oct 14 • 12