On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper • 2306.13649 • Published Jun 23, 2023
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning Paper • 2407.18248 • Published Jul 25, 2024
LETS-C: Leveraging Language Embedding for Time Series Classification Paper • 2407.06533 • Published Jul 9, 2024