- Grandmaster-Level Chess Without Search
  Paper • 2402.04494 • Published • 65
- Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
  Paper • 2402.04248 • Published • 29
- Self-Play Preference Optimization for Language Model Alignment
  Paper • 2405.00675 • Published • 23
- Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
  Paper • 2404.03715 • Published • 60
Ersi Ni PRO
nilbot
AI & ML interests
Transformers
Organizations
None yet
Collections
3
- BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
  Paper • 2402.04291 • Published • 48
- Self-Discover: Large Language Models Self-Compose Reasoning Structures
  Paper • 2402.03620 • Published • 109
- Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
  Paper • 2402.04248 • Published • 29
- Scaling Laws for Downstream Task Performance of Large Language Models
  Paper • 2402.04177 • Published • 17
Datasets
None public yet