-
Physics of Language Models: Part 1, Context-Free Grammar
Paper • 2305.13673 • Published • 6 -
Physics of Language Models: Part 3.2, Knowledge Manipulation
Paper • 2309.14402 • Published • 5 -
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
Paper • 2404.05405 • Published • 8 -
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Paper • 2309.14316 • Published • 7
Zeyuan Allen-Zhu
zhuzeyuan
AI & ML interests
None yet
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet