-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 39 -
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
Paper • 2311.10642 • Published • 23 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 70
Lejon
Annelies
AI & ML interests
speech recognition
Organizations
None yet
Collections
2
models
None public yet
datasets
None public yet