Models: L Collection Attention-only transformers, sweep over number of layers • 7 items • Updated Oct 18