SAELens

Commit History

fold in scaling by sqrt(d_model) into params
9ff4e7b

Tom Lieberum commited on