@conceptofmind on Hugging Face: "A 1b dense causal language model begins to "saturate" in terms of accuracy…"

conceptofmind

posted an update Jan 28

A 1B-parameter dense causal language model begins to "saturate" in accuracy at around 5 epochs on 1.2T tokens.
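
For a rough sense of scale, here is a small sketch of the arithmetic. It assumes that "1.2T tokens" is the size of the dataset, that each epoch is one full pass over it, and that "1b" means 1e9 parameters. The post does not spell out any of these, and the function name is just illustrative.

```python
def tokens_seen(dataset_tokens: int, epochs: int, n_params: int) -> tuple[int, float]:
    """Return (total tokens processed, tokens processed per parameter)."""
    total = dataset_tokens * epochs  # each epoch is assumed to be one full pass
    return total, total / n_params


# Assumed values: 1.2T-token dataset, 5 epochs, 1e9 parameters.
total, per_param = tokens_seen(1_200_000_000_000, 5, 1_000_000_000)
print(f"total tokens seen: {total:.1e}")
print(f"tokens per parameter: {per_param:.0f}")
```

Under those assumptions the model would have processed about 6T tokens, or roughly 6,000 tokens per parameter, by the point where accuracy starts to flatten.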
