From Babble to Words Collection The models, tokenizers and datasets used for our BabyLM 2024 submission. We have eight prediction files (predictions.json.gz) - the best is BPE-TXT. • 17 items • Updated 10 days ago