--- license: mit language: - en tags: - babylm --- # Lil-Bevo-X Lil-Bevo-X is UT Austin's submission to the BabyLM challenge, specifically the *strict* track. [Link to GitHub Repo](https://github.com/venkatasg/Lil-Bevo) ## Model training regime: 1. 5 epochs on MAESTRO dataset (85M non-language music tokens) combined with strict small dataset. 2. 50 epochs of pretraining with sequence length of 128 on strict dataset. 3. 150 epochs of pretraining with sequence length of 512 on strict dataset. 4. 10 epochs of targeted MLM. This README will be updated with more details soon.