license: mit

This is the fastText pretraining data filter targeting the LAMBADA DE task, discussed in the main text of the Perplexity Correlations paper: https://arxiv.org/abs/2409.05816
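The following is a minimal sketch of how a fastText classifier like this one might be applied as a document filter. It is not taken from the paper or this repository: the file name `model.bin`, the label string `__label__include`, and the 0.5 threshold are all assumptions for illustration, so check the actual model file and its labels before relying on them. Only `fasttext.load_model` and `model.predict` are real library calls here; the helper names are hypothetical.

```python
from typing import Callable, Iterable, List, Sequence, Tuple

# Hypothetical values: neither the file name nor the label string
# is stated in this README.
MODEL_PATH = "model.bin"
INCLUDE_LABEL = "__label__include"

# A predictor maps one line of text to (labels, probabilities).
Predict = Callable[[str], Tuple[Sequence[str], Sequence[float]]]


def load_predictor(path: str = MODEL_PATH) -> Predict:
    """Load a fastText model and return a single-text predict function."""
    import fasttext  # imported lazily so the helpers below work without it

    model = fasttext.load_model(path)
    return lambda text: model.predict(text, k=1)


def keep_document(text: str, predict: Predict, threshold: float = 0.5) -> bool:
    """Return True if the top label is the assumed include label
    and its probability is at least the threshold."""
    # fastText's predict rejects newlines, so collapse all whitespace first.
    flat = " ".join(text.split())
    if not flat:
        return False
    labels, probs = predict(flat)
    return labels[0] == INCLUDE_LABEL and float(probs[0]) >= threshold


def filter_corpus(
    docs: Iterable[str], predict: Predict, threshold: float = 0.5
) -> List[str]:
    """Keep only the documents that pass keep_document."""
    return [d for d in docs if keep_document(d, predict, threshold)]
```

With the model file downloaded, usage would look like `kept = filter_corpus(docs, load_predictor())`. Passing the predictor in as a function keeps the filtering logic easy to test with a stub in place of the real model.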