File size: 194 Bytes
cc36ba5
 
 
 
 
 
1
2
3
4
5
6
---
license: mit
---
This is the fastText pretraining data filter targeting
the LAMBADA IT task, discussed in the main text of the Perplexity
Correlations paper: https://arxiv.org/abs/2409.05816