← back
x.comCody BlakeneySat, May 30, 2026, 10:51 AM PDT
score 16.2
150likes15RT3reply

Ultra-FineWeb paper shows how to filter and improve training data quality

Original: The Ultra-FineWeb paper is a pretty exceptional manual for thinking about tiers of data, how to do quality filtering, and rephrasing.

Source: x.com

Writing ELI5 summary…