x.comCody BlakeneySat, May 30, 2026, 10:51 AM PDT
score 16.2
150likes15RT3reply
Ultra-FineWeb paper shows how to filter and improve training data quality
Original: The Ultra-FineWeb paper is a pretty exceptional manual for thinking about tiers of data, how to do quality filtering, and rephrasing.
Source: x.com ↗
Writing ELI5 summary…