← back
arXivJaeung Lee, Dohyun Kim, Jaemin JoSat, May 23, 2026, 7:52 AM PDT
score 15.5

New metric audits whether AI models truly forgot sensitive information

Original: Measuring the Depth of LLM Unlearning via Activation Patching

Source: arxiv.org

Writing ELI5 summary…