New metric audits whether AI models truly forgot sensitive information

Original: Measuring the Depth of LLM Unlearning via Activation Patching

Writing ELI5 summary…