New LLM Architecture Enables Native Unlearning of Specific Data Sources.

Gaurav R. Ghosal, Pratyush Maini, Aditi Raghunathan· June 15, 2026 View original

Summary

Researchers propose NULLs, a novel LLM architecture that allows for the native unlearning of specific training data sources without gradient updates or access to retained data. This model class isolates source-specific contributions while maintaining joint learning across sources.

Large Language Models typically struggle with "unlearning" specific data because the contributions of different training sources are deeply intertwined. This new research introduces NULLs (Natively Unlearnable LLMs), an architecture designed to address this challenge by isolating source-specific information. It achieves this by training a shared "backbone" of neurons alongside a pool of sparsely activated "sinks" for each data source. During training, unique information from a source naturally concentrates in its dedicated sinks, while shared knowledge accumulates in the backbone. This structural separation allows for a source to be "unlearned" simply by disabling its corresponding sinks at deployment, eliminating the need for complex gradient updates or access to the original training data. The framework has been successfully scaled to large datasets like Wikipedia, demonstrating its ability to remove specific knowledge (e.g., a single article) while preserving related facts. It also shows robustness against adversarial extraction and relearning, suggesting that unlearning can be a built-in feature of LLMs from the outset.

Why it matters

This innovation offers a robust solution for data privacy and compliance in AI, allowing organizations to easily remove sensitive or outdated information from deployed LLMs without costly retraining. It also enhances model control and adaptability, making LLMs more manageable in dynamic data environments.

How to implement this in your domain

  1. 1Evaluate current LLM deployment strategies for data privacy and compliance requirements.
  2. 2Investigate the NULLs architecture for future LLM development or fine-tuning projects.
  3. 3Develop internal policies for data retention and removal in AI systems, leveraging unlearning capabilities.
  4. 4Collaborate with research teams to explore integrating native unlearning into custom LLM solutions.

Who benefits

Data PrivacyHealthcareFinanceLegalGovernment

Key takeaways

  • NULLs enable native unlearning in LLMs by isolating source-specific data contributions.
  • The architecture uses shared backbone neurons and sparsely activated sinks for each data source.
  • Unlearning is achieved by disabling specific sinks, requiring no gradient updates or retained data.
  • This approach improves data privacy, compliance, and model adaptability without sacrificing performance.

Original post by Gaurav R. Ghosal, Pratyush Maini, Aditi Raghunathan

"arXiv:2606.13873v1 Announce Type: new Abstract: Unlearning aims to remove the influence of specific training data sources, but this has proved challenging because the contributions of different sources are entangled within the model. Isolating source contributions to disjoint par…"

View on X

Originally posted by Gaurav R. Ghosal, Pratyush Maini, Aditi Raghunathan on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses