Stolen AI Models May Lack Practical Equivalence, Research Suggests

Eliott Baltz, Satoshi Hara, Ulrich A\"ivodji· June 16, 2026 View original

▶ The 60-second brief

Summary

New research challenges the assumption that high-fidelity stolen AI models are practically equivalent to their originals. The study finds that despite similar accuracy, surrogate models can vary significantly in other critical performance metrics due to model multiplicity.

A recent research paper introduces a new perspective on model stealing attacks, arguing that simply achieving high fidelity in a surrogate model does not guarantee it will be functionally equivalent to the original service. The authors contend that because query-based extraction provides only partial information, many near-optimal surrogate models can exist, all with comparable fidelity but differing in other crucial deployment-relevant properties. Instead of focusing solely on classic learning-based model stealing, the research evaluates the 'Rashomon Set' of surrogate models, which is the collection of almost-equally-accurate models. By analyzing the diversity within this set using multiplicity and fairness metrics, the study reveals that surrogates, despite matching the target model's fidelity, can exhibit substantial variations in other critical performance aspects across various tasks and datasets. These findings suggest that the conventional wisdom regarding the economic leverage and practical equivalence of high-fidelity stolen models might be flawed. The inherent variability among seemingly similar models could diminish the real-world utility of such stolen intellectual property for adversaries.

Why it matters

This research is crucial for professionals involved in AI security and intellectual property, as it redefines the perceived threat of model stealing. It suggests that protecting AI models might involve more than just preventing fidelity replication, requiring a deeper understanding of model behavior and diversity.

How to implement this in your domain

  1. 1Re-evaluate the risks of model stealing attacks beyond just fidelity metrics.
  2. 2Implement diverse evaluation criteria for AI models to detect subtle differences in performance.
  3. 3Develop robust intellectual property protection strategies that account for model multiplicity.
  4. 4Consider the 'Rashomon Set' concept when assessing the uniqueness and security of deployed AI models.

Who benefits

AI/ML Service ProvidersCybersecurityLegalTech

Key takeaways

  • High-fidelity stolen AI models may not be functionally equivalent to their originals.
  • Model multiplicity means many models can achieve similar accuracy but differ in other properties.
  • Evaluating model stealing requires assessing a broader range of performance metrics.
  • The 'Rashomon Set' concept helps understand the inherent diversity among accurate models.

Original post by Eliott Baltz, Satoshi Hara, Ulrich A\"ivodji

"arXiv:2606.15493v1 Announce Type: new Abstract: Model stealing attacks, where adversaries create high-fidelity surrogate models, are a significant threat to the intellectual property of machine learning services. Conventional wisdom suggests these surrogates could provide adversa…"

View on X

Originally posted by Eliott Baltz, Satoshi Hara, Ulrich A\"ivodji on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses