Can Language Models Independently Discover the Concept of Ze

Can Language Models Independently Discover the Concept of Zero?

Phoebe Zeng, Thomas L. Griffiths, Brenden M. Lake· June 17, 2026 View original

Summary

This research investigates whether AI language models can independently discover the mathematical concept of "zero" through out-of-distribution generalization. Findings show that while smaller GPT-2 models cannot generalize this concept without specific training, language pretraining significantly reduces the number of examples needed for discovery.

A fundamental question in the development of AI systems, particularly those based on artificial neural networks, is their capacity for mathematical discovery and their ability to generalize beyond their training data. This involves hypothesizing genuinely new, and potentially more powerful, mathematical structures. Human cognition suggests that language abilities might support such generalizations. This study uses simple arithmetic as a case study to explore how modern AI models might expand their mathematical understanding, specifically examining if they can independently discover the concept of "zero." The research reveals two key findings. First, language models of a GPT-2 size are unable to perform this generalization at test time, regardless of whether they underwent language pretraining. This indicates a limitation in their ability to spontaneously infer a novel mathematical concept without direct exposure. Second, the models' performance substantially improves after being trained on even a relatively small number of examples (tens or hundreds) involving the concept of zero. Furthermore, the study found that prior language pretraining significantly reduces the number of required examples by approximately 50%. This suggests that while language models may not discover "zero" entirely on their own, their existing language abilities can act as a scaffold, making the process of mathematical discovery more efficient once some initial examples are provided.

Why it matters

This research sheds light on the fundamental capabilities and limitations of current AI models regarding mathematical reasoning and out-of-distribution generalization. For AI developers, it informs strategies for training models that need to grasp abstract concepts, highlighting the interplay between language abilities and mathematical understanding, and the necessity of targeted data for novel concept acquisition.

How to implement this in your domain

1Design training curricula for AI models that explicitly introduce foundational mathematical concepts like zero.
2Leverage language pretraining to accelerate the learning of new mathematical or abstract concepts in AI.
3Evaluate AI models for their ability to generalize out-of-distribution, especially for novel conceptual understanding.
4Investigate the role of different data modalities (e.g., symbolic, linguistic) in fostering mathematical discovery in AI.
5Develop benchmarks specifically designed to test an AI's capacity for genuine conceptual innovation beyond rote learning.

Who benefits

AI ResearchEdTechSoftware DevelopmentScientific ComputingCognitive Science

Key takeaways

GPT-2 sized language models cannot independently discover the concept of zero.
Models improve significantly after training on a small number of examples of zero.
Language pretraining reduces the required training examples by about 50%.
Language abilities can scaffold mathematical discovery in neural models, but direct exposure is often needed.

Original post by Phoebe Zeng, Thomas L. Griffiths, Brenden M. Lake

"arXiv:2606.17289v1 Announce Type: new Abstract: AI systems based on artificial neural networks are being developed with aspirations of pushing the boundary of human mathematical knowledge. A key question for these systems is how much they can reach beyond their training data. Mat…"

View on X

Originally posted by Phoebe Zeng, Thomas L. Griffiths, Brenden M. Lake on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

Can Language Models Independently Discover the Concept of Zero?

Why it matters

How to implement this in your domain

Who benefits

Key takeaways

Want to go deeper?

More in AI Research

VISReg Enhances JEPA Training with Novel Regularization

Margaret Atwood Criticizes AI for "Garbage In, Garbage Out" Flaw

Podcast Explores Large Test-Time Compute and AI Model Budgets