Scaling Law Limits: Data-Driven ML Fails Symbolic Logical Reasoning.

Tiansi Dong, Mateja Jamnik, Pietro Li\`o· June 26, 2026 View original

▶ The 2-minute explainer

Summary

This paper argues that data-driven machine learning, even with scaling, cannot achieve symbolic-level syllogistic reasoning due to methodological limitations like insufficient training data to distinguish all valid syllogisms and contradictory training targets. Experiments with Euler Net and ChatGPT support this, showing surface form affects performance and even 100% accuracy doesn't guarantee rigorous reasoning.

This research posits that data-driven machine learning systems, despite advancements in scaling laws, are fundamentally limited in their ability to achieve rigorous symbolic-level logical reasoning. The authors identify two primary methodological constraints preventing this: first, training data often lacks the granularity to differentiate all valid syllogistic reasoning types; second, end-to-end mapping from premises to conclusions can introduce conflicting training signals for neural components responsible for pattern recognition versus logical inference. To support their theoretical analysis, the paper presents experimental evidence. It demonstrates that Euler Net, a neural network designed for logical reasoning, fails to achieve rigorous syllogistic reasoning. Furthermore, tests on recent ChatGPT models (GPT-5-nano and GPT-5) reveal that the surface form of syllogistic statements—whether presented as words, double words, simple symbols, or long random symbols—significantly impacts reasoning performance. While GPT-5 might achieve 100% accuracy on certain tasks, the study highlights that its explanations can still be incorrect, indicating a lack of true symbolic understanding. The conclusion is that supervised machine learning, even when achieving high accuracy, does not attain the same level of rigor as symbolic logical reasoning, suggesting a ceiling to the scaling law's effectiveness in this domain.

Why it matters

This research challenges the prevailing belief that simply scaling up data and models will lead to human-level logical reasoning in AI, forcing a re-evaluation of current AI development strategies for tasks requiring true symbolic understanding. It highlights a fundamental limitation for data-driven approaches.

How to implement this in your domain

  1. 1Re-evaluate AI project requirements to distinguish between pattern recognition tasks and those demanding true symbolic logical reasoning.
  2. 2Explore hybrid AI architectures that combine data-driven models with symbolic reasoning components for complex logical tasks.
  3. 3Develop more robust evaluation metrics that go beyond accuracy to assess the rigor and explainability of AI's logical inferences.
  4. 4Investigate alternative training paradigms that can address the identified limitations of data-driven approaches for symbolic reasoning.
  5. 5Educate teams on the inherent limitations of current LLMs for tasks requiring deep logical understanding, even when they appear to perform well.

Who benefits

AI ResearchSoftware DevelopmentEducationLegalTechFinance

Key takeaways

  • Data-driven ML faces fundamental limits in achieving symbolic logical reasoning.
  • Training data and contradictory targets hinder rigorous syllogistic reasoning.
  • Even high accuracy in LLMs doesn't guarantee true symbolic understanding or correct explanations.
  • Hybrid AI approaches may be necessary for tasks requiring deep logical inference.

Original post by Tiansi Dong, Mateja Jamnik, Pietro Li\`o

"arXiv:2606.26454v1 Announce Type: new Abstract: Sphere neural networks have achieved symbolic level syllogistic reasoning without training data, raising the question of where the limit of the scaling law for logical reasoning lies, i.e., whether data-driven machine learning syste…"

View on X

Originally posted by Tiansi Dong, Mateja Jamnik, Pietro Li\`o on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses