ITNet Unifies AI Architectures: A Learnable Integral Transform for All Models

Ashim Dhor, Rasel Mondal, Pin Yu Chen· June 19, 2026 View original

Summary

This paper introduces the Integral Transform Network (ITNet), a unified AI architecture based on a learnable integral transform that subsumes convolutional networks, recurrent networks, and transformers. ITNet uses a small neural network to model pairwise interactions, adapting its behavior from data and matching or exceeding specialized baselines across various modalities.

For decades, major neural network architectures like convolutional networks, recurrent networks, and transformers have been considered mathematically distinct, each embodying specific inductive biases such as locality, sequential memory, and content-dependent interactions. This research challenges that fragmentation by proposing that these diverse architectures are, in fact, incomplete views of a single, underlying mathematical object: a learnable integral transform. The Integral Transform Network (ITNet) is introduced as a unified architecture built around this concept. At its core is a learnable kernel, implemented as a small neural network (specifically, an MLP), which models pairwise interactions between data points. This design allows ITNet to adapt its processing behavior directly from the data, rather than relying on pre-defined architectural biases. The paper demonstrates that convolution, self-attention (including multi-head), and various forms of autoregressive recurrence (such as LSTM, GRU, S4, and Mamba) can all emerge as special cases of ITNet under appropriate parameterizations. This establishes ITNet as a universal approximator of continuous operators. To ensure practical applicability, the authors developed efficient computational techniques like tiled kernel fusion, importance-weighted Monte Carlo integration, and learned low-rank factorization. Benchmarking shows that a single ITNet architecture, with a shared operator and lightweight modality-specific encoders, performs comparably to or better than specialized baselines across diverse tasks including ImageNet-1K, GLUE, ModelNet40, VQA v2, and NLVR2.

Why it matters

ITNet offers a paradigm shift in AI architecture design, potentially simplifying the development process by providing a single, flexible framework that can adapt to various data types and tasks. For AI engineers and researchers, this could lead to more efficient model development, reduced architectural complexity, and a deeper theoretical understanding of how different neural network components function.

How to implement this in your domain

  1. 1Explore ITNet as a foundational architecture for new AI model development, aiming for unified solutions across modalities.
  2. 2Investigate replacing specialized convolutional, recurrent, or attention layers with ITNet's learnable integral transform.
  3. 3Apply ITNet's principles to tasks requiring diverse inductive biases, such as image, text, and graph processing.
  4. 4Utilize the proposed computational optimizations (tiled kernel fusion, Monte Carlo integration, low-rank factorization) for efficient implementation.
  5. 5Contribute to or adopt open-source implementations of ITNet to accelerate research and development.

Who benefits

AI EngineeringSoftware DevelopmentResearch & DevelopmentRoboticsAutonomous Systems

Key takeaways

  • ITNet unifies diverse neural network architectures (convolution, attention, recurrence) under a single learnable integral transform.
  • Its core is a learnable kernel, implemented as a small MLP, that models pairwise interactions.
  • ITNet is a universal approximator of continuous operators and can recover specialized behaviors from data.
  • Efficient computational techniques make ITNet practical and scalable.

Original post by Ashim Dhor, Rasel Mondal, Pin Yu Chen

"arXiv:2606.19538v1 Announce Type: new Abstract: Convolutional networks, recurrent networks, and transformers each encode different inductive biases -- locality, sequential memory, and content-dependent pairwise interaction -- and have remained mathematically distinct since their…"

View on X

Originally posted by Ashim Dhor, Rasel Mondal, Pin Yu Chen on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses