On-Device NAS Optimizes Tiny Neural Architectures

Andrea Mattia Garavagno, Edoardo Ragusa, Paolo Gastaldo, Antonio Frisoli, Claudio Loconsole· June 25, 2026 View original

▶ The 2-minute explainer

Summary

This paper introduces a new approach for near-sensor computing where a lightweight Neural Architecture Search (NAS) is performed directly on the deployment device to find optimal tiny neural architectures. Validated on Italian Sign Language and fault diagnosis datasets, this method significantly reduces RAM occupancy and improves accuracy compared to state-of-the-art solutions on embedded systems like Raspberry Pi 4.

Researchers have proposed an innovative method for near-sensor computing, enabling Neural Architecture Search (NAS) to be executed directly on the deployment device. This on-device NAS aims to identify the most efficient and compact neural architecture specifically tailored for analyzing real-time sensor data. The approach is particularly beneficial for applications like human-machine interfaces, where neural networks can be dynamically redesigned for individual users, effectively mitigating inter-individual data variations. A new lightweight NAS was designed and rigorously validated using the Italian Sign Language (ISL) dataset, which comprises surface electromyography (sEMG) signals, and the Case Western Reserve University (CWRU) dataset for intelligent fault diagnosis. When tested on a Raspberry Pi 4, the proposed NAS demonstrated superior performance, achieving 0.63 times less RAM occupancy and 5.96 percentage points higher accuracy for the ISL dataset, and 0.44 times less RAM and 0.2 percentage points higher accuracy for the CWRU dataset, surpassing existing state-of-the-art solutions.

Why it matters

Professionals in edge AI, IoT, and embedded systems can leverage this on-device NAS to deploy highly optimized and personalized AI models directly on resource-constrained devices. This leads to significant improvements in efficiency, accuracy, and adaptability for real-time data analysis, opening new possibilities for personalized and robust edge applications.

How to implement this in your domain

  1. 1Integrate lightweight NAS capabilities into edge AI development workflows for optimizing models directly on target hardware.
  2. 2Apply on-device NAS to personalize human-machine interfaces by adapting neural networks to individual user biometrics.
  3. 3Utilize this approach for real-time fault diagnosis in industrial IoT settings, optimizing models for specific machinery.
  4. 4Explore deploying the proposed NAS on various embedded systems to evaluate its performance and resource efficiency for custom applications.
  5. 5Develop frameworks that enable dynamic neural network redesign on-device for enhanced adaptability and performance.

Who benefits

IoTWearablesIndustrial AutomationHealthcareRobotics

Key takeaways

  • On-device NAS optimizes neural architectures directly on deployment devices.
  • It significantly reduces RAM usage and improves accuracy for tiny models.
  • The approach is beneficial for personalized applications like human-machine interfaces.
  • Validated on sEMG and fault diagnosis datasets, outperforming state-of-the-art.

Original post by Andrea Mattia Garavagno, Edoardo Ragusa, Paolo Gastaldo, Antonio Frisoli, Claudio Loconsole

"arXiv:2606.24900v1 Announce Type: new Abstract: This paper proposes a new approach to near-sensor computing, in which a lightweight Neural Architecture Search (NAS) is performed directly on the deployment device to find the best tiny neural architecture for analyzing the real-tim…"

View on X

Originally posted by Andrea Mattia Garavagno, Edoardo Ragusa, Paolo Gastaldo, Antonio Frisoli, Claudio Loconsole on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses