LLM-Based Framework Boosts Bearing Fault Diagnosis with Limi

LLM-Based Framework Boosts Bearing Fault Diagnosis with Limited Data

Jinghan Wang, Feng Cheng, Wentao Wu, Hang Li, Gaoliang Peng, Tianchen Liu· June 24, 2026 View original

Summary

A new two-stage transfer learning framework, utilizing a lightweight GPT-2-style Transformer, significantly improves cross-domain bearing fault diagnosis. It achieves 92.61% average accuracy with only 10% labeled target data by explicitly transferring knowledge through pre-trained encoder weights and fault prototype embeddings, outperforming state-of-the-art methods.

Bearing fault diagnosis in industrial settings faces significant hurdles due to dataset heterogeneity, varying operating conditions, and a scarcity of labeled data. Current diagnostic approaches often tackle these issues in isolation and rely on implicit feature alignment, which limits their effectiveness when multiple challenges occur simultaneously. This paper introduces a novel knowledge-guided two-stage transfer learning framework designed to address these complex problems. The framework employs a lightweight GPT-2-style Transformer, featuring causal self-attention, for hierarchical feature extraction from vibration signals. It establishes explicit knowledge transfer pathways where pre-trained encoder weights and fault prototype embeddings carry information from multi-source pre-training to target adaptation. The proposed framework effectively handles the dual-shift challenge through multi-source learning for generalizable representations, prototype-based knowledge modulation for adapting to target domains, and taxonomy-adaptive classification for seamless transfer across diverse fault categories. Experimental validation across four real-world datasets demonstrated an impressive 92.61% average accuracy using only 10% labeled target data, surpassing state-of-the-art methods by 17.24 percentage points. This breakthrough offers a practical and cost-effective solution for predictive maintenance in Industry 4.0 applications.

Why it matters

Manufacturing and industrial professionals can leverage this framework to implement highly accurate and cost-effective predictive maintenance systems for critical machinery, even with limited historical fault data, significantly reducing downtime and operational costs.

How to implement this in your domain

1Adopt the two-stage Transformer framework for predictive maintenance in industrial machinery, especially for bearing fault diagnosis.
2Integrate lightweight GPT-2-style Transformers for hierarchical feature extraction from vibration data.
3Utilize multi-source pre-training and prototype-based knowledge modulation to adapt models to new operating conditions.
4Pilot the framework on a subset of critical assets to validate its performance with limited labeled data.

Who benefits

ManufacturingIndustrial IoTAutomotiveEnergyAerospace

Key takeaways

A new LLM-based framework significantly improves bearing fault diagnosis.
It excels in cross-domain scenarios with limited labeled data.
The framework uses a two-stage transfer learning approach with a lightweight Transformer.
Achieves high accuracy, outperforming existing methods for predictive maintenance.

Original post by Jinghan Wang, Feng Cheng, Wentao Wu, Hang Li, Gaoliang Peng, Tianchen Liu

"arXiv:2606.24459v1 Announce Type: new Abstract: Bearing fault diagnosis faces critical challenges when dataset heterogeneity, operating condition variations, and limited labeled data occur simultaneously in industrial environments. Existing approaches address these issues in isol…"

View on X

Originally posted by Jinghan Wang, Feng Cheng, Wentao Wu, Hang Li, Gaoliang Peng, Tianchen Liu on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

LLM-Based Framework Boosts Bearing Fault Diagnosis with Limited Data

Why it matters

How to implement this in your domain

Who benefits

Key takeaways

Want to go deeper?

More in AI Engineering & DevTools

AI-Powered Development Workflow Integrates Multiple Models

Proposing AI Usage Transparency for Credible Commentary

MCP and A2A Protocols Standardize Agentic Internet Development