MOSAIC Improves Knowledge Tracing with LLM Semantic Alignment

Xinjin Li, Mengyue Wang, Yuzhen Lin, Pengbin Feng, Ziqi Sha, Yeyang Zhou, Yu Ma· June 30, 2026 View original

Summary

MOSAIC is a new framework that enhances knowledge tracing by orchestrating LLM-driven semantic alignment with sequential modeling, capturing collaborative signals and hierarchical knowledge dependencies. It achieves state-of-the-art results across multiple educational datasets by generating dynamic, context-aware embeddings and prediction prompts.

Knowledge Tracing (KT), a crucial component of personalized education, traditionally faces limitations due to its reliance on shallow, ID-based representations and single-granularity mastery estimation. These issues neglect the semantic depth of knowledge and hierarchical dependencies between concepts. This research introduces MOSAIC (Multi-granularity Online Semantic AI for Collaborative Knowledge), a novel framework designed to overcome these challenges. MOSAIC orchestrates large language model (LLM)-driven semantic alignment with sequential modeling. Unlike methods that use LLMs solely for prediction, MOSAIC leverages a frozen LLM to generate dynamic, context-aware embeddings and hierarchical prediction prompts. This approach explicitly captures collaborative signals and peer interactions, enriching the understanding of student learning. Furthermore, the framework incorporates a cross-granularity consistency objective, which jointly regularizes mastery estimation across different levels: individual concepts, topic clusters, and overall proficiency. Extensive experiments on datasets like ASSISTments, EdNet, and a new large-scale MOOC dataset demonstrate MOSAIC's superior performance, achieving AUC improvements of up to 3.4% and Accuracy gains of up to 2.5%. MOSAIC also shows enhanced robustness in collaborative and long-sequence learning scenarios, offering both high predictive precision and semantically grounded interpretability.

Why it matters

EdTech professionals can leverage MOSAIC to create more personalized, accurate, and semantically rich educational experiences, improving student mastery prediction and adaptive learning systems by understanding knowledge at multiple granularities.

How to implement this in your domain

  1. 1Evaluate current knowledge tracing systems for their ability to capture semantic depth and hierarchical knowledge.
  2. 2Explore integrating LLM-driven embedding generation into educational platforms for richer student interaction analysis.
  3. 3Design adaptive learning pathways that leverage multi-granularity mastery estimation for personalized content delivery.
  4. 4Pilot MOSAIC or similar frameworks in online learning environments to assess improvements in student engagement and learning outcomes.
  5. 5Collaborate with AI researchers to adapt and refine knowledge tracing models for specific educational contexts and curricula.

Who benefits

EdTechEducationCorporate Learning & DevelopmentOnline Learning Platforms

Key takeaways

  • MOSAIC improves knowledge tracing using LLM-driven semantic alignment.
  • It captures collaborative signals and hierarchical knowledge dependencies.
  • The framework achieves state-of-the-art results across multiple educational datasets.
  • MOSAIC offers high predictive precision and semantically grounded interpretability.

Original post by Xinjin Li, Mengyue Wang, Yuzhen Lin, Pengbin Feng, Ziqi Sha, Yeyang Zhou, Yu Ma

"arXiv:2606.29049v1 Announce Type: new Abstract: Knowledge Tracing (KT) is important for personalized education but traditionally suffers from two key limitations: a reliance on shallow ID-based representations that neglect semantic depth and a restriction to single-granularity ma…"

View on X

Originally posted by Xinjin Li, Mengyue Wang, Yuzhen Lin, Pengbin Feng, Ziqi Sha, Yeyang Zhou, Yu Ma on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses