LLMs Learn Transient Semantic Structure Despite One-Hot Training

Yize Zhao, Isabel Papadimitriou, Christos Thrampoulidis· June 26, 2026 View original

Summary

Research shows that language models develop transient semantic geometry early in training, clustering representations by shared attributes despite one-hot next-token prediction. This structure eventually collapses to a symmetric state with sufficient capacity and time.

Neural Collapse theory suggests that balanced one-hot classification pushes model representations into a symmetric configuration, where they are equally distant from each other, ignoring semantic similarities in inputs. This presents a puzzle for next-token prediction language models, which are predominantly trained with one-hot labels but clearly learn rich latent semantic structures. This study investigates how gradient descent manages to find such categorical semantic structure when co-occurrence statistics, under a one-hot regime, would seem to eliminate shared next-tokens among different contexts. Using synthetic controlled settings, researchers found that semantic geometry emerges early in the training process, causing representations to cluster by shared attributes even without explicit supervision for this. However, this emergent structure is transient. Given sufficient model capacity and training time, the representations eventually converge to the symmetric state predicted by Neural Collapse, where all representations are equally separated. The research analyzes this phase transition using Gram matrix analysis and proposes a preliminary modification to the unconstrained features model to better capture this transient semantic geometry.

Why it matters

Understanding how LLMs learn and retain semantic structure, even transiently, is crucial for developing more robust and interpretable models. Professionals can leverage these insights to design training regimes that preserve desired semantic properties or to better diagnose model behavior.

How to implement this in your domain

  1. 1Analyze the Gram matrices of your LLM embeddings during different training phases to observe semantic geometry.
  2. 2Experiment with early stopping or regularization techniques to potentially preserve transient semantic structures.
  3. 3Consider modifying training objectives or model architectures to explicitly encourage or maintain semantic clustering.
  4. 4Use insights into semantic geometry to improve interpretability or steer the latent space of your language models.

Who benefits

AI/ML DevelopmentNatural Language ProcessingResearch & AcademiaSoftware Engineering

Key takeaways

  • LLMs learn transient semantic structure early in training despite one-hot labels.
  • Representations cluster by shared attributes without explicit supervision.
  • This semantic geometry eventually collapses to a symmetric state with more training.
  • Understanding this phase transition can inform model design and training.

Original post by Yize Zhao, Isabel Papadimitriou, Christos Thrampoulidis

"arXiv:2606.26749v1 Announce Type: new Abstract: Neural Collapse predicts that balanced one-hot classification pushes model representations to be equally far from each other; a symmetric configuration that depends only on the output label and ignores any semantic similarity in the…"

View on X

Originally posted by Yize Zhao, Isabel Papadimitriou, Christos Thrampoulidis on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses