MO-DiT+HPPO Enhances Generative Retrieval for Pattern-Preser

MO-DiT+HPPO Enhances Generative Retrieval for Pattern-Preserving Attributes

Chenghao Liu, Yu Zhang, Zhongtao Jiang, Kun Xu, Zhenwei An, Renzhi Wang, Zhao Wang, Jiachen Zhang, Yuxiao Zhang, Kun Xu, Songfang Huang· June 26, 2026 View original

▶ The 2-minute explainer

Summary

Researchers introduce MO-DiT+HPPO, a framework for generative retrieval that addresses pattern-preserving attribute retrieval by training a Diffusion Transformer with metric-ordered sequences and hybrid-policy preference optimization. This improves finding items that match a pattern and a target attribute.

A new research paper presents MO-DiT+HPPO, a sophisticated framework designed to improve generative retrieval, particularly for "pattern-preserving attribute retrieval." This challenging task requires finding items that not only possess a specific target attribute but also adhere to a fine-grained pattern expressed by a seed set of items. Traditional embedding-based retrieval often struggles with this dual objective, as preserving the pattern can lead to low-attribute regions, while global attribute retrieval might drift to unrelated patterns. MO-DiT+HPPO tackles this by using a continuous generative retrieval approach, where a Diffusion Transformer model learns to generate query embeddings for nearest-neighbor search. The framework employs a staged training process: initial raw-sequence pretraining, followed by multi-domain metric-ordered continuation pretraining. This metric-ordered training converts sparse online retrieval labels into ordered trajectories, teaching the model to improve metrics across domains. The final stage involves "Hybrid-Policy Preference Optimization (HPPO)," which aligns the generated query distribution with the true online objective. HPPO uses a hybrid candidate pool labeled with an online intersection metric and applies reference-anchored preference optimization. A Pareto pair filter ensures that only winner pairs that do not compromise same-pattern purity are kept, thereby raising the attribute metric without sacrificing the pattern. Experiments across four attribute domains show significant gains in the intersection metric, demonstrating the effectiveness of metric-ordered DiT and HPPO.

Why it matters

This advancement is critical for e-commerce, content recommendation, and information retrieval systems that need to provide highly relevant results based on complex user preferences and specific item attributes, moving beyond simple similarity searches.

How to implement this in your domain

1Investigate MO-DiT+HPPO for enhancing product recommendation engines to suggest items that fit specific user patterns and desired attributes.
2Apply metric-ordered sequence training to improve the relevance and diversity of search results in specialized databases.
3Explore using Hybrid-Policy Preference Optimization to fine-tune generative models for more precise content curation.
4Develop new retrieval systems that can balance pattern preservation with attribute targeting for complex information needs.

Who benefits

E-commerceContent PlatformsInformation RetrievalAdvertisingData Science

Key takeaways

Pattern-preserving attribute retrieval is a challenging task for generative models.
MO-DiT+HPPO uses metric-ordered training and preference optimization to address this.
Metric-ordered training teaches models to improve metrics across domains.
HPPO aligns generated queries with online objectives, improving attribute metrics without sacrificing pattern purity.

Original post by Chenghao Liu, Yu Zhang, Zhongtao Jiang, Kun Xu, Zhenwei An, Renzhi Wang, Zhao Wang, Jiachen Zhang, Yuxiao Zhang, Kun Xu, Songfang Huang

"arXiv:2606.26899v1 Announce Type: new Abstract: Embedding-based retrieval ranks items by their similarity to a query in a shared vector space and usually aims to return the highest-scoring items. In many production settings this is not what is wanted: given a seed set that expres…"

View on X

Originally posted by Chenghao Liu, Yu Zhang, Zhongtao Jiang, Kun Xu, Zhenwei An, Renzhi Wang, Zhao Wang, Jiachen Zhang, Yuxiao Zhang, Kun Xu, Songfang Huang on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

MO-DiT+HPPO Enhances Generative Retrieval for Pattern-Preserving Attributes

Why it matters

How to implement this in your domain

Who benefits

Key takeaways

Want to go deeper?

More in AI Research

VISReg Enhances JEPA Training with Novel Regularization

Margaret Atwood Criticizes AI for "Garbage In, Garbage Out" Flaw

Podcast Explores Large Test-Time Compute and AI Model Budgets