LLMs Extract User Preferences for Privacy-Preserving Recommendation Systems

Abhirup Dasgupta, Fernando Spadea, Oshani Seneviratne· July 2, 2026 View original

▶ The 2-minute explainer

Summary

Researchers developed a pipeline to extract structured user-preference triples from conversational data using lightweight Large Language Models (LLMs) for Personal Knowledge Graph (PKG) construction. The study evaluates Qwen- and Gemma-based models, finding good performance in both semantic extraction and downstream recommendation tasks.

Personal Knowledge Graphs (PKGs) offer a promising method for modeling user preferences while maintaining privacy, especially when dealing with decentralized and unstructured conversational data. A new paper introduces a reproducible pipeline designed to bridge the gap between raw conversational text and structured semantic information. This pipeline leverages lightweight Large Language Models (LLMs) to extract RDF-compliant triples, which are then linked to Wikidata identifiers, from conversational data. This process facilitates the construction of PKGs that can power recommendation systems. The study specifically evaluated Qwen- and Gemma-based models on their ability to accurately extract these triples and assessed the practical utility of the resulting knowledge graphs in a recommendation task. The findings indicate that certain models performed effectively, demonstrating a strong correlation between their triple extraction accuracy and their subsequent performance in generating recommendations.

Why it matters

Professionals in e-commerce, content platforms, and data privacy can utilize this method to build more accurate and privacy-preserving recommendation systems by structuring user preferences from diverse data sources.

How to implement this in your domain

  1. 1Explore using lightweight LLMs (e.g., Qwen, Gemma) for extracting structured user preferences from conversational data.
  2. 2Design and implement a pipeline for converting unstructured text into RDF-compliant triples for PKG construction.
  3. 3Integrate PKGs into existing recommendation engines to enhance personalization and privacy.
  4. 4Evaluate the trade-offs between different LLM models for triple extraction accuracy and computational cost.

Who benefits

E-commerceSocial MediaContent PlatformsAdvertisingData Privacy

Key takeaways

  • Lightweight LLMs can effectively extract structured user preferences from conversational data.
  • Personal Knowledge Graphs offer a privacy-preserving framework for user preference modeling.
  • The extracted triples can significantly improve the performance of downstream recommendation systems.
  • This approach bridges the gap between unstructured text and semantic "things" for better personalization.

Original post by Abhirup Dasgupta, Fernando Spadea, Oshani Seneviratne

"arXiv:2607.00003v1 Announce Type: cross Abstract: Personal Knowledge Graphs (PKGs) offer a privacy-preserving framework for modeling user preferences, yet constructing them from unstructured, decentralized conversational data remains a challenge. This paper bridges the gap betwee…"

View on X

Originally posted by Abhirup Dasgupta, Fernando Spadea, Oshani Seneviratne on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses