New Benchmark for Audience-Aware AI Slide Generation

Haodong Chen, Xuanhe Zhou, Wei Zhou, Xinyue Shao, Yanbing Zhu, Bo Wang, Jiawei Hong, Anya Jia, Fan Wu· June 18, 2026 View original

Summary

A new benchmark, X+Slides, has been introduced to evaluate large language models' ability to generate slide decks tailored to specific target audiences. Unlike previous benchmarks, X+Slides assesses audience coverage, domain-wise coverage, efficiency, and correctness, revealing that current systems still struggle to fully meet audience-specific information needs.

A novel benchmark called X+Slides has been developed to address a critical gap in evaluating AI-driven slide generation: the ability to tailor content for different audiences. Existing benchmarks primarily focus on completeness and technical depth, often overlooking the crucial requirement for presentations to resonate with specific groups, such as specialists needing detailed proofs versus decision-makers requiring actionable summaries. X+Slides is built upon a diverse dataset of 113 topics and seven presentation scenarios, utilizing a dynamic evaluation framework with over 8,000 source-grounded probes. It introduces four key metrics: Audience Coverage, which quantifies essential information conveyed; Domain-wise Coverage, indicating types of information covered; Efficiency, measuring utility per attention cost; and Correctness, verifying source support for claims. Experiments with models like DeepPresenter, SlideTailor, and NotebookLM show that while these systems can recover a substantial portion of audience-essential information, they still fall short of complete audience-specific tailoring. For instance, DeepPresenter achieved an Audience Coverage of 0.714, highlighting that visual quality and broad topic coverage alone are insufficient without robust source-grounded and audience-aware evaluation.

Why it matters

This benchmark is vital for professionals who rely on AI for content creation, especially presentations, as it pushes for more sophisticated, audience-aware AI tools. Improving audience conditioning can significantly enhance communication effectiveness and reduce manual refinement efforts.

How to implement this in your domain

  1. 1Adopt audience-conditioned prompting strategies when using LLMs for presentation generation.
  2. 2Evaluate AI-generated content not just for accuracy but also for its relevance and suitability for the intended audience.
  3. 3Provide explicit audience profiles and communication goals to AI models when requesting slide decks.
  4. 4Integrate X+Slides metrics into internal AI content generation tool development and testing.

Who benefits

MarketingEducationConsultingCorporate TrainingAI Engineering

Key takeaways

  • X+Slides benchmarks LLMs for generating audience-conditioned slide decks.
  • It introduces metrics like Audience Coverage, Efficiency, and Correctness.
  • Current LLMs show promise but still struggle with full audience-specific tailoring.
  • Audience-aware prompt design is crucial for effective AI-generated presentations.

Original post by Haodong Chen, Xuanhe Zhou, Wei Zhou, Xinyue Shao, Yanbing Zhu, Bo Wang, Jiawei Hong, Anya Jia, Fan Wu

"arXiv:2606.19256v1 Announce Type: new Abstract: Automatically generating slide decks from source documents is an important application of large language models (LLMs). Existing benchmarks primarily assess slide completeness and technical depth, while overlooking the target audien…"

View on X

Originally posted by Haodong Chen, Xuanhe Zhou, Wei Zhou, Xinyue Shao, Yanbing Zhu, Bo Wang, Jiawei Hong, Anya Jia, Fan Wu on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses