Cognitive Models Enhance LLM Simulation of Human Persuasion

Zirui Cheng, Zeyu Shen, Thomas L. Griffiths, Peter Henderson· June 17, 2026 View original

Summary

Researchers propose "Equation-to-Behavior Prompting" and "Equation-to-Behavior RL" to guide large language models in simulating diverse human decision-making behaviors, including biases, in persuasion games. This approach uses mathematical cognitive models to create more realistic and varied simulated human agents for AI training and evaluation.

Large language models (LLMs) are increasingly used to simulate human behavior for safety evaluations and training purposes. However, these simulations often fail to capture the full spectrum of human decision-making, which can range from rational Bayesian updating to biased motivated reasoning. To address this limitation, researchers suggest leveraging mathematical models from cognitive science and economics to guide LLMs in mimicking a broader range of human behaviors. The proposed approach, termed "Equation-to-Behavior Prompting," involves instructing LLMs to adhere to specific mathematical models of human decision-making. This method was evaluated on persuasion games based on legal scenarios, demonstrating that large models can approximate various equation-based specifications, including Bayesian updating and models of motivated reasoning. Smaller models, however, struggled with this prompting technique. To improve smaller models, "Equation-to-Behavior RL" was introduced, using reinforcement learning to train them to follow mathematical rules. This significantly reduced belief error in out-of-distribution parameterizations. The study highlights that these enhanced simulations can create more diverse training environments, leading to improved performance in persuading other LLMs. This work paves the way for more realistic human simulations, benefiting AI training and evaluation, and enabling deeper research into complex human decision-making models.

Why it matters

This research is crucial for professionals developing AI systems that interact with humans, especially in sensitive domains like negotiation, sales, or legal tech. By enabling more realistic and diverse human simulations, it allows for more robust AI training, better safety evaluations, and the development of AI that can adapt to varied human behaviors.

How to implement this in your domain

  1. 1Integrate cognitive models of human decision-making into LLM prompting strategies for simulating user behavior.
  2. 2Apply "Equation-to-Behavior Prompting" to create diverse simulated personas for testing AI agents in persuasion or negotiation scenarios.
  3. 3Consider using reinforcement learning (Equation-to-Behavior RL) to fine-tune smaller LLMs for more accurate behavioral simulations.
  4. 4Utilize these advanced human simulations to generate more varied and robust training data for AI models.
  5. 5Evaluate AI system performance against a wider range of simulated human biases and decision-making styles to improve real-world adaptability.

Who benefits

AI DevelopmentMarketingSalesLegalTechCustomer Service

Key takeaways

  • Cognitive models can significantly improve LLM simulations of diverse human behaviors.
  • "Equation-to-Behavior Prompting" guides large LLMs to match mathematical decision models.
  • Reinforcement learning can enhance smaller LLMs' ability to adhere to these models.
  • More realistic human simulations lead to better AI training and evaluation environments.

Original post by Zirui Cheng, Zeyu Shen, Thomas L. Griffiths, Peter Henderson

"arXiv:2606.17657v1 Announce Type: new Abstract: People make decisions differently in strategic interactions. Some update beliefs like a Bayesian; others exhibit biases like motivated reasoning. Although creators of large language models use simulated humans for safety evaluations…"

View on X

Originally posted by Zirui Cheng, Zeyu Shen, Thomas L. Griffiths, Peter Henderson on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses