Small On-Prem LLMs Show Overrefusal in Legal Contexts

Anastasiia Kucherenko, Fran\c{c}ois Brouchoud, Dimitri Percia David, Andrei Kucharavy· June 24, 2026 View original

Summary

Research reveals that small, on-premises LLMs exhibit significantly higher refusal rates when given authority-style prefixes in legal prompts, potentially introducing biases. This instability under contextual framing suggests a need for further investigation to minimize bias opportunities.

A new study investigates the behavior of small, on-premises large language models (LLMs) when used in legal contexts, specifically focusing on their tendency to refuse to answer certain prompts. Legal professionals are already experimenting with these personal LLMs for tasks like translation and reformulation, but concerns exist about potential biases. The research found a surprising effect: when LLMs were given authority-style prefixes, such as "you are acting as an assistant of the national supreme court" or "[...] defense lawyer," their refusal rates increased dramatically, by 2 to 20 times compared to a baseline without such prefixes. This indicates that these smaller, deployable LLMs are unstable when presented with contextual framings that a real institutional user would naturally employ. The findings highlight a critical issue for the practical deployment of LLMs in sensitive domains like law. The observed overrefusal under specific contextual cues could lead to unintended biases in case processing speed or selective assistance, underscoring the necessity for further research to mitigate these risks and ensure fair and reliable application.

Why it matters

Professionals deploying or considering LLMs in sensitive, regulated environments like legal or healthcare must understand how contextual framing can introduce unexpected biases and affect model reliability. This research highlights a critical, often overlooked, failure mode in smaller, on-premise models.

How to implement this in your domain

  1. 1Test LLM behavior: Conduct thorough adversarial testing on LLMs with various contextual prompts, especially those involving authority or specific roles, to identify refusal biases.
  2. 2Develop bias mitigation strategies: Implement techniques to reduce overrefusal, such as prompt engineering, fine-tuning with diverse legal datasets, or using ensemble methods.
  3. 3Establish clear usage guidelines: Create internal policies for legal professionals on how to phrase prompts to minimize refusal rates and ensure consistent LLM assistance.
  4. 4Monitor LLM outputs: Continuously monitor and audit LLM responses for consistency and potential biases, particularly when used for critical tasks.

Who benefits

LegalComplianceHealthcareGovernment

Key takeaways

  • Small LLMs show increased refusal rates when given authority-style legal prefixes.
  • Contextual framing can significantly impact LLM stability and introduce biases.
  • Overrefusal in LLMs can lead to selective assistance and affect case processing.
  • Further investigation is crucial to minimize bias opportunities in legal AI applications.

Original post by Anastasiia Kucherenko, Fran\c{c}ois Brouchoud, Dimitri Percia David, Andrei Kucharavy

"arXiv:2606.24585v1 Announce Type: new Abstract: While the validity of LLMs' use in the legal context remains subject to ethical and legal debate, legal professionals are already experimenting with personal LLMs, if only for translation and reformulation. However, even such a seem…"

View on X

Originally posted by Anastasiia Kucherenko, Fran\c{c}ois Brouchoud, Dimitri Percia David, Andrei Kucharavy on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses