Language Priors Enhance Darcy-Flow Inversion in Geological Modeling

Taiga Saito, Yu Otake, Daijiro Mizutani, Sopheakpolin Mom· June 25, 2026 View original

Summary

This research explores using sentence embeddings as an interface to inject qualitative geological descriptions into learned Darcy-flow inverse solvers. It demonstrates that text conditioning significantly reduces reconstruction error, primarily by providing categorical, class-level constraints.

In complex inverse problems, the accuracy of a solution heavily relies on the prior information provided, often as much as on the data itself. Much of the valuable engineering knowledge, especially in fields like geology, exists in qualitative descriptions rather than formal mathematical models. This study investigates whether sentence embeddings can bridge this gap by serving as an inference-time interface to integrate geological descriptions into a learned Darcy-flow inverse solver. The researchers tested this approach across six synthetic geological classes and a benchmark reservoir model. By varying only the conditioning representation, they found that text conditioning reduced reconstruction error by 81% compared to a no-text baseline. This improvement largely stemmed from categorical, class-level constraints, particularly in areas where hydraulic head data left the conductivity field underdetermined. While within-class geometric detail was secondary, sentence embeddings also improved training stability and enabled advanced analyses like paraphrase-based sensitivity and open-vocabulary inputs, showcasing their potential as an engineering-informatics tool for injecting expert knowledge.

Why it matters

Integrating qualitative expert knowledge via language priors can significantly improve the accuracy and stability of inverse problem solvers in engineering and scientific domains. This approach offers a new way to leverage unstructured textual data for complex simulations and predictions.

How to implement this in your domain

  1. 1Identify inverse problems in your domain where qualitative expert descriptions are abundant but underutilized.
  2. 2Explore using sentence embeddings or other natural language processing techniques to convert textual knowledge into model priors.
  3. 3Integrate these language-based priors into existing learned solvers or develop new ones capable of handling such inputs.
  4. 4Evaluate the impact of language priors on solution accuracy, training stability, and the ability to perform sensitivity analyses.
  5. 5Develop interfaces that allow domain experts to easily input and refine textual descriptions for improved model performance.

Who benefits

Oil & GasEnvironmental EngineeringGeophysicsMaterials ScienceCivil Engineering

Key takeaways

  • Language priors, via sentence embeddings, can significantly improve the accuracy of learned inverse solvers.
  • Text conditioning primarily provides valuable categorical, class-level constraints, especially in underdetermined regions.
  • This method enhances training stability and enables advanced sensitivity analysis.
  • It offers a novel engineering-informatics interface for injecting qualitative expert knowledge into complex models.

Original post by Taiga Saito, Yu Otake, Daijiro Mizutani, Sopheakpolin Mom

"arXiv:2606.24967v1 Announce Type: new Abstract: In ill-posed inverse problems, the recovered solution depends as much on the prior as on the data, yet much of the engineering knowledge that could serve as that prior is recorded qualitatively rather than in formal mathematical for…"

View on X

Originally posted by Taiga Saito, Yu Otake, Daijiro Mizutani, Sopheakpolin Mom on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses