Cough Regression Benchmark Advances Respiratory Acoustic AI Models

Mayur Sanap, Prasanna Desikan, Edgar Lobaton· June 16, 2026 View original

Summary

A new benchmark evaluates respiratory acoustic foundation models for predicting continuous health quantities like age, BMI, and disease probability from cough audio, moving beyond simple classification.

While respiratory acoustic foundation models (FMs) have shown strong performance in classifying coughs, their capability to predict continuous health metrics from cough audio has remained largely unexplored. This is a significant gap, as estimating quantities like age, BMI, or disease probability passively from coughs holds substantial clinical value, particularly in settings where direct physical measurements are not feasible. Researchers have introduced a comprehensive multi-model, multi-target cough regression benchmark. This benchmark assesses five different FMs across six distinct health targets using three diverse datasets, employing subject-disjoint protocols. The evaluation compares the effectiveness of linear, small MLP, and full MLP regression heads attached to these foundation models. Key findings indicate that a small MLP regression head generally outperforms both a mean-predictor baseline and linear probing across most tasks. While a full MLP can overfit on smaller clinical datasets, its performance recovers with larger data, highlighting a trade-off between dataset size and model capacity. Notably, models like HeAR and OPERA-GT demonstrate strong performance in age regression, and cross-dataset transfer learning shows asymmetric results, with large, diverse datasets generalizing better to smaller clinical populations than vice versa.

Why it matters

This research is crucial for professionals in healthcare and AI, as it paves the way for more sophisticated, passive health monitoring systems that can derive continuous, clinically relevant information from readily available data like cough sounds.

How to implement this in your domain

  1. 1Explore existing respiratory acoustic foundation models for potential integration into health monitoring applications.
  2. 2Develop and fine-tune regression models to predict continuous health metrics from cough audio data.
  3. 3Evaluate model performance on diverse datasets, paying close attention to the trade-offs between model complexity and dataset size.
  4. 4Investigate cross-dataset transfer learning strategies to enhance the robustness and generalizability of cough analysis models.
  5. 5Integrate passive cough analysis capabilities into telehealth platforms or remote patient monitoring devices.

Who benefits

HealthcareTelemedicineWearable TechDigital Health

Key takeaways

  • Respiratory acoustic foundation models can predict continuous health metrics from coughs.
  • A new benchmark evaluates FMs for regression tasks like age, BMI, and disease probability.
  • Model capacity and dataset size are critical factors for effective cough audio regression.
  • Cross-dataset transfer learning for cough analysis exhibits asymmetric generalization patterns.

Original post by Mayur Sanap, Prasanna Desikan, Edgar Lobaton

"arXiv:2606.15436v1 Announce Type: new Abstract: Respiratory acoustic foundation models (FMs) excel at cough classification, yet their ability to predict continuous health quantities from cough audio remains largely unexplored, despite the clinical value of passive age, BMI, and d…"

View on X

Originally posted by Mayur Sanap, Prasanna Desikan, Edgar Lobaton on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses