Machine-Learned Comorbidity Index Improves Patient Risk Assessment

Suleman Baloch, Kishlay Jha, Alberto M. Segre, Philip M. Polgreen, Bijaya Adhikari· June 17, 2026 View original

Summary

A new Machine-Learned Comorbidity Index (MLCI) has been developed to overcome limitations of traditional comorbidity scores by capturing nonlinear, outcome-specific risk relationships from diagnosis codes. MLCI maximizes the normalized Hilbert-Schmidt Independence Criterion between the score and multiple clinical outcomes, offering a more informative patient stratification.

Traditional comorbidity scores, such as the Charlson and Elixhauser indices, are widely used in healthcare for risk adjustment and patient stratification. However, these conventional scores suffer from two primary limitations. Firstly, they are largely focused on mortality outcomes, which means they may not accurately align with other crucial clinical outcomes. Secondly, their linear, rule-based structure prevents them from capturing the complex, nonlinear, and outcome-specific risk relationships that exist within patient data. To address these shortcomings, researchers have proposed a novel Machine-Learned Comorbidity Index (MLCI). This index is designed to map diagnosis codes to a single scalar value. Its development involves maximizing the normalized Hilbert-Schmidt Independence Criterion (nHSIC) between the learned score and multiple clinical outcomes. This approach allows MLCI to effectively capture nonlinear dependencies between risk factors and patient outcomes. The theoretical underpinnings of MLCI also characterize the conditions under which a unified, informative admission-level ordering can be achieved across various outcomes. Empirical evaluations using multiple benchmark electronic health record (EHR) datasets demonstrated that MLCI consistently outperforms strong baseline models across several evaluation metrics, indicating its potential to provide a more accurate and comprehensive assessment of patient risk and comorbidity.

Why it matters

For healthcare professionals and data scientists, MLCI offers a more accurate and nuanced tool for patient risk assessment and stratification, moving beyond mortality-centric views. This can lead to better-informed clinical decisions, more personalized treatment plans, and improved resource allocation, ultimately enhancing patient care.

How to implement this in your domain

  1. 1Evaluate existing comorbidity indices in your clinical setting for their alignment with diverse outcomes.
  2. 2Explore machine learning techniques to develop custom comorbidity indices tailored to specific patient populations or outcomes.
  3. 3Utilize advanced statistical methods like nHSIC to capture nonlinear relationships in health data.
  4. 4Integrate machine-learned comorbidity scores into electronic health record (EHR) systems for real-time risk assessment.
  5. 5Collaborate with data scientists to validate and deploy MLCI or similar models for improved patient stratification.

Who benefits

HealthcareHealthTechPharmaceuticalsInsurancePublic Health

Key takeaways

  • Traditional comorbidity scores are limited by mortality focus and linear structure.
  • MLCI captures nonlinear, outcome-specific risk relationships from diagnosis codes.
  • It maximizes nHSIC between the score and multiple clinical outcomes for better alignment.
  • Empirical results show MLCI outperforms baselines in patient risk assessment.

Original post by Suleman Baloch, Kishlay Jha, Alberto M. Segre, Philip M. Polgreen, Bijaya Adhikari

"arXiv:2606.17450v1 Announce Type: new Abstract: Traditional comorbidity scores (e.g., Charlson and Elixhauser) are widely used for risk adjustment and patient stratification, but they have two key limitations: (i) they are largely mortality-centric and do not align well with othe…"

View on X

Originally posted by Suleman Baloch, Kishlay Jha, Alberto M. Segre, Philip M. Polgreen, Bijaya Adhikari on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses