New ML Method Boosts CNS Tumor Classification Accuracy

Paulo R. Ferreira Jr., Lucas Coutinho Freitas, La\'is dos Santos Gon\c{c}alves, William Borges Domingues, Lucas Petitemberte de Souza, Mariana B. Michalowski, Vinicius F. Campos· July 3, 2026 View original

Summary

This research introduces a novel machine learning approach combining Sparse Random Projection with multinomial logistic regression for classifying Central Nervous System (CNS) tumors from DNA methylation data. The method significantly improves classification accuracy, achieving 96% on a reference cohort and outperforming state-of-the-art by 4-5 percentage points on an independent clinical cohort.

Researchers have developed a new, methodologically rigorous machine learning technique for classifying Central Nervous System (CNS) tumors based on DNA methylation profiles. This innovative approach integrates Sparse Random Projection for efficient dimensionality reduction with multinomial logistic regression for robust classification. The study rigorously evaluated the new method against a widely used reference classifier. On a 2,801-sample reference cohort, the proposed model achieved an impressive mean accuracy of 96% under stratified 3-fold cross-validation. Crucially, when tested on an independent 1,104-sample clinical evaluation cohort, the method reached 86% accuracy at the 91-class level and 93% at the methylation class family level. These figures represent a significant improvement of approximately 4 to 5 percentage points over the previous state-of-the-art, offering clinically relevant gains that can directly impact cancer subtype assignment and treatment decisions.

Why it matters

Improved accuracy in CNS tumor classification directly translates to more precise diagnoses and better-informed treatment plans, leading to better patient outcomes in oncology.

How to implement this in your domain

  1. 1Evaluate the feasibility of integrating this new ML approach into existing diagnostic pipelines for CNS tumors.
  2. 2Collaborate with research institutions to validate the model on diverse, larger clinical datasets.
  3. 3Train medical professionals on the capabilities and limitations of AI-assisted tumor classification.
  4. 4Develop robust data governance and privacy protocols for handling sensitive DNA methylation data.
  5. 5Explore commercial partnerships to bring this advanced diagnostic tool to clinical practice.

Who benefits

HealthcarePharmaceuticalsMedical DiagnosticsBiotechnology

Key takeaways

  • A new ML approach significantly improves CNS tumor classification accuracy using DNA methylation.
  • The method combines Sparse Random Projection with multinomial logistic regression.
  • It outperforms state-of-the-art by 4-5 percentage points on independent clinical data.
  • Improved classification directly impacts cancer subtype assignment and treatment decisions.

Original post by Paulo R. Ferreira Jr., Lucas Coutinho Freitas, La\'is dos Santos Gon\c{c}alves, William Borges Domingues, Lucas Petitemberte de Souza, Mariana B. Michalowski, Vinicius F. Campos

"arXiv:2607.01307v1 Announce Type: new Abstract: NA methylation profiling has become a powerful approach for central nervous system (CNS) tumor classification, yet important challenges remain regarding cross-cohort transferability, methodological correctness, and robust multiclass…"

View on X

Originally posted by Paulo R. Ferreira Jr., Lucas Coutinho Freitas, La\'is dos Santos Gon\c{c}alves, William Borges Domingues, Lucas Petitemberte de Souza, Mariana B. Michalowski, Vinicius F. Campos on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses