RGLD Boosts Tabular Anomaly Detection Accuracy and Speed

Quanling Zhao, Jiaying Yang, Ye Tian, Josh Victoria, Zhijun Wang, Pietro Mercati, Onat Gungor, Tajana Rosing· June 30, 2026 View original

Summary

RGLD is a new randomized global-local density estimator for unsupervised tabular anomaly detection that combines global random-feature density with local neighbor support. It achieves state-of-the-art accuracy, outperforms deep detectors in speed, and is robust across diverse datasets.

Unsupervised tabular anomaly detection requires methods that are not only accurate and robust across various datasets but also computationally efficient. Traditional statistical detectors often lack flexibility, while deep learning methods, though powerful, are typically slow and difficult to tune without supervision. This research introduces RGLD (Randomized Global-Local Density Estimation), a novel approach designed to address these challenges. RGLD combines two complementary branches: a global random-feature density branch that identifies samples in broadly low-density regions, and a local neighbor branch that detects samples poorly supported by nearby observations. Both branches operate over feature-bagged randomized views of the data, allowing RGLD to uncover anomaly evidence that might be obscured in a single data representation. Extensive experiments on 47 tabular datasets, comparing RGLD against 23 statistical and deep anomaly detection baselines, demonstrated its superior performance. RGLD achieved the strongest dataset-level AUROC, ranking first in dataset wins, and second in AUPRC wins. Crucially, it also proved significantly faster than all evaluated deep detectors, achieving 50x-580x speedups, while remaining competitive with statistical methods in runtime, offering an excellent accuracy-efficiency trade-off.

Why it matters

Professionals dealing with tabular data can leverage RGLD to detect anomalies more accurately and efficiently, improving fraud detection, cybersecurity, quality control, and predictive maintenance without the high computational cost of deep learning.

How to implement this in your domain

  1. 1Evaluate current anomaly detection systems for tabular data, noting their accuracy, robustness, and computational overhead.
  2. 2Explore integrating RGLD or similar randomized density estimation techniques into existing data analytics pipelines.
  3. 3Pilot RGLD on critical business datasets (e.g., transaction logs, sensor data) to identify previously missed anomalies.
  4. 4Benchmark RGLD's performance against current deep learning and statistical methods in terms of both accuracy and processing speed.
  5. 5Train data scientists and analysts on the principles and application of advanced unsupervised anomaly detection methods.

Who benefits

BFSICybersecurityManufacturingHealthcareRetail

Key takeaways

  • RGLD offers superior accuracy and efficiency for tabular anomaly detection.
  • It combines global and local density estimation for robust performance.
  • The method significantly outperforms deep learning detectors in speed.
  • RGLD is highly effective across a wide range of diverse datasets.

Original post by Quanling Zhao, Jiaying Yang, Ye Tian, Josh Victoria, Zhijun Wang, Pietro Mercati, Onat Gungor, Tajana Rosing

"arXiv:2606.28970v1 Announce Type: new Abstract: Unsupervised tabular anomaly detection requires methods that are accurate, robust across heterogeneous datasets, and computationally efficient. Classical statistical detectors are often efficient, but they usually rely on a fixed da…"

View on X

Originally posted by Quanling Zhao, Jiaying Yang, Ye Tian, Josh Victoria, Zhijun Wang, Pietro Mercati, Onat Gungor, Tajana Rosing on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses