ML Detects LDAP Reconnaissance Using Weak Supervision

Shaefer Drew, Edward Raff, Michael Brautbar, Yaron Zinar, Benjamin Malmberg, Dor Agron, Sagi Sheinfeld, Avraham Kama, Asaf Romano· June 30, 2026 View original

Summary

Researchers developed two machine learning frameworks to identify malicious LDAP queries and extract signatures, aiming to detect threat actors early in the reconnaissance phase. The approach uses weak supervision to label large datasets, making it practical for deployment.

This research introduces two machine learning frameworks designed to enhance the detection of Lightweight Directory Access Protocol (LDAP) reconnaissance activities. LDAP is a common target for threat actors seeking to map Active Directory data after an initial compromise. The goal is to identify these malicious queries early in the attack chain. The first framework employs a weakly supervised ML classifier to predict malicious LDAP queries. It correlates LDAP queries with endpoint detections to automatically label a massive dataset, overcoming the cost and time constraints of manual labeling. The second framework builds upon this by using a statistical hypothesis-testing method to mine novel, malicious LDAP signatures for immediate deployment. Despite the limitations of weak supervision compared to manual labeling, this automated approach proved practical and effective. The classifier achieved a 65% True Positive Rate with controlled false positives, while the mined signatures showed 81.48% field precision in real-world testing.

Why it matters

Professionals can leverage these ML-driven methods to significantly improve early detection of sophisticated cyber threats targeting Active Directory, reducing the window of opportunity for attackers. This offers a more scalable and efficient alternative to traditional, static detection rules.

How to implement this in your domain

  1. 1Evaluate current LDAP logging and monitoring capabilities for completeness and integration with security information and event management (SIEM) systems.
  2. 2Explore integrating weak supervision techniques into existing security analytics platforms to automate the labeling of large security datasets.
  3. 3Pilot the deployment of ML classifiers for real-time analysis of LDAP query logs to identify suspicious patterns.
  4. 4Develop or adopt tools that can automatically extract and deploy new malicious LDAP signatures based on observed anomalies.
  5. 5Train security operations center (SOC) analysts on the outputs and interpretability of ML-driven detection systems to enhance incident response.

Who benefits

CybersecurityBFSIGovernmentIT ServicesHealthcare

Key takeaways

  • New ML frameworks detect LDAP reconnaissance early in cyberattacks.
  • Weak supervision enables large-scale, cost-effective dataset labeling for security.
  • The methods achieve high true positive rates and precision in identifying malicious queries.
  • This approach offers a dynamic alternative to static, rule-based threat detection.

Original post by Shaefer Drew, Edward Raff, Michael Brautbar, Yaron Zinar, Benjamin Malmberg, Dor Agron, Sagi Sheinfeld, Avraham Kama, Asaf Romano

"arXiv:2606.28917v1 Announce Type: new Abstract: Lightweight Directory Access Protocol (LDAP) is a protocol that allows users to query and modify Active Directory (AD) data. By default, all users have read access to all AD data through LDAP, making it a common initial tool for rec…"

View on X

Originally posted by Shaefer Drew, Edward Raff, Michael Brautbar, Yaron Zinar, Benjamin Malmberg, Dor Agron, Sagi Sheinfeld, Avraham Kama, Asaf Romano on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses