LaNCoR Improves Earthquake Arrival Time Picking with Noisy Labels

Sen Li, Xu Yang, S. Mostafa Mousavi, Anye Cao, Keting Fan, Yaoqi Liu, Changbin Wang, Qiang Niu· June 16, 2026 View original

Summary

A new approach called LaNCoR (Label Noise-Contrastive Robust Learning) effectively handles inaccurate labels in seismic signal processing, particularly for earthquake P-phase arrival-time picking. It aligns feature and label distributions to correct mislabeling, significantly improving model performance without requiring extensive datasets.

In supervised machine learning, the presence of inaccurately labeled training data, commonly known as "label noise," poses a significant threat to model integrity and performance. This corruption directly leads to models learning erroneous mappings, resulting in poor generalization and reduced accuracy on unseen data. In seismology, current applications typically rely on either very large training datasets or extensive data augmentation techniques to mitigate the impact of such label noise, which can be both labor-intensive and costly. This paper introduces a novel method called Label Noise-Contrastive Robust Learning (LaNCoR), designed to effectively manage noisy labels in seismic signal processing tasks. A key advantage of LaNCoR is its ability to achieve this without the need for massive training datasets. The approach works by aligning the input waveform feature and label representation distributions within the feature space. This alignment mechanism helps to correct mislabeling and substantially reduce its detrimental effects on the training process. The researchers demonstrated LaNCoR's effectiveness on the critical task of P-phase arrival-time picking using real microseismic data. When applied to two baseline models and training approaches, LaNCoR showed remarkable improvements, boosting performance by up to 28.8% across various metrics. This robust learning approach holds considerable promise for enhancing model training not only in seismology but also across broader geosciences.

Why it matters

For geoscientists, seismologists, and AI engineers working with sensor data, LaNCoR offers a powerful solution to a common problem: noisy labels. It enables the development of more accurate and reliable models for critical tasks like earthquake detection and analysis, even with imperfect data.

How to implement this in your domain

  1. 1Assess the quality of labels in existing datasets for time-series or sensor data applications.
  2. 2Investigate integrating LaNCoR's label noise-contrastive learning approach into your machine learning pipelines.
  3. 3Apply LaNCoR to tasks involving noisy labels, such as event detection, signal classification, or anomaly identification in various domains.
  4. 4Benchmark the performance improvements achieved by LaNCoR against traditional noise handling methods or large-scale data augmentation.
  5. 5Consider using LaNCoR to reduce the reliance on costly manual labeling or extensive data collection efforts.

Who benefits

SeismologyGeosciencesOil & GasEnvironmental MonitoringStructural Health Monitoring

Key takeaways

  • Label noise significantly degrades machine learning model performance in seismology.
  • LaNCoR is a new approach to handle noisy labels without large datasets.
  • It works by aligning feature and label distributions in the feature space.
  • LaNCoR substantially improves performance for earthquake arrival-time picking.

Original post by Sen Li, Xu Yang, S. Mostafa Mousavi, Anye Cao, Keting Fan, Yaoqi Liu, Changbin Wang, Qiang Niu

"arXiv:2606.15377v1 Announce Type: new Abstract: Inaccurately labeled training data, or "label noise", poses a significant threat to the integrity of supervised machine learning models. This corruption directly degrades performance by teaching the model erroneous mappings between…"

View on X

Originally posted by Sen Li, Xu Yang, S. Mostafa Mousavi, Anye Cao, Keting Fan, Yaoqi Liu, Changbin Wang, Qiang Niu on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses