LaNCoR Improves Earthquake Arrival Time Picking with Noisy Labels
Summary
A new approach called LaNCoR (Label Noise-Contrastive Robust Learning) effectively handles inaccurate labels in seismic signal processing, particularly for earthquake P-phase arrival-time picking. It aligns feature and label distributions to correct mislabeling, significantly improving model performance without requiring extensive datasets.
Why it matters
For geoscientists, seismologists, and AI engineers working with sensor data, LaNCoR offers a powerful solution to a common problem: noisy labels. It enables the development of more accurate and reliable models for critical tasks like earthquake detection and analysis, even with imperfect data.
How to implement this in your domain
- 1Assess the quality of labels in existing datasets for time-series or sensor data applications.
- 2Investigate integrating LaNCoR's label noise-contrastive learning approach into your machine learning pipelines.
- 3Apply LaNCoR to tasks involving noisy labels, such as event detection, signal classification, or anomaly identification in various domains.
- 4Benchmark the performance improvements achieved by LaNCoR against traditional noise handling methods or large-scale data augmentation.
- 5Consider using LaNCoR to reduce the reliance on costly manual labeling or extensive data collection efforts.
Who benefits
Key takeaways
- Label noise significantly degrades machine learning model performance in seismology.
- LaNCoR is a new approach to handle noisy labels without large datasets.
- It works by aligning feature and label distributions in the feature space.
- LaNCoR substantially improves performance for earthquake arrival-time picking.
Original post by Sen Li, Xu Yang, S. Mostafa Mousavi, Anye Cao, Keting Fan, Yaoqi Liu, Changbin Wang, Qiang Niu
"arXiv:2606.15377v1 Announce Type: new Abstract: Inaccurately labeled training data, or "label noise", poses a significant threat to the integrity of supervised machine learning models. This corruption directly degrades performance by teaching the model erroneous mappings between…"
View on XOriginally posted by Sen Li, Xu Yang, S. Mostafa Mousavi, Anye Cao, Keting Fan, Yaoqi Liu, Changbin Wang, Qiang Niu on X · view source
Want to go deeper?
Turn these trends into skills with Learnijoy's hands-on AI & tech courses.
Explore coursesMore in AI Engineering & DevTools
OpenAI's Advanced Models: Frustration Over Limited Access
The author expresses frustration over the limited public access to OpenAI's most powerful AI models, like the rumored 5.6, suggesting that current models still suffice for most tasks, albeit requiring more prompting. They criticize OpenAI's communication strategy regarding these advanced, restricted models.
AI-Powered Development Workflow Integrates Multiple Models
A new development workflow leverages various AI models like Grok 4.3, GPT-5.5, and Opus 4.8 for distinct stages including research, planning, coding, testing, and debugging. This structured approach aims to optimize the software development lifecycle.

Proposing AI Usage Transparency for Credible Commentary
The author suggests a requirement for individuals and organizations to publish their percentage of frontier AI usage at work and personal usage. This transparency would establish credibility before commenting on AI's utility.