AI Framework Predicts Individual Absenteeism with High Accuracy

Kwong Ho Li, Matthew Roughan, Wathsala Karunarathne· July 1, 2026 View original

Summary

This research proposes a Time Series Classification (TSC) framework for proactive, individual-level absenteeism prediction, addressing severe class imbalance. It demonstrates that an LSTM-FCN model, combined with Binary Focal Loss or G-Mean, achieves high precision and specificity, outperforming traditional methods.

Staff absenteeism poses significant operational costs, particularly in high-demand sectors like healthcare and logistics, where reliable individual-level absence prediction is crucial for workforce planning. Existing prediction methods often fall short by merely reproducing past outcomes rather than forecasting future events, and they typically ignore the sequential nature of attendance histories.To address these limitations, researchers developed a novel Time Series Classification (TSC) framework. This framework distinctly separates historical attendance sequences from future absence labels, enabling genuinely proactive predictions. Given the scarcity of public longitudinal attendance data, a reproducible simulated dataset, calibrated to the UCI dataset, was constructed for the study.The framework was evaluated under severe class imbalance, comparing Binary Focal Loss (BFL) and Geometric Mean (G-Mean) loss functions. Both achieved comparable, strong performance, with BFL requiring parameter calibration while G-Mean adapted automatically. Among the deep learning architectures tested—LSTM, CNN, and the hybrid LSTM-FCN—the LSTM-FCN consistently delivered robust precision and specificity. The study found stable performance with specific batch and window sizes, achieving approximately 80% balanced accuracy on test data.

Why it matters

For HR, operations, and leadership teams in industries with high demand and significant absenteeism costs, this framework offers a powerful tool for proactive workforce planning, resource optimization, and cost reduction.

How to implement this in your domain

  1. 1Collect and structure individual-level attendance data as time-series sequences.
  2. 2Explore implementing a Time Series Classification (TSC) framework for absenteeism prediction, potentially using LSTM-FCN architectures.
  3. 3Experiment with loss functions like Binary Focal Loss or G-Mean to handle class imbalance in your specific dataset.
  4. 4Integrate proactive absenteeism predictions into workforce planning and scheduling systems.
  5. 5Develop strategies to address potential ethical concerns regarding individual-level prediction and employee privacy.

Who benefits

HealthcareLogisticsManufacturingConstructionRetail

Key takeaways

  • Proactive absenteeism prediction requires a time-series classification approach.
  • The LSTM-FCN architecture shows strong performance for this task.
  • Loss functions like Binary Focal Loss or G-Mean effectively handle class imbalance.
  • Reliable individual-level predictions can significantly aid workforce planning.

Original post by Kwong Ho Li, Matthew Roughan, Wathsala Karunarathne

"arXiv:2606.31532v1 Announce Type: new Abstract: Staff absenteeism imposes substantial operational costs in high-demand work environments such as healthcare, emergency services, meat processing, construction, and courier and delivery services, where proactive workforce planning de…"

View on X

Originally posted by Kwong Ho Li, Matthew Roughan, Wathsala Karunarathne on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

More in AI Engineering & DevTools

AI ResearchAI Engineering & DevTools

Philosophical Foundations for Explainable AI in Healthcare Explored

This paper critically reviews the intersection of philosophy of science and explainable AI (XAI) in health sciences, examining what constitutes an adequate medical explanation. It identifies causality, trust, and epistemic adequacy as central axes for designing robust XAI systems in clinical decision-making.

Martina Mattioli, Marcello PelilloJul 1, 2026
AI ResearchAI Engineering & DevTools

New Metric Improves LLM Reinforcement Learning with Verifiable Rewards.

This research introduces the Relative Surprisal Index (RSI), an information-theoretic metric for adaptive token selection in Reinforcement Learning with Verifiable Rewards (RLVR) for LLMs. RSI-S, an entropy-adaptive filtering method based on RSI, improves reasoning accuracy by 2-3 percentage points by retaining tokens within a stable surprisal interval.

Outongyi Lv, Yanzhao Zheng, Yuanwei Zhang, Zhenghao Huang, Xingjun Wang, Baohua Dong, Hangcheng Zhu, Yingda ChenJul 1, 2026
AI Engineering & DevToolsAI Research

New ACE Module Boosts LLM Agent Context Management

Researchers introduce ACE (Adaptive Context Elasticizer), a plug-and-play module that dynamically manages historical information for LLM-based agents. ACE maintains a lossless message layer and adaptively orchestrates context, significantly improving performance across various agent frameworks without architectural changes.

Ning Liao, Zihao Long, Xiaoxing Wang, Xue Yang, Yaoming Wang, Ziyuan Zhuang, Xunliang Cai, Rongxiang Weng, Junchi YanJul 1, 2026