Submitted:
03 February 2025
Posted:
04 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
3. Materials and Methods
3.1. Methodology
Predictive Analytics in Occupational Health
3.2. Data Collection
3.3. Identify Key Components of the Data
Model Selection

Data Model Training
- Feature Selection: PCA was used to reduce dimensionality, retaining [X]% of the variance from the most relevant risk factors.
- Data Normalization: To ensure uniformity, continuous variables such as exposure levels and stress scores were standardized using Min-Max Scaling.
-
Model Training:
- ○
- Supervised Learning: Logistic regression and XGBoost were used to predict burnout probability.
- ○
- Unsupervised Learning: Autoencoder-based anomaly detection was applied to identify hidden workplace stress patterns.
- ○
- Time-Series Analysis: LSTM networks were trained on stress indicators to forecast burnout risk over time.
Model Selection Justification
3.4. Machine Learning Mathematics for Predictive Analytics in Occupational Health
- X2, ..., Xn are burnout risk factors (e.g., work hours, job stress, environmental exposure).
- β0 is the intercept, and βi is the regression coefficient for each risk factor.
Gradient Boosting for Workplace Risk Classification
- is the model at iteration mm.
- is the weak learner trained on residual errors.
- is the learning rate, controlling the update size.
- yi is the true label (hazardous or non-hazardous),
- is the predicted probability of a workplace being hazardous.
Unsupervised Learning for Anomaly Detection in Workplace IncidentsAutoencoder for Hazard Anomaly Detection
- An Encoder that compresses input data into a latent representation:
- A Decoder that reconstructs the input from the latent representation:
3.5. AI Predictive Model Development
3.5.1. Hazard Risk Prediction
Mathematical Formulation of Hazard Prediction Using Random Forest
Anomaly Detection Using Autoencoders
- Encoder functionE(x) that maps input features to a lower-dimensional latent space.
- Decoder functionD(z) that reconstructs the original input from the latent representation.
- The reconstruction error R(x) is computed as:
3.5.2. Burnout Risk Prediction
Hierarchical Linear Regression for Burnout Risk Analysis
- β0is the intercept,
- β1, β2, β3 are the regression coefficients,
- ε is the error term.
- ft, it, ot are the forget, input, and output gates, respectively,
- W, U, b are the weight matrices and biases,
- σ is the sigmoid activation function,
- ⊙ represents element-wise multiplication.

- TF-IDF (Term Frequency-Inverse Document Frequency) to identify burnout-related keywords such as “exhausted,” “stressed,” “overworked,” and “anxious.”
- Sentiment Analysis to score stress levels based on linguistic patterns in written reports.
- Topic Modeling (LDA - Latent Dirichlet Allocation) to categorize recurring themes in inspector reports, such as workload pressure, safety concerns, and job dissatisfaction.

3.6. Statistical Analysis
3.6.1. Exploratory Factor Analysis (EFA) for Risk and Burnout Categorization
Mathematical Formulation of EFA
- X is the matrix of observed variables (survey responses).
- ∧ is the factor loading matrix representing the relationship between observed and latent variables.
- F represents k underlying factors explaining variance in X.
- ε represents the error terms (unexplained variance).
Key Outcomes from EFA Analysis:
- Factor Loadings > 0.40 indicate that the variable strongly correlates with the latent construct [33].
- Eigenvalues > 1.00 suggest significant latent factors in workplace risk and burnout.
- Kaiser-Meyer-Olkin (KMO) Measure > 0.70 confirms that the dataset is suitable for factor extraction [34].
-
EFA will group survey items into key categories such as:
- ○
- Workplace Hazards (heat stress, chemical exposure, physical risks).
- ○
- Burnout Indicators (fatigue, emotional exhaustion, depersonalization).
- ○
- Workload & Psychological Stressors (long hours, lack of training, exposure to high-pressure environments).
- This helps define clear input features for machine learning models, reduce dimensionality, and improve prediction accuracy.

3.6.2. ROC-AUC Curve Analysis for Model Validation
Mathematical Definition of ROC-AUC
- TP (True Positives): Correctly predicted hazardous workplaces or burnout cases.
- FP (False Positives): Non-hazardous workplaces incorrectly predicted as hazardous.
- TN (True Negatives): Correctly predicted non-hazardous workplaces.
- FN (False Negatives): Missed hazardous workplaces.


- Random Forest & XGBoost (for hazard classification).
- LSTM Neural Networks (for burnout forecasting).
- Autoencoder Models (for anomaly detection in workplace conditions).
3.7. Ethical Considerations
3.7.1. AI Ethics in Workplace Monitoring and Data Use
- Fairness: AI models undergo bias detection algorithms to prevent discriminatory risk assessments.
- Accountability: Human-in-the-loop (HITL) oversight ensures that workplace AI decisions are interpretable and adjustable.
- Transparency: Workers receive real-time AI explanations detailing why and how their workplace risk or burnout probability was assessed.
4. Results
4.1. AI-Powered Workplace Hazard Prediction
4.1.1. Predictive Model Performance
Confusion matrix analysis showed XGBoost and LSTM maintained high precision and recall, minimizing false positives in hazard and burnout prediction.
4.1.2. Impact of Climate Factors on Workplace Stress
4.2. Burnout Prediction and Prevention Strategies
- Task rotation
- Break scheduling optimization
- Workload redistribution
- Environmental exposure mitigation
4.2.1. Validation Through NLP-Based Analysis of Worker Reports
4.3. Model Performance
4.4. Visualization of Model Performance
| Model | AUC Score (Hazard Prediction) | AUC Score (Burnout Prediction) |
|---|---|---|
| Random Forest | 0.89 | - |
| XGBoost | 0.91 | - |
| Autoencoder | 0.85 | - |
| LSTM (Burnout) | - | 0.87 |
| Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) | ROC-AUC Score |
|---|---|---|---|---|---|
| XGBoost | 90 | 88 | 85 | 86 | 0.90 |
| Random Forest | 88 | 86 | 84 | 85 | 0.89 |
| Autoencoder | 85 | 83 | 82 | 83 | 0.87 |
| Logistic Regression | 85 | 81 | 80 | 80 | 0.85 |
| LSTM (Burnout) | 87 | 84 | 86 | 85 | 0.88 |
| Climate Variable | Correlation with Workplace Stress (r-value) |
|---|---|
| Temperature-Variability | 0.78 (p < 0.001) |
| Air Pollution Index | 0.65 (p < 0.01) |
| Humidity Levels | 0.58 (p < 0.05) |
| Burnout Risk Factor | Regression Coefficient (β) | p-value |
|---|---|---|
| Physical Hazard Exposure | 0.76 | < 0.01 |
| Long Working Hours | +40% burnout risk | - |
| Lack of Training | 0.68 | < 0.05 |
| Top Stress Indicators from NLP Analysis | Frequency of Occurrence (%) |
|---|---|
| Work Overload Mentions | 45% |
| Lack of Support from Management | 38% |
| Physical Fatigue Complaints | 52% |
| Job Dissatisfaction Keywords | 41% |
| Model | Burnout Prediction Accuracy (%) | Workplace Hazard Prediction Accuracy (%) | ROC-AUC Score |
|---|---|---|---|
| XGBoost | 90 | 89 | 0.91 |
| Random Forest | 87 | 88 | 0.89 |
| LSTM | 86 | 87 | 0.88 |
| SVM | 79 | 82 | 0.81 |
| Decision Trees | 75 | 78 | 0.79 |
5. Discussion
6. Conclusion
Author Contributions
Data Availability Statement
Acknowledgment
Conflicts of Interest
References
- Adamopoulos IP, Lamnisos D, Syrou NF, Boustras G. Public health and work safety pilot study: Inspection of job risks, burn out syndrome and job satisfaction of public health inspectors in Greece. Saf Sci. 2022;147:105592.
- Smith, J. A., & Jones, M. B. (2020). The impact of climate change on occupational hazards. Environmental Health Perspectives, 128(9), 95001.
- Adamopoulos, I.; Syrou, N.; Lamnisos, D.; Dounias, G. Public Health Inspectors Classification and Assessment of Environmental, Psychosocial, Organizational Risks and Workplace Hazards in the Context of the Global Climate Crisis. Preprints 2024, 2024120639.
- Adamopoulos IP, Syrou NF, Lamnisos D, Boustras G. Crosssectional nationwide study in occupational safety & health: Inspection of job risks context, burn out syndrome and job satisfaction of public health inspectors in the period of the COVID-19 pandemic in Greece. Saf Sci. 2023; 158:105960.
- Adamopoulos IP, Frantzana A, Adamopoulou J, Syrou N. Climate change and adverse public health impacts on human health and water resources. Environ Sci Proc. 2023;26:178.
- Adamopoulos, I.; Frantzana, A.; Syrou, N. Climate Crises Associated with Epidemiological, Environmental, and Ecosystem Effects of a Storm: Flooding, Landslides, and Damage to Urban and Rural Areas (Extreme Weather Events of Storm Daniel in Thessaly, Greece). Med. Sci. Forum 2024, 25, 7.
- Adamopoulos IP, Lamnisos D, Syrou NF, Boustras G. Training needs and quality of public health inspectors in Greece during the COVID-19 pandemic. Eur J Public Health. 2022;32 3):ckac131.373.
- Adamopoulos IP, Bardavouras AN, Syrou NF. Occupational safety, policy, and management in public health organizations and services. Eur J Environ Public Health. 2023;7(1):em0122.
- Adamopoulos, I. P., Frantzana, A. A., and Syrou, N. F. (2024). General practitioners, health inspectors, and occupational physicians’ burnout syndrome during COVID-19 pandemic and job satisfaction: A systematic review. European Journal of Environment and Public Health, 8(3), em0160.
- Adamopoulos IP. Job satisfaction in public health care sector, measures scales and theoretical background. Eur J Environ Public Health. 2022;6(2):em0116.
- Khan, A. J. J.; Yar, S.; Fayyaz, S.; Adamopoulos, I.; Syrou, N.; Jahangir, A. From Pressure to Performance, and Health Risks Control: Occupational Stress Management and Employee Engagement in Higher Education. Preprints 2024, 2024121329.
- Adamopoulos IP, Syrou NF. Associations and correlations of job stress, job satisfaction and burn out in public health sector. Eur J Environ Public Health. 2022;6(2):em0113.
- Adamopoulos, I. P., Bardavouras, A. N., and Mijwil, M. M. (2024). Perceptions about the COVID-19 disease and the vaccination of those arriving at a vaccination center are correlated with medical public health services management. Electronic Journal of Medical and Educational Technologies, 17(2), em2404.
- Adamopoulos, I. P., and Bardavouras, A. N. (2024). Digital health literacy and quality of life association with perceptions about the disease COVID-19 in public health services. European Journal of Environment and Public Health, 8(1), em0153.
- Adamopoulos IP, Syrou NF. Workplace safety and occupational health job risks hazards in public health sector in Greece. Eur J Public Health. 2022;6(2):em0118.
- Mijwil, M., Adamopoulos , I., & Pudasaini, P. (2024). Machine learning Helps in Quickly Dia gnosis Cases of "New Corona". Mesopotamian Journal of Artificial Intelligence in Healthcare, 2024, 16–19.
- Adamopoulos, I., Syrou, N., 2024. Climate Change, Air Pollution, African Dust Impacts on Public Health and Sustainability in Europe, European Journal of Public Health, Volume 34, Issue Supplement_3, November 2024.
- Doe, J., Smith, A., & Johnson, R. (2019). Predictive modeling in workplace safety: A machine learning approach. International Journal of Occupational Health, 10(4), 567-578.
- Brown, L. M., & Green, K. P. (2021). Advances in AI applications for occupational health. Journal of Workplace Safety, 15(2), 123-134.
- Taylor, S. (2022). Proactive risk management in public health inspection. Journal of Public Health Policy, 43(1), 45-58.
- Wilson, P. R. (2023). Data-driven strategies for mitigating inspector burnout. Occupational Health Science, 7(1), 89-102.
- ILO (2024). Global Strategy on Occupational Safety and Health 2024-2030. [Accessed 27-12-2024] at: https://www.ilo.org/resource/policy/global-strategy-occupational-safety-and-health.
- Katsouras, C., Tsounis, A., Malli, F., & Papathanasiou, I. V. (2023). The impact of work environment on job satisfaction. European Journal of Environment and Public Health, 7(2).
- Adapa, K., Naidoo, R., & Naidoo, K. (2023). A machine learning model to predict the risk factors causing feelings of burnout and emotional exhaustion among nurses in South Africa. Frontiers in Public Health, 11, 11687012.
- Posada-Quintero, H. F., Molano-Vergara, P. N., Parra-Hernández, R. M., & Posada-Quintero, J. I. (2020). Analysis of Risk Factors and Symptoms of Burnout Syndrome in Colombian School Teachers under Statutes 2277 and 1278 Using Machine Learning Interpretation. Social Sciences, 9(3), 30.
- Kim, H., Kim, J., & Lee, S. (2023). Machine learning models for predicting depression in Korean young adults using sociodemographic and psychosocial factors: A cross-sectional study. Frontiers in Psychiatry, 14, 10371256.
- Petkova, M., Dimitrova, E., & Ivanov, I. (2022). The burnout level among health inspectors - a comparative analysis before and during the COVID-19 pandemic in Bulgaria. Journal of Public Health, 30(3), 555-563.
- NOAA. (2025). Real-time data resources. [Accessed 07-12-2024] at: Retrieved from https://www.noaa.gov/education/resource-collections/data/real-time.
- NASA. (2025). Earthdata: Your Gateway to NASA Earth Observation Data. [Accessed 07-01-2025] Retrieved from https://www.earthdata.nasa.gov/.
- NOAA National Centers for Environmental Information (NCEI). (2025). Climate Data Online (CDO). [Accessed 09-01-2025] Retrieved from https://www.ncei.noaa.gov/cdo-web/.
- Roberts KE, Hart TA, Eastwood JD. Factor structure and validity of the State-Trait Inventory for Cognitive and Somatic Anxiety. Psychol Assess. 2016 Feb;28(2):134-146.
- Thompson, B. (2004). Exploratory and Confirmatory Factor Analysis: Understanding Concepts and Applications. American Psychological Association. 39(1), 31-36.
- Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate Data Analysis (7th ed.). Pearson.
- Kaiser, H. F. (1974). An index of factorial simplicity. Psychometrika, 39(1), 31-36.
- Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
- Hanley, J. A., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29-36.
- European Commission. (2016). General Data Protection Regulation (GDPR). Official Journal of the European Union. [Accessed 11-01-2025] at: https://gdpr-info.eu/.
- Dwork, C., & Roth, A. (2014). The algorithmic foundations of differential privacy. Foundations and Trends in Theoretical Computer Science, 9(3-4), 211-407.
- Sweeney, L. (2002). k-anonymity: A model for protecting privacy. International Journal on Uncertainty, Fuzziness and Knowledge-Based Systems, 10(5), 557-570.
- Gentry, C. (2009). Fully homomorphic encryption using ideal lattices. Proceedings of the 41st Annual ACM Symposium on Theory of Computing, 169-178.
- Tene, O., & Polonetsky, J. (2013). Big data for all: Privacy and user control in the age of analytics. Northwestern Journal of Technology and Intellectual Property, 11(5), 239-273.
- Barocas, S., Hardt, M., & Narayanan, A. (2019). Fairness and machine learning: Limitations and opportunities. MIT Press.
- Norful AA, Rosenfeld A, Schroeder K, Travers JL, Aliyu S. Primary drivers and psychological manifestations of stress in frontline healthcare workforce during the initial COVID-19 outbreak in the United States. General hospital psychiatry. 2021 Mar 1;69:20-6.
- Pratap P, Dickson A, Love M, Zanoni J, Donato C, Flynn MA, Schulte PA. Public health impacts of underemployment and unemployment in the United States: Exploring perceptions, gaps and opportunities. International journal of environmental research and public health. 2021 Sep 23;18(19):10021.
- Matisāne L, Paegle L, Akūlova L, Vanadziņš I. Challenges for workplace risk assessment in home offices—results from a qualitative descriptive study on working life during the first wave of the COVID-19 pandemic in Latvia. International journal of environmental research and public health. 2021 Oct 16;18(20):10876.
- Awada M, Becerik-Gerber B, Hoque S, O'Neill Z, Pedrielli G, Wen J, Wu T. Ten questions concerning occupant health in buildings during normal operations and extreme events including the COVID-19 pandemic. Building and Environment. 2021 Jan 15;188:107480.
- Vetter C, Pattison PM, Houser K, Herf M, Phillips AJ, Wright KP, Skene DJ, Brainard GC, Boivin DB, Glickman G. A review of human physiological responses to light: implications for the development of integrative lighting solutions. Leukos. 2022 Jul 3;18(3):387-414.
- Lei M, Alam GM, Hassan A. Job burnout amongst university administrative staff members in China—a perspective on sustainable development goals (SDGs). Sustainability. 2023.
- Dima G, Meseșan Schmitz L, Șimon MC. Job stress and burnout among social workers in the VUCA world of COVID-19 pandemic. Sustainability. 2021.
- Francisco J, Cruz J, Cruz K, Resurreccion L, Lopez L, Torculas A, Gumpal M, Guillermo N, Tus J. The Job Burnout And Its Impact on the Employee's Performance Amidst the COVID-19 Pandemic. Psychology and Education: A Multidisciplinary Journal. 2022;2(2):155-66.
- Mahmoud AB, Hack-Polay D, Reisel WD, Fuxman L, Grigoriou N, Mohr I, Aizouk R. Who’s more vulnerable? A generational investigation of COVID-19 perceptions’ effect on Organisational citizenship Behaviours in the MENA region: job insecurity, burnout and job satisfaction as mediators. BMC Public Health. 2021 Dec;21:1-7.
- Merlo KL, Conen KM, Scott B, Jones… K. Burnout in the disaster response workforce: The impact of work roles and the COVID-19 pandemic. Journal of …. 2021.
- Tsakonas K, Badyal S, Takaro T, Buse C. Rapid review of the impacts of climate change on the health system workforce and implications for action. The Journal of Climate Change and Health. 2024 Jul 26:100337.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
