Interesting

AI predicts preterm birth risk with 82% accuracy

Could AI predict preterm births before symptoms arise? A new study finds that machine learning models, especially SVMs, can assess risk with impressive accuracy—offering hope for earlier interventions and better neonatal outcomes.

Study: Predicting preterm birth using machine learning methods. ​​​​​​​Image Credit: StockKK / Shutterstock

In a recent study in the journal Scientific Reports, researchers evaluated the accuracy, precision, and F1-score of several machine learning (ML) models in predicting the likelihood of preterm births in 50 pregnant women. Despite several attempts at unraveling the underlying causes of preterm birth, the multifaceted nature of the condition has made identifying a biological cue for preterm births hitherto impossible.

Given its status as a significant health concern and its strong correlation to adverse neonatal outcomes (mortality and morbidity), this study aims to use ML models to predict preterm risk, thereby allowing for timely interventions in high-risk women. Study findings identified linear support vector machines (SVMs), particularly those with optimized hyperparameters, as the best-performing (accuracy = 82%) out of the several (n = 5) tested.

Background

Preterm births, also called 'premature births,' are when babies are born before 37 weeks of pregnancy. They can be medically severe conditions substantially increasing neonatal complications, including breathing difficulties, feeding difficulties, cerebral palsy, and even neonatal mortality. Unfortunately, preterm births are an increasingly common occurrence, with the World Health Organization (WHO) estimating that 1 in every 10 babies is born premature (WHO 2020).

While decades of research have elucidated some of the underlying causes of preterm birth, including maternal smoking, alcohol consumption, stress, pollution exposure, and, most recently, genetics, the complex interplay between these factors has resulted in a lack of a single, definitive risk determinant of the condition. Consequently, clinicians today rely on risk evaluation models to determine the likelihood of preterm birth and administer timely interventions and care.

Machine learning models (ML) are witnessing unprecedented use in clinical decision support systems, including risk determination. Their ability to detect patterns invisible to traditional statistics and leverage a wide range of input data types (transvaginal ultrasound, electronic health records (EHRs), and electrohysterogram signals) makes them increasingly valuable in preventive medicine. While ML models have been studied before for preterm birth prediction, the present study focuses on identifying the most effective models and improving their predictive accuracy through hyperparameter tuning.

About the Study

The present study aims to identify the best-performing ML algorithms in determining preterm risk by leveraging a cohort of 50 women (28 cases and 22 controls) to assess their accuracy metrics. Participant data was obtained from pregnant women admitted to Dr. Antoni Biziel University Hospital in Bydgoszcz, Poland. Study data included detailed medical examinations (health evaluations, gynecological assessments, and blood tests) and medical questionnaires (participants' medical history, current medication, and other clinically relevant details).

This study evaluated several cutting-edge ML algorithms, including XGBoost, logistic regression, CatBoost, decision trees, and support vector machines (SVMs). To maximize the algorithms' F1 scores (and thereby performance), models were subjected to hyperparameter optimization using the Optuna framework. The study then assessed model-specific performance across four main metrics: accuracy, recall, precision, and F1 Score.

To establish statistical significance and differentiate performance between models, chi-squared tests and Welch’s unpaired t-tests were employed. Finally, the best-performing models were subjected to feature performance analysis to help identify participant traits that contributed most to model accuracy, thereby hinting at clinically relevant symptoms that could be used to predict preterm births in future investigations.

Study Findings

The study identified the linear SVM (with optimized hyperparameters) as the best-performing model, achieving 82% accuracy, 86% recall, 83% precision, and an 84% overall F1 score. The linear SVM was followed closely by the logistic regression model (also with optimized hyperparameters), which achieved comparable performance with an 80% accuracy, 82% recall, 82% precision, and 82% overall F1-score. Notably, both of these models are objectively relatively simple algorithms.

More complex algorithms, such as XGBoost and CatBoost, performed more poorly than expected, potentially due to the small dataset size (n = 50), which limited their ability to generalize effectively. The study suggests that these models may have been too complex for the available dataset, leading to inefficiencies in learning from the given features. Elementary models (e.g., random forests and decision trees) also underperformed, not only due to dataset size limitations but also because of their difficulty in handling the large number of features supplied in the study.

Feature performance analysis revealed that in addition to C-reactive protein (CRP) from blood morphology parameters and parity (the number of previous childbirths), hematocrit (HCT) and platelet count (PLT) were also significant predictors of preterm birth. Notably, education level was also identified as a statistically significant factor, suggesting that socioeconomic factors play a role in preterm birth risk. These findings indicate that factors related to inflammation and blood composition play an important role in preterm risk assessment.

"Collectively, these findings suggest that preterm birth is driven by a multifactorial interplay of physiological, socioeconomic, and behavioral factors. The findings highlight the need for integrated care approaches that address both biological and social determinants in pregnancy."

Conclusions

The present study identified linear SVMs as the ML model with the highest accuracy, precision, recall, and overall F1 score among the five models evaluated. Alongside logistic regression (the second-best performer), this model highlights that optimal algorithmic complexity plays a critical role in preterm birth prediction, as models that were either too simple or too complex tended to underperform.

Despite the study's limited sample size (n = 50 participants), which significantly influenced model performance, the findings are promising. However, the researchers caution that larger-scale studies are necessary to validate the models' generalizability. Future research should focus on collecting larger, more diverse datasets and including earlier-stage pregnancy screening to enhance predictive accuracy.

"The results of this study have the potential to inform the development of interventions aimed at reducing the incidence of preterm birth… prospective studies should be designed to explore the real-world applicability of the identified model in clinical settings, where its predictive power could aid in early risk identification and intervention strategies for preterm birth."

Journal reference:
  • Kloska, A., Harmoza, A., Kloska, S. M., & Marciniak, T. (2025). Predicting preterm birth using machine learning methods. Scientific Reports, 15(1), 1-8. DOI:10.1038/s41598-025-89905-1, https://www.nature.com/articles/s41598-025-89905-1


Source: http://www.news-medical.net/news/20250217/AI-predicts-preterm-birth-risk-with-8225-accuracy.aspx

Inline Feedbacks
View all comments
guest

Fewer than 20% of women screened for cardiovascular risk after pregnancy complications

Less than one in five patients are tested for cardiovascular risk factors following pregnancy-related hypertension or diabetes, according...

Study suggests sun exposure during first year of life may reduce MS relapses

Getting at least 30 minutes of daily summer sun in the first year of life may mean a...

Republican states claim zero abortions. A red-state doctor calls that ‘ludicrous.’

In Arkansas, state health officials announced a stunning statistic for 2023: The total number of abortions in the...

Lack of regulation in sperm donation sparks concerns in Africa

Poor regulation and lack of transparency in Africa's fertility industry leave sperm donors and recipients vulnerable to exploitation...

Air pollution in late pregnancy linked to higher NICU admissions for newborns

Exposure to traffic-related pollutants like NO₂ and PM2.5 in the final month of pregnancy increases the risk of...

Community health workers – the unsung heroines

Reproductive Health has published a new supplement titled ‘Building community-level resilience for the care of women with pre-eclampsia’....

Blood test can predict preterm preeclampsia with 80% accuracy

A new blood test has an 80% accuracy in predicting preterm preeclampsia, according to a study published today, Feb. 12,...

Survey shows increased use of fertility apps after Dobbs decision

The use of fertility-tracking technology increased in some states after the U.S. Supreme Court overturned Roe v. Wade...

Groundbreaking malaria vaccine provides high-level protection with just one dose

Scientists at Sanaria and Seattle Children's Research Institute's Center for Global Infectious Disease Research (CGIDR) have unveiled a...

Postpartum depression research could lead to blood test for at-risk women

New postpartum depression research from the University of Virginia School of Medicine and Weill Cornell Medicine could lead...

AZoNetwork honors women in STEM on the 10th anniversary of IDWGS

As a network of websites with a truly global audience, AZoNetwork is joining the global effort to close...

Women prefer female cardiologists for better heart care

According to the U.S. Physician Workforce Data Dashboard, only about 17% of cardiologists are women, ranking as one...

Our Editor of the year: José Belizán from Reproductive Health

Every year, both BMC-series Section Editors and Editors of our society and proprietary titles are nominated by BioMed...

Babies develop food preferences in the womb, study suggests

Babies show positive responses to the smell of foods they were exposed to in the womb after they...

Climate change may increase the risk of prolonged pregnancy

New Curtin University research has found exposure to outdoor air pollution and extreme temperatures during pregnancy may increase...

Vitamin E supplementation may reduce food allergy development in newborns

New research found that supplementing maternal diet with α-tocopherol, a form of vitamin E, can reduce the development...

Нейробиология материнства: как беременность меняет мозг женщины

Беременность — это не только период физических изменений, но и глубокая перестройка работы мозга. Последние исследования в области...

Male Reproductive Health: How Lifestyle Affects Sperm Quality and Fertility

Male reproductive health has emerged as a critical component in understanding fertility challenges facing modern couples. While historically,...

Air pollution exposure in late pregnancy linked to higher NICU admissions

Air pollution caused by auto emissions, wildfires and other sources is problematic for many people. It's of particular...

New review maps the impact of reproductive hormones on neurological health

A comprehensive review published today in Brain Medicine by leading neuroendocrinologist Professor Hyman M. Schipper from McGill University's...