Methodological conduct and risk of bias in studies on prenatal birthweight prediction models using machine learning techniques: a systematic review

Jing Gao; Yujun Yao; Jingdong Xue; Ruiyao Chen; XingYu Yang; Jie Xu; Weiwei Cheng

doi:10.1186/s12884-025-07727-5

BMC Pregnancy Childbirth. 2025 Jul 2;25(1):696. doi: 10.1186/s12884-025-07727-5.

Authors

Jing Gao^#^{1,2,3}, Yujun Yao^#⁴, Jingdong Xue^#⁵, Ruiyao Chen⁴, XingYu Yang^{1,2,3}, Jie Xu⁶, Weiwei Cheng^{7,8,9}

Affiliations

¹ International Peace Maternity and Child Health Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200030, China.
² Shanghai Key Laboratory of Embryo Original Disease, Shanghai, 200040, China.
³ Shanghai Municipal Key Clinical Specialty, Shanghai, 200030, China.
⁴ Shanghai Artificial Intelligence Laboratory, Shanghai, 200030, China.
⁵ Department of Urology, School of Medicine, Tongji Hospital, Tongji University, Shanghai, 200030, China.
⁶ Shanghai Artificial Intelligence Laboratory, Shanghai, 200030, China. xujie@pjlab.org.cn.
⁷ International Peace Maternity and Child Health Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200030, China. wwcheng29@shsmu.edu.cn.
⁸ Shanghai Key Laboratory of Embryo Original Disease, Shanghai, 200040, China. wwcheng29@shsmu.edu.cn.
⁹ Shanghai Municipal Key Clinical Specialty, Shanghai, 200030, China. wwcheng29@shsmu.edu.cn.

^# Contributed equally.

PMID: 40604554
DOI: 10.1186/s12884-025-07727-5

Abstract

Objective: To assess the methodological quality and risk of bias of studies that developed prediction models using machine learning (ML) techniques to estimate prenatal birthweight.

Study design and methods: We conducted a systematic review, searching the PubMed database between 01/01/2018 and 01/08/2022 for studies that developed fetal weight prediction models using ML. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement to assess the reporting quality of the included publications and the Prediction Model Risk of Bias Assessment Tool (PROBAST) to assess their risk of bias. We measured overall adherence to the TRIPOD reporting checklist, analyzed the methodological quality of each study in detail, and examined risk of bias in the four PROBAST domains: participants, predictors, outcome, and analysis.

Results: Fourteen studies were included. Adherence to the TRIPOD reporting items ranged from 34.62% to 80.77%, with a median of 63.19%. Methodological rigor varied substantially across studies, with a particularly high risk of bias in the selection of participants and predictors. Issues related to missing data, sample size adequacy, performance evaluation, and model validation were prominent across studies. Several studies showed limited model transparency and reproducibility.
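As an illustration of how a per-study adherence percentage and its median can be computed, the following is a minimal sketch and not the authors' scoring procedure. The study names and item scores are hypothetical. The denominator of 26 items is an assumption, chosen because 9/26 and 21/26 reproduce the reported extremes of 34.62% and 80.77%. Excluding not-applicable items from the denominator is also an assumption.

```python
from statistics import median

def adherence_pct(item_scores):
    """Return the percentage of applicable checklist items that were reported.

    item_scores: list of 1 (reported), 0 (not reported), or None (not applicable).
    Not-applicable items are excluded from the denominator (an assumption).
    """
    applicable = [s for s in item_scores if s is not None]
    if not applicable:
        raise ValueError("no applicable items to score")
    return 100.0 * sum(applicable) / len(applicable)

# Hypothetical item-level scores; these are NOT data from the review.
studies = {
    "study_A": [1] * 9 + [0] * 17,                # 9 of 26 items reported
    "study_B": [1] * 21 + [0] * 5,                # 21 of 26 items reported
    "study_C": [1] * 16 + [0] * 8 + [None] * 2,   # 16 of 24 applicable items
}

# Per-study adherence, then the median across studies.
rates = {name: adherence_pct(scores) for name, scores in studies.items()}
overall_median = median(rates.values())

for name, rate in rates.items():
    print(f"{name}: {rate:.2f}%")
print(f"median adherence: {overall_median:.2f}%")
```

With an even number of studies, as in the fourteen reviewed here, `statistics.median` returns the mean of the two middle values.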

Conclusion: The methodological quality of ML-based prediction models for prenatal birthweight estimation was generally poor, with most studies at high risk of bias. The design and reporting of these studies urgently need improvement. Adaptations of TRIPOD and PROBAST tailored to ML models should be promoted to enhance transparency and reproducibility, which would facilitate the wider clinical application of ML-based prediction models and reduce research waste.

Keywords: Fetal weight; Machine learning; Methodological quality; Prediction model; Risk of bias; Systematic review.

Publication types

Systematic Review

MeSH terms

Bias
Birth Weight*
Female
Fetal Weight
Humans
Infant, Newborn
Machine Learning*
Pregnancy
Reproducibility of Results
Research Design* / standards

Grants and funding

Project No. 20244Y0133, scientific research project of the Shanghai Municipal Health Commission