Machine learning for personalized risk assessment of HIV, syphilis, gonorrhoea and chlamydia: A systematic review and meta-analysis

Phyu M Latt; Nyi N Soe; Christopher K Fairley; Eric P F Chow; Cheryl C Johnson; Purvi Shah; Ismail Maatouk; Lei Zhang; Jason J Ong

doi:10.1016/j.ijid.2025.107922

Machine learning for personalized risk assessment of HIV, syphilis, gonorrhoea and chlamydia: A systematic review and meta-analysis

Int J Infect Dis. 2025 Aug:157:107922. doi: 10.1016/j.ijid.2025.107922. Epub 2025 May 6.

Authors

Phyu M Latt¹, Nyi N Soe², Christopher K Fairley³, Eric P F Chow⁴, Cheryl C Johnson⁵, Purvi Shah⁶, Ismail Maatouk⁵, Lei Zhang⁷, Jason J Ong⁸

Affiliations

¹ Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia. Electronic address: phyu.latt@monash.edu.
² Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia.
³ School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia; Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia.
⁴ School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia; Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia.
⁵ Global HIV, Hepatitis and STIs Programmes, World Health Organization, Geneva, Switzerland.
⁶ Global HIV, Hepatitis and STIs Programmes, World Health Organization, Geneva, Switzerland; Regional Support Team, Asia Pacific, UNAIDS, Bangkok, Thailand.
⁷ Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia; Clinical Medical Research Centre, Children's Hospital of Nanjing Medical University, Nanjing, Jiangsu, China.
⁸ School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia; Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Department of Clinical Research, London School of Hygiene and Tropical Medicine, London, UK.

PMID: 40339784
PMCID: PMC12181084 (available on 2025-08-01)
DOI: 10.1016/j.ijid.2025.107922

Abstract

Background: Machine learning (ML) shows promise for sexually transmitted infection (STI) risk prediction, but systematic evidence of its effectiveness remains fragmented.

Methods: We systematically searched six electronic databases, three preprint archives and conference proceedings (January 2010-April 2024). Studies reporting quantitative performance metrics for supervised ML-based STI risk prediction models were included. We used a bivariate random-effects model to estimate pooled sensitivity, specificity and area under the curve (AUC). The risk of bias was assessed using the Prediction model Risk of Bias Assessment Tool. We conducted sequential analyses of studies with complete and reconstructed confusion matrices. Subgroup analyses and meta-regression explored potential sources of heterogeneity.

Results: Among 3877 records screened, 25 studies comprising 45 unique models met inclusion criteria. For HIV, analysis of studies with complete confusion matrices (7 studies, 9 contingency tables) demonstrated summary AUC of 0.91 (95% CI: 0.88-0.93), pooled sensitivity 0.84 (0.76-0.90) and specificity 0.84 (0.70-0.93). Substantial heterogeneity persisted across subgroups (I² > 98%). For other STIs, individual studies reported AUCs ranging from 0.75-0.87 for syphilis (n = 5), 0.73-1.00 for gonorrhoea (n = 6) and 0.67-1.00 for chlamydia (n = 6).

Discussion: While ML models show promising performance, particularly for HIV, significant heterogeneity complicates interpretation. Future research should prioritize external validation, standardized guidelines and multi-centred robust implementation studies to evaluate clinical impact.

Keywords: HIV; Machine learning; Meta-analysis; Risk assessment; Sexually transmitted infections; Systematic review.

Publication types

Systematic Review
Meta-Analysis

MeSH terms

Chlamydia Infections* / epidemiology
Gonorrhea* / epidemiology
HIV Infections* / epidemiology
Humans
Machine Learning*
Risk Assessment / methods
Sensitivity and Specificity
Sexually Transmitted Diseases* / diagnosis
Sexually Transmitted Diseases* / epidemiology
Syphilis* / epidemiology

Grants and funding

001/WHO_/World Health Organization/International