Deep Learning for the Early Diagnosis of Candidemia

Daniele Roberto Giacobbe; Sabrina Guastavino; Anna Razzetta; Cristina Marelli; Sara Mora; Chiara Russo; Giorgia Brucci; Alessandro Limongelli; Antonio Vena; Malgorzata Mikulska; Alessio Signori; Antonio Di Biagio; Anna Marchese; Ylenia Murgia; Marco Muccio; Nicola Rosso; Michele Piana; Mauro Giacomini; Cristina Campi; Matteo Bassetti

doi:10.1007/s40121-025-01171-w

Deep Learning for the Early Diagnosis of Candidemia

Infect Dis Ther. 2025 Jun 23. doi: 10.1007/s40121-025-01171-w. Online ahead of print.

Authors

Daniele Roberto Giacobbe^#^{1

2}, Sabrina Guastavino^#³, Anna Razzetta⁴, Cristina Marelli^{5

6}, Sara Mora⁷, Chiara Russo^{8

9}, Giorgia Brucci^{8

9}, Alessandro Limongelli^{8

9}, Antonio Vena^{8

9}, Malgorzata Mikulska^{8

9}, Alessio Signori¹⁰, Antonio Di Biagio^{8

9}, Anna Marchese^{11

12}, Ylenia Murgia¹³, Marco Muccio⁹, Nicola Rosso⁷, Michele Piana^{3

14}, Mauro Giacomini¹³, Cristina Campi^#^{3

14}, Matteo Bassetti^#^{8

9}

Affiliations

¹ Department of Health Sciences (DISSAL), University of Genoa, Via A. Pastore 1, 16132, Genoa, Italy. danieleroberto.giacobbe@unige.it.
² Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy. danieleroberto.giacobbe@unige.it.
³ Department of Mathematics (DIMA), University of Genoa, Genoa, Italy.
⁴ School of Mathematics, University of Genoa, Genoa, Italy.
⁵ Oncostat, CESP, Inserm U1018, Université Paris-Saclay, Labeled Ligue Contre le Cancer, Gustave Roussy, Villejuif, France.
⁶ Institut Curie - INSERM U1331, Team statistics applied to personalized medicine, Paris, France.
⁷ UO Information and Communication Technologies, IRCCS Ospedale Policlinico San Martino, Genoa, Italy.
⁸ Department of Health Sciences (DISSAL), University of Genoa, Via A. Pastore 1, 16132, Genoa, Italy.
⁹ Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy.
¹⁰ Section of Biostatistics, Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy.
¹¹ Department of Surgical Sciences and Integrated Diagnostics (DISC), University of Genoa, Genoa, Italy.
¹² Microbiology Unit, IRCCS Ospedale Policlinico San Martino, Genoa, Italy.
¹³ Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy.
¹⁴ Life Science Computational Laboratory (LISCOMP), IRCCS Ospedale Policlinico San Martino, Genoa, Italy.

^# Contributed equally.

PMID: 40549343
DOI: 10.1007/s40121-025-01171-w

Abstract

Introduction: Candidemia carries a heavy burden in terms of mortality, especially when presenting as septic shock, and its early diagnosis remains crucial.

Methods: We assessed the performance of a deep learning model for the early differential diagnosis between candidemia and bacteremia. The model was trained on a large dataset of automatically extracted laboratory features.

Results: A total of 12,483 episodes of candidemia (1275; 10%) or bacteremia (11,208; 90%) were included. For recognizing candidemia, a deep learning model showed sensitivity 0.80, specificity 0.59, positive predictive value (PPV) 0.18, weighted PPV (wPPV) 0.88, and negative predictive value (NPV) 0.96 on the training set (area under the curve [AUC] 0.69), and sensitivity 0.70, specificity 0.58, PPV 0.16, wPPV 0.87, and NPV 0.95 on the test set (AUC 0.64). Then, the learned discriminatory ability was tested in the subgroup of patients with available serum β-D-glucan (BDG) and procalcitonin (PCT) values to explore additive or synergistic effects with these more specific markers. Both feature selection and transfer learning did not improve the diagnostic performance of a model based on BDG and PCT only.

Conclusions: A deep learning model trained on nonspecific laboratory features showed some discriminatory ability to differentiate candidemia from bacteremia, highlighting the ability of deep learning to exploit complex patterns within nonspecific laboratory data. However, the learned patterns did not improve the diagnostic performance of more specific markers. Further exploration of candidemia prediction using laboratory features through machine learning techniques remains a promising area of research, serving as a valuable complement to the development of large-scale models that also incorporate clinical features.

Keywords: Candida; Artificial intelligence; Biomarker; Machine learning; Neural networks.

Grants and funding

Pfizer Global Medical Grants (GMG) for General Research [Project Tracking Number 69511763]/Pfizer