Enhanced Malignancy Prediction of Small Lung Nodules in Different Populations Using Transfer Learning on Low-Dose Computed Tomography

Jyun-Ru Chen; Kuei-Yuan Hou; Yung-Chen Wang; Sen-Ping Lin; Yuan-Heng Mo; Shih-Chieh Peng; Chia-Feng Lu

doi:10.3390/diagnostics15121460

Diagnostics (Basel). 2025 Jun 8;15(12):1460. doi: 10.3390/diagnostics15121460.

Authors

Jyun-Ru Chen¹, Kuei-Yuan Hou²,³, Yung-Chen Wang²,⁴, Sen-Ping Lin², Yuan-Heng Mo²,⁴, Shih-Chieh Peng², Chia-Feng Lu¹

Affiliations

¹ Department of Biomedical Imaging and Radiological Sciences, National Yang Ming Chiao Tung University, Taipei 112, Taiwan.
² Department of Radiology, Cathay General Hospital, Taipei 106, Taiwan.
³ Department of Biomedical Imaging and Radiological Sciences, Chung Shan Medical University, Taichung 402, Taiwan.
⁴ Department of Medicine, School of Medicine, Fu Jen Catholic University, Taipei 242, Taiwan.

Abstract

Background: Predicting malignancy in small lung nodules (SLNs) across diverse populations is challenging due to significant demographic and clinical variations. This study investigates whether transfer learning (TL) can improve malignancy prediction for SLNs using low-dose computed tomography across datasets from different countries.

Methods: We collected two datasets: an Asian dataset (669 SLNs from Cathay General Hospital, CGH, Taiwan) and an American dataset (600 SLNs from the National Lung Screening Trial, NLST, America). Initial U-Net models for malignancy prediction were trained on each dataset, followed by the application of TL to transfer model parameters across datasets. Model performance was evaluated using accuracy, specificity, sensitivity, and the area under the receiver operating characteristic curve (AUC).

Results: Significant demographic differences (p < 0.001) were observed between the CGH and NLST datasets. Initial models trained on one dataset showed a substantial performance decline of 15.2% to 97.9% when applied to the other dataset. TL enhanced model performance across datasets by 21.1% to 159.5% (p < 0.001), achieving an accuracy of 0.86-0.91, sensitivity of 0.81-0.96, specificity of 0.89-0.92, and an AUC of 0.90-0.97.

Conclusions: TL enhances SLN malignancy prediction models by addressing population variations and enabling their application across diverse international datasets.

Keywords: deep learning; malignancy prediction; population variations; small lung nodule; transfer learning.
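
The four evaluation metrics named in the abstract (accuracy, sensitivity, specificity, AUC) can be illustrated with a minimal Python sketch. It is not the authors' code. The function names, the 0.5 decision threshold, and the toy labels and scores are assumptions made for illustration, and `relative_change` reflects only one plausible reading of the reported percentage declines and gains (change relative to the starting value).

```python
from typing import Dict, Sequence


def auc_score(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUC as the probability that a malignant case outscores a benign one."""
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    # Pairwise comparison; ties count as half a win (Mann-Whitney U statistic).
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def binary_metrics(
    y_true: Sequence[int], y_score: Sequence[float], threshold: float = 0.5
) -> Dict[str, float]:
    """Accuracy, sensitivity, specificity, and AUC for labels 1 (malignant) / 0 (benign).

    The 0.5 threshold is an assumption; the abstract does not state one.
    """
    auc = auc_score(y_true, y_score)  # also validates that both classes are present
    y_pred = [1 if s >= threshold else 0 for s in y_score]
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
        "auc": auc,
    }


def relative_change(before: float, after: float) -> float:
    """Percent change relative to `before` (assumed meaning of the reported percentages)."""
    return (after - before) / before * 100.0


# Toy data, invented for illustration only (not from the CGH or NLST datasets).
labels = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.1, 0.7]
metrics = binary_metrics(labels, scores)
```

On this toy data, `metrics` holds an accuracy, sensitivity, and specificity of 0.75 each and an AUC of 0.9375. Unlike the other three metrics, AUC does not depend on the threshold, which is one reason it is commonly reported alongside them.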

Grants and funding