Objective: To construct a diagnostic and predictive model for chronic obstructive pulmonary disease complicated with pulmonary hypertension (COPD-PH) and evaluate its effect. Methods: A total of 1 514 COPD patients treated in 5 hospitals from January 1, 2014 to December 31, 2019 were retrospectively collected and divided into training cohort (1 072 cases) and validation cohort (442 cases) according to the ratio of 7∶3 according to the inclusion time. Data including demographic data, smoking status, history of disease, and clinical examination were collected through patient medical records and electronic medical record systems. Multivariate logistic regression models were used to explore the related factors of COPD-PH, and the nomogram model was constructed using the "rms" program package. The calibration curve was used to evaluate the consistency between the prediction probability of the model and the actual results. The C index and the area under the receiver operating characteristic curve (ROC) were used to evaluate the discrimination of the model. The decision curve analysis (DCA) was used to evaluate the clinical practicability of the model. Results: In the training cohort, 3.7%, 15.2% and 81.1% were aged 50-59, 60-69 and ≥70 years, respectively, which were significantly different from the age composition of the validation cohort (7.9%, 27.8% and 64.3%, respectively) (P=0.041). There was no significant difference between the training cohort (79.4%) and the validation cohort (84.6%) (P=0.243). Multivariate logistic regression analysis of the training cohort showed that age ≥70 years [OR (95%CI): 3.32 (1.49-7.36)] and smoking status [former (current) smoking, OR (95%CI)] were 3.67 (2.51-5.37) and 2.04 (1.44-2.90), respectively], NT-probNP≥1 400 ng/L[OR (95%CI): 9.88 (6.23-15.66)], right atrial diameter [OR (95%CI): 1.11 (1.07-1.15)] was COPD-related factors of PH, based on the above factors-PH nomogram COPD model was set up and develop for online tools (https://ph-666.shinyapps.io/zhonghua-PH/). The calibrated C index (95%CI) of the training cohort and the validation cohort were 0.82 (0.77-0.87) and 0.77 (0.68-0.86), respectively. The calibration curve was close to the diagonal in both the training cohort and the validation cohort. The AUC (95%CI) of the nomogram model was 0.82 (0.80-0.85) in the training cohort and 0.77 (0.73-0.82) in the validation cohort. ROC curve showed that the optimal threshold in the training cohort was 0.60, and the sensitivity and specificity under this value were 0.74 and 0.78, respectively; the optimal threshold for the validation cohort was 0.70, and the sensitivity and specificity under this value were 0.76 and 0.65, respectively. DCA analysis showed that the nomogram model provided better net benefits than the all-variable selection and no-variable selection strategies with threshold probabilities greater than 15.0% and 13.0% in the training and validation cohorts, respectively. Conclusions: The nomogram model for the diagnosis and prediction of COPD-PH is simple and accurate, which has a good clinical application prospect.
目的: 构建慢性阻塞性肺疾病(COPD)合并肺动脉高压(PH)诊断预测模型并评估其效果。 方法: 回顾性收集2014年1月1日至2019年12月31日于5家医院就诊的1 514例COPD患者,根据纳入时间排序,将患者按7∶3的比例分为训练集(1 072例)和验证集(442例)。通过患者病历和电子病历系统收集相关资料,包括人口学资料、吸烟情况、疾病既往史和临床检查资料。采用多因素logistic回归模型分析COPD合并PH的相关因素,采用“rms”程序包构建列线图模型。采用Calibration校准曲线评估模型预测概率与实际结果的一致性,采用C 指数、受试者工作特征(ROC)曲线下面积(AUC)评价模型的区分度,采用决策分析曲线(DCA)评估模型的临床实用性。 结果: 训练集年龄为50~59、60~69和≥70岁者分别占3.7%、15.2%和81.1%,与验证集的年龄构成(分别占7.9%、27.8%和64.3%)差异有统计学意义(P=0.041)。训练集男性占79.4%,与验证集(84.6%)差异无统计学意义(P=0.243)。对训练集的多因素logistic回归模型分析显示:年龄≥70岁[OR(95%CI):3.32(1.49~7.36)]、吸烟状态[曾经、目前吸烟OR(95%CI)分别为3.67(2.51~5.37)和2.04(1.44~2.90)]、N末端B型利钠肽前体(NT-proBNP)≥1 400 ng/L[OR(95%CI):9.88(6.23~15.66)]、右心房直径[OR(95%CI):1.11(1.07~1.15)]是COPD合并PH的相关因素,基于以上因素建立COPD合并PH列线图模型并开发为在线工具(https://ph-666.shinyapps.io/zhonghua-PH/)。训练集和验证集的偏差校正后C指数(95%CI)分别为0.82(0.77~0.87)和0.77(0.68~0.86),Calibration校准曲线在训练集与验证集中均接近于对角线。训练集和验证集列线图模型AUC(95%CI)分别为0.82(0.80~0.85)和0.77(0.73~0.82)。通过ROC曲线得出训练集中最佳阈值为0.60,该值下的灵敏度、特异度分别为0.74、0.78;验证集最佳阈值为0.70,该值下的灵敏度和特异度分别为0.76和0.65。DCA分析显示:列线图模型分别在训练集和验证集中以阈概率>15.0%和>13.0%提供了优于变量全选和变量均不选策略的净效益。 结论: 建立COPD合并PH诊断预测列线图模型简便、准确,具有良好的临床应用前景。.