• 1. School of Computer Science (School of Software), Sichuan University, Chengdu, 610065, P. R. China;
  • 2. Department of Cardiovascular Surgery, West China Hospital, Sichuan University, Chengdu, 610041, P. R. China;
  • 3. School of Electronic Information, Sichuan University, Chengdu, 610065, P. R. China;
QIAN Yongjun, Email: qianyongjun@scu.edu.cn; ZHAO Qijun, Email: qjzhao@scu.edu.cn
Export PDF Favorites Scan Get Citation

Objective  To establish a machine learning based framework to rapidly screen out high-risk patients who may develop atrial fibrillation (AF) from patients with valvular heart disease and provide the information related to risk prediction to clinicians as clinical guidance for timely treatment decisions. Methods  Clinical data were retrospectively collected from 1 740 patients with valvular heart disease at West China Hospital of Sichuan University and its branches, including 831 (47.76%) males and 909 (52.24%) females at an average age of 54 years. Based on these data, we built classical logistic regression, three standard machine learning models, and three integrated machine learning models for risk prediction and characterization analysis of AF. We compared the performance of machine learning models with classical logistic regression and selected the best two models, and applied the SHAP algorithm to provide interpretability at the population and single-unit levels. In addition, we provided visualization of feature analysis results. Results The Stack model performed best among all models (AF detection rate 85.6%, F1 score 0.753), while XGBoost outperformed the standard machine learning models (AF detection rate 71.9%, F1 score 0.732), and both models performed significantly better than the logistic regression model (AF detection rate 65.2%, F1 score 0.689). SHAP algorithm showed that left atrial internal diameter, mitral E peak flow velocity (Emv), right atrial internal diameter output per beat, and cardiac function class were the most important features affecting AF prediction. Both the Stack model and XGBoost had excellent predictive ability and interpretability. Conclusion The Stack model has the highest AF detection performance and comprehensive performance. The Stack model loaded with the SHAP algorithm can be used to screen high-risk patients for AF and reveal the corresponding risk characteristics. Our framework can be used to guide clinical intervention and monitoring of AF.

Citation: LEI Nuoyangfan, TONG Qi, ZHANG Yiwen, WANG Zhengjie, LI Tao, PAN Fan, QIAN Yongjun, ZHAO Qijun. Machine learning models for analyzing valvular heart disease combined with atrial fibrillation using electronic health records. Chinese Journal of Clinical Thoracic and Cardiovascular Surgery, 2022, 29(8): 953-962. doi: 10.7507/1007-4848.202204048 Copy

  • Previous Article

    Transcatheter edge-to-edge repair: Operating theories, basic principles, and predictors of prognosis
  • Next Article

    First exploration of postoperative pulmonary complications after transcatheter tricuspid valve replacement and recommendations for rehabilitation: A prospective cohort study