Alzheimer’s disease (AD) is a common and serious form of elderly dementia, but early detection and treatment of mild cognitive impairment can help slow down the progression of dementia. Recent studies have shown that there is a relationship between overall cognitive function and motor function and gait abnormalities. We recruited 302 cases from the Rehabilitation Hospital Affiliated to National Rehabilitation Aids Research Center and included 193 of them according to the screening criteria, including 137 patients with MCI and 56 healthy controls (HC). The gait parameters of the participants were collected during performing single-task (free walking) and dual-task (counting backwards from 100) using a wearable device. By taking gait parameters such as gait cycle, kinematics parameters, time-space parameters as the focus of the study, using recursive feature elimination (RFE) to select important features, and taking the subject’s MoCA score as the response variable, a machine learning model based on quantitative evaluation of cognitive level of gait features was established. The results showed that temporal and spatial parameters of toe-off and heel strike had important clinical significance as markers to evaluate cognitive level, indicating important clinical application value in preventing or delaying the occurrence of AD in the future.
Objective To systematically review prediction models of small for gestational age (SGA) based on machine learning and provide references for the construction and optimization of such a prediction model. Methods The PubMed, EMbase, Web of Science, CBM, WanFang Data, VIP and CNKI databases were electronically searched to collect studies on SGA prediction models from database inception to August 10, 2022. Two researchers independently screened the literature, extracted data, evaluated the risk of bias of the included studies, and conducted a systematic review. Results A total of 14 studies, comprising 40 prediction models constructed using 19 methods, such as logical regression and random forest, were included. The results of the risk of bias assessment from 13 studies were high; the area under the curve of the prediction models ranged from 0.561 to 0.953. Conclusion The overall risk of bias in the prediction models for SGA was high, and the predictive performance was average. Models built using extreme gradient boosting (XGBoost) demonstrated the best predictive performance across different studies. The stacking method can improve predictive performance by integrating different models. Finally, maternal blood pressure, fetal abdominal circumference, head circumference, and estimated fetal weight were important predictors of SGA.
Sleep apnea causes cardiac arrest, sleep rhythm disorders, nocturnal hypoxia and abnormal blood pressure fluctuations in patients, which eventually lead to nocturnal target organ damage in hypertensive patients. The incidence of obstructive sleep apnea hypopnea syndrome (OSAHS) is extremely high, which seriously affects the physical and mental health of patients. This study attempts to extract features associated with OSAHS from 24-hour ambulatory blood pressure data and identify OSAHS by machine learning models for the differential diagnosis of this disease. The study data were obtained from ambulatory blood pressure examination data of 339 patients collected in outpatient clinics of the Chinese PLA General Hospital from December 2018 to December 2019, including 115 patients with OSAHS diagnosed by polysomnography (PSG) and 224 patients with non-OSAHS. Based on the characteristics of clinical changes of blood pressure in OSAHS patients, feature extraction rules were defined and algorithms were developed to extract features, while logistic regression and lightGBM models were then used to classify and predict the disease. The results showed that the identification accuracy of the lightGBM model trained in this study was 80.0%, precision was 82.9%, recall was 72.5%, and the area under the working characteristic curve (AUC) of the subjects was 0.906. The defined ambulatory blood pressure features could be effectively used for identifying OSAHS. This study provides a new idea and method for OSAHS screening.
Emotion recognition refers to the process of determining and identifying an individual's current emotional state by analyzing various signals such as voice, facial expressions, and physiological indicators etc. Using electroencephalogram (EEG) signals and virtual reality (VR) technology for emotion recognition research helps to better understand human emotional changes, enabling applications in areas such as psychological therapy, education, and training to enhance people’s quality of life. However, there is a lack of comprehensive review literature summarizing the combined researches of EEG signals and VR environments for emotion recognition. Therefore, this paper summarizes and synthesizes relevant research from the past five years. Firstly, it introduces the relevant theories of VR and EEG signal emotion recognition. Secondly, it focuses on the analysis of emotion induction, feature extraction, and classification methods in emotion recognition using EEG signals within VR environments. The article concludes by summarizing the research’s application directions and providing an outlook on future development trends, aiming to serve as a reference for researchers in related fields.
Machine learning-based diagnostic tests have certain differences of measurement indicators with traditional diagnostic tests. In this paper, we elaborate the definitions, calculation methods and statistical inferences of common measurement indicators of machine learning-based diagnosis models in detail. We hope that this paper will be helpful for clinical researchers to better evaluate machine learning diagnostic models.
ObjectiveTo establish a predictive model of surgical site infection (SSI) following colorectal surgery using machine learning.MethodsMachine learning algorithm was used to analyze and model with the colorectal data set from Duke Infection Control Outreach Network Surveillance Network. The whole data set was divided into two parts, with 80% as the training data set and 20% as the testing data set. In order to improve the training effect, the whole data set was divided into two parts again, with 90% as the training data set and 10% as the testing data set. The predictive result of the model was compared with the actual infected cases, and the sensitivity, specificity, positive predictive value, and negative predictive value of the model were calculated, the area under receiver operating characteristic (ROC) curve was used to evaluate the predictive capacity of the model, odds ratio (OR) was calculated to tested the validity of evaluation with a significance level of 0.05.ResultsThere were 7 285 patients in the whole data set registered from January 15th, 2015 to June 16th, 2016, among whom 234 were SSI cases, with an incidence of SSI of 3.21%. The predictive model was established by random forest algorithm, which was trained by 90% of the whole data set and tested by 10% of that. The sensitivity, specificity, positive predictive value, and negative predictive value of the model were 76.9%, 59.2%, 3.3%, and 99.3%, respectively, and the area under ROC curve was 0.767 [OR=4.84, 95% confidence interval (1.32, 17.74), P=0.02].ConclusionThe predictive model of SSI following colorectal surgery established by random forest algorithm has the potential to realize semi-automatic monitoring of SSIs, but more data training should be needed to improve the predictive capacity of the model before clinical application.
Systematic reviews can provide important evidence support for clinical practice and health decision-making. In this process, literature screening and data extraction are extensively time-consuming procedures. Natural language processing (NLP), as one of the research directions of computer science and artificial intelligence, can accelerate the process of literature screening and data extraction in systematic reviews. This paper introduced the requirements of systematic reviews for rapid literature screening and data extraction, the development of NLP and types of machine learning; and systematically collated the NLP tools for the title and abstract screening, full-text screening and data extraction in systematic reviews; and discussed the problems in the application of NLP tools in the field of systematic reviews and proposed a prospect for its future development.
Myocardial infarction (MI) has the characteristics of high mortality rate, strong suddenness and invisibility. There are problems such as the delayed diagnosis, misdiagnosis and missed diagnosis in clinical practice. Electrocardiogram (ECG) examination is the simplest and fastest way to diagnose MI. The research on MI intelligent auxiliary diagnosis based on ECG is of great significance. On the basis of the pathophysiological mechanism of MI and characteristic changes in ECG, feature point extraction and morphology recognition of ECG, along with intelligent auxiliary diagnosis method of MI based on machine learning and deep learning are all summarized. The models, datasets, the number of ECG, the number of leads, input modes, evaluation methods and effects of different methods are compared. Finally, future research directions and development trends are pointed out, including data enhancement of MI, feature points and dynamic features extraction of ECG, the generalization and clinical interpretability of models, which are expected to provide references for researchers in related fields of MI intelligent auxiliary diagnosis.
Motor imagery (MI) is a mental process that can be recognized by electroencephalography (EEG) without actual movement. It has significant research value and application potential in the field of brain-computer interface (BCI) technology. To address the challenges posed by the non-stationary nature and low signal-to-noise ratio of MI-EEG signals, this study proposed a Riemannian spatial filtering and domain adaptation (RSFDA) method for improving the accuracy and efficiency of cross-session MI-BCI classification tasks. The approach addressed the issue of inconsistent data distribution between source and target domains through a multi-module collaborative framework, which enhanced the generalization capability of cross-session MI-EEG classification models. Comparative experiments were conducted on three public datasets to evaluate RSFDA against eight existing methods in terms of classification accuracy and computational efficiency. The experimental results demonstrated that RSFDA achieved an average classification accuracy of 79.37%, outperforming the state-of-the-art deep learning method Tensor-CSPNet (76.46%) by 2.91% (P < 0.01). Furthermore, the proposed method showed significantly lower computational costs, requiring only approximately 3 minutes of average training time compared to Tensor-CSPNet’s 25 minutes, representing a reduction of 22 minutes. These findings indicate that the RSFDA method demonstrates superior performance in cross-session MI-EEG classification tasks by effectively balancing accuracy and efficiency. However, its applicability in complex transfer learning scenarios remains to be further investigated.
Objective To establish a machine learning-based risk prediction model of combined chronic obstructive pulmonary disease (COPD) with lung cancer, so as to explore the high risk factors for COPD patients with lung cancer and to lay the foundation for early detection of lung cancer risk in COPD patients. Methods A total of 154 patients from the Second Hospital of Dalian Medical University from 2010 to 2021 were retrospectively analyzed, including 99 patients in the COPD group and 55 patients in the COPD with lung cancer group. the chest high resolution computed tomography (HRCT) scans and pulmonary function test of each patient were acquired. The main analyses were as follow: (1) to valid the statistically differences of the basic information (such as age, body mass index, smoking index), laboratory test results, pulmonary function parameters and quantitative parameters of chest HRCT between the two groups; (2) to analyze the indicators of high risk factors for lung cancer in COPD patients using univariate and binary logistic regression (LR) methods; and (3) to establish the machine learning model (such as LR and Gaussian process) for COPD with lung cancer patients. Results Based on the statistical analysis and LR methods, decreased BMI, increased whole lung emphysema index, increased whole lung mean density, and increased percentage activity of exertional spirometry and prothrombin time were risk factors for COPD with lung cancer patients. Based on the machine learning prediction model for COPD with lung cancer patients, the area under the receiver operating characteristic curve for LR and Gaussian process were obtained as 0.88 using the soluble fragments of prothrombin time percentage activity, whole lung emphysema index, whole lung mean density, and forced vital capacity combined with neuron-specific enolase and cytokeratin 19 as features. Conclusion The prediction model of COPD with lung cancer patients using a machine learning approach can be used for early detection of lung cancer risk in COPD patients.