Cheng Luping, Wu Siyang, Lu Bo. Analysis of Risk Factors and Prediction of Type 2 Diabetes Mellitus Based on Machine Learning. 2025. biomedRxiv.202509.00003
Analysis of Risk Factors and Prediction of Type 2 Diabetes Mellitus Based on Machine Learning
Corresponding author: Lu Bo, lubodf@163.com
DOI: 10.12201/bmr.202509.00003
-
Abstract: Purpose/Significance Aiming at the limitations of traditional evidence-based medicine (EBM) in dissecting the multi-factor interaction mechanisms of type 2 diabetes mellitus (T2DM), this study constructed a multidimensional data mining prediction framework to enhance the accuracy of risk assessment and the efficiency of clinical decision-making. Method/Process Based on the Pima dataset, univariate, bivariate, and multivariate analyses were conducted to screen core risk factors. Five machine learning models—LR, RF, SVM, XGBoost, and LightGBM—were employed for modeling. Hyperparameter optimization was performed using grid search and cross-validation. Result/Conclusion The identified key risk factors (e.g., blood glucose level, body mass index, and age) were consistent with conclusions from traditional EBM. Among the models, RF achieved the highest prediction accuracy of 0.8701, demonstrating superior overall performance. By integrating data mining and feature selection, this method significantly reduces data collection costs and shortens the cycle of risk factor identification, while revealing nonlinear interaction mechanisms among variables. It provides an efficient tool for community-based screening of high-risk populations.
Key words: type 2 diabetes; risk prediction; multidimensional data mining; machine learningSubmit time: 1 September 2025
Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity. -
图表
-
wangyifan, shichaojun, maanning. Comparison of risk prediction models for atherosclerosis in type 2 diabetes mellitus. 2024. doi: 10.12201/bmr.202404.00007
liruiyao, xujingyi, daihaoyu, sunhuiwen, wutianxing. Deep confidence network-based prediction of microvascular complications in type 2 diabetes mellitus. 2024. doi: 10.12201/bmr.202404.00021
Analysis of risk factors for hypoglycemia in elderly outpatients with type 2 diabetes. 2024. doi: 10.12201/bmr.202412.00069
rui chen, chen yueqi, li jinbin, zhang shengfa. Progress and Trend of the Application of Artificial Intelligence in the Basic Health Management of Type 2 Diabetes. 2025. doi: 10.12201/bmr.202506.00072
WANG Li, PIAO Chun-li. Efficacy and safety of Yuye decoction with metformin in type 2 diabetes mellitus: A meta-analysis. 2024. doi: 10.12201/bmr.202407.00066
Liu Shiwei. The predictive value of neutrophil-to-lymphocyte ratio and platelet-to-lymphocyte ratio in the type 2 diabeticnephropathy. 2024. doi: 10.12201/bmr.202408.00066
Association Between Serum CXCL14 Levels and Carotid Atherosclerotic Plaques in Patients With Type 2 Diabetes Mellitus. 2025. doi: 10.12201/bmr.202506.00073
guolianmei, sunxiaohong, zhangzhe, tianye. Construction of a Prediction Model for Lower Limb Deep Vein Thrombosis in Patients with Hemorrhagic Stroke Based on Machine Learning. 2025. doi: 10.12201/bmr.202510.00023
HONG Suru, CHEN Yushuang, WU Xiayang. Mortality risk assessment and interpretability analysis of preterm infants in the NICU using machine learning models. 2025. doi: 10.12201/bmr.202503.00066
ZHOU Jie, XIAO Wei. To explore the mechanism of oxidative stress and metabolic imbalance in COPD combined with type 2 diabetes from Qi-blood-body fluid theory. 2025. doi: 10.12201/bmr.202506.00039
-
ID Submit time Number Download 1 2025-04-30 bmr.202509.00003V1
Download -
-
Public Anonymous To author only
Get Citation
Article Metrics
- Read: 130
- Download: 0
- Comment: 0