
Construction of a prediction model for postoperative survival of pancreatic cancer based on SMOTE-ENN combined with XGBoost algorithm

Corresponding author: GuoYarong, gyr5258@126.com
DOI: 10.12201/bmr.202506.00058
Statement: This article is a preprint and has not been peer-reviewed. It reports new research that has yet to be evaluated and so should not be used to guide clinical practice.

    Abstract: Purpose: To build and compare models that predict survival outcomes after pancreatic cancer surgery, using several algorithms, the new version of AJCC staging, and large-scale data. Methods: Using the SEER database, the SMOTE and SMOTE-ENN algorithms were applied to handle class imbalance; logistic regression (LR), random forest (RF), support vector machine (SVM), decision tree (DT), and XGBoost were used to build and compare prognostic models; and SHAP was introduced to interpret the models. Results: The SMOTE-ENN combined with XGBoost model performed best (accuracy 0.862, precision 0.952, recall 0.712, F1 score 0.762, AUC 0.884, Brier score 0.108). The calibration curve showed that the model was well calibrated, and the decision curve showed high clinical application value. Conclusion: The XGBoost model performed best and can serve as a new high-performance postoperative prognosis model that conforms to the current AJCC clinical staging system, providing theoretical support for predicting postoperative survival outcomes and formulating personalized treatment plans.
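    For readers less familiar with the reported metrics, the following is a minimal sketch of their standard binary-classification definitions. It is not the authors' code: the function name `binary_metrics`, the 0.5 threshold, and the toy labels and probabilities are all hypothetical, and the abstract does not state which averaging convention the study used.

```python
def binary_metrics(y_true, y_prob, threshold=0.5):
    """Standard binary metrics from true labels (0/1) and predicted probabilities."""
    if len(y_true) != len(y_prob) or not y_true:
        raise ValueError("y_true and y_prob must be non-empty and equal in length")

    # Turn probabilities into hard predictions (threshold is an assumption).
    y_pred = [1 if p >= threshold else 0 for p in y_prob]

    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    n = len(y_true)

    accuracy = (tp + tn) / n
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    # Brier score: mean squared error between probability and outcome.
    brier = sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / n

    # AUC as the fraction of (positive, negative) pairs ranked correctly,
    # counting ties as half. Undefined if one class is missing.
    pos = [p for t, p in zip(y_true, y_prob) if t == 1]
    neg = [p for t, p in zip(y_true, y_prob) if t == 0]
    if pos and neg:
        wins = sum(1.0 if a > b else 0.5 if a == b else 0.0
                   for a in pos for b in neg)
        auc = wins / (len(pos) * len(neg))
    else:
        auc = float("nan")

    return {"accuracy": accuracy, "precision": precision, "recall": recall,
            "f1": f1, "auc": auc, "brier": brier}


# Hypothetical toy data, not taken from the study.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_prob = [0.9, 0.8, 0.3, 0.2, 0.1, 0.6, 0.4, 0.05]
metrics = binary_metrics(y_true, y_prob)
```

    Accuracy, precision, recall, and F1 depend on the chosen threshold, while AUC and the Brier score are computed from the probabilities directly; a lower Brier score indicates better-calibrated probabilities.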

    Key words: pancreatic cancer; imbalanced data; XGBoost; outcome prediction

    Submit time: 23 June 2025

    Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity.


  • Version history: V1, submitted 2025-06-01, 10.12201/bmr.202506.00058V1

Get Citation

LuoYanhong, GuoYarong. Construction of a prediction model for postoperative survival of pancreatic cancer based on SMOTE-ENN combined with XGBoost algorithm. 2025. biomedRxiv.202506.00058
