• 国家药监局综合司 国家卫生健康委办公厅
  • 国家药监局综合司 国家卫生健康委办公厅

Research on Semantic Feature Enhancement for Medical Literature Classification

Corresponding author: anxinying, an.xinying@imicams.ac.cn
DOI: 10.12201/bmr.202411.00088
Statement: This article is a preprint and has not been peer-reviewed. It reports new research that has yet to be evaluated and so should not be used to guide clinical practice.
  •  

    Abstract: Purpose/Significance The rapid growth of medical literature poses new challenges for literature classification,it is very important to build an effective automatic classification model of medical literature.Method/Process Using medical literature as data source,this article utilizes the synonyms and hierarchical structure of the MeSH vocabulary to enhance the features of concept information,uses the BERT model for fine-tuning and testing,and compares the classification results with random forest algorithm.Result/Conclusion The results of the ten-fold cross-validation method show that the precision,recall and F1 score of the medical literature classification model based on Mesh and BERT are 95.42%,93.61%,94.47%, which are better than the classification results of random forest and pure BERT.The medical literature classification model based on Mesh and BERT shows high accuracy and effectiveness, and has certain applicability.

    Key words: medical literature; MeSH; BERT; automatic classification

    Submit time: 29 November 2024

    Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity.
  • 图表

  • zhaocongpu, YUAN Da, ZHU Pu-jue, ZHOU Jiong, CHEN Zheng, PENG Hua. Research and practice on intelligent classification of medical safety incidents based on deep BERT. 2023. doi: 10.12201/bmr.202312.00021

    张胜发, zhou wei. A Research on the Classification of Biomedical big Data for Open Applications. 2024. doi: 10.12201/bmr.202411.00082

    Li Xiaoying, Cai Miaozhi, Li Junlian, Ren Huiling, Ji Yujing, Deng Panpan, Xia Guanghui. Research on the construction of COVID-19 knowledge graph for literature organization. 2020. doi: 10.12201/bmr.202010.00840

    wangjuan, HouLi. Analysis on the Classification of Problems in the Medical and Health Field. 2023. doi: 10.12201/bmr.202312.00023

    Yu Shirui, Li Aihua, Lin Ziluo, Chen Yifei, Tang Xiaoli. A review of research on the improvement of topic model based topic evolution analysis methods for scientific literature. 2023. doi: 10.12201/bmr.202305.00016

    wangjuan, HouLi. Research on automatic recommendation method for answer analysis of pediatric medical examination questions for knowledge question and answerWang Juan1, Hou Li1, Sun Yue-ping1, Li Jia-ming1,Dong Liang-guang2 Li Yun-han3. 2024. doi: 10.12201/bmr.202409.00026

    ZHANG Wen, ZHANG Jian-tong, GUO Yu-shan. Sentiment Analysis of Online Medical Reviews Based on BERT and Semantics Collaboration through Dual-channel. 2024. doi: 10.12201/bmr.202407.00042

    LI Yan-hong, 张迅, Huang Hailiang. Research on Frontier Identification of Medical Oncology Research Based on High Quality Literature. 2023. doi: 10.12201/bmr.202312.00010

    Zhang Shengfa, Ma Yuhuan, Zhangjing Chen, Wang Jiayang, Sun Jingwen, Zhang Yue, Zhang Xiaoyu, Zhou Wei. Research on the Guidelines for Data classification of Health and Medical Science Data From the perspective of data security. 2023. doi: 10.12201/bmr.202303.00026

    xiaoxiaoxia. Research on named entity recognition of Chinese medical records based on BERT-BiLSTM-CRF with Chinese radicals. 2023. doi: 10.12201/bmr.202303.00004

  • ID Submit time Number Download
    1 2024-09-26

    bmr.202411.00088V1

    Download
  • Public  Anonymous  To author only

Get Citation

gongxiaocui, anxinying. Research on Semantic Feature Enhancement for Medical Literature Classification. 2024. biomedRxiv.202411.00088

Article Metrics

  • Read: 171
  • Download: 0
  • Comment: 0

Email This Article

User name:
Email:*请输入正确邮箱
Code:*验证码错误