gongxiaocui, anxinying. Research on Semantic Feature Enhancement for Medical Literature Classification. 2024. biomedRxiv.202411.00088
Research on Semantic Feature Enhancement for Medical Literature Classification
Corresponding author: anxinying, an.xinying@imicams.ac.cn
DOI: 10.12201/bmr.202411.00088
-
Abstract: Purpose/Significance The rapid growth of medical literature poses new challenges for literature classification,it is very important to build an effective automatic classification model of medical literature.Method/Process Using medical literature as data source,this article utilizes the synonyms and hierarchical structure of the MeSH vocabulary to enhance the features of concept information,uses the BERT model for fine-tuning and testing,and compares the classification results with random forest algorithm.Result/Conclusion The results of the ten-fold cross-validation method show that the precision,recall and F1 score of the medical literature classification model based on Mesh and BERT are 95.42%,93.61%,94.47%, which are better than the classification results of random forest and pure BERT.The medical literature classification model based on Mesh and BERT shows high accuracy and effectiveness, and has certain applicability.
Key words: medical literature; MeSH; BERT; automatic classificationSubmit time: 29 November 2024
Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity. -
图表
-
zhaocongpu, YUAN Da, ZHU Pu-jue, ZHOU Jiong, CHEN Zheng, PENG Hua. Research and practice on intelligent classification of medical safety incidents based on deep BERT. 2023. doi: 10.12201/bmr.202312.00021
张胜发, zhou wei. A Research on the Classification of Biomedical big Data for Open Applications. 2024. doi: 10.12201/bmr.202411.00082
Li Xiaoying, Cai Miaozhi, Li Junlian, Ren Huiling, Ji Yujing, Deng Panpan, Xia Guanghui. Research on the construction of COVID-19 knowledge graph for literature organization. 2020. doi: 10.12201/bmr.202010.00840
wangjuan, HouLi. Analysis on the Classification of Problems in the Medical and Health Field. 2023. doi: 10.12201/bmr.202312.00023
Yu Shirui, Li Aihua, Lin Ziluo, Chen Yifei, Tang Xiaoli. A review of research on the improvement of topic model based topic evolution analysis methods for scientific literature. 2023. doi: 10.12201/bmr.202305.00016
wangjuan, HouLi. Research on automatic recommendation method for answer analysis of pediatric medical examination questions for knowledge question and answerWang Juan1, Hou Li1, Sun Yue-ping1, Li Jia-ming1,Dong Liang-guang2 Li Yun-han3. 2024. doi: 10.12201/bmr.202409.00026
ZHANG Wen, ZHANG Jian-tong, GUO Yu-shan. Sentiment Analysis of Online Medical Reviews Based on BERT and Semantics Collaboration through Dual-channel. 2024. doi: 10.12201/bmr.202407.00042
LI Yan-hong, 张迅, Huang Hailiang. Research on Frontier Identification of Medical Oncology Research Based on High Quality Literature. 2023. doi: 10.12201/bmr.202312.00010
Zhang Shengfa, Ma Yuhuan, Zhangjing Chen, Wang Jiayang, Sun Jingwen, Zhang Yue, Zhang Xiaoyu, Zhou Wei. Research on the Guidelines for Data classification of Health and Medical Science Data From the perspective of data security. 2023. doi: 10.12201/bmr.202303.00026
xiaoxiaoxia. Research on named entity recognition of Chinese medical records based on BERT-BiLSTM-CRF with Chinese radicals. 2023. doi: 10.12201/bmr.202303.00004
-
ID Submit time Number Download 1 2024-09-26 bmr.202411.00088V1
Download -
-
Public Anonymous To author only
Get Citation
Article Metrics
- Read: 171
- Download: 0
- Comment: 0