pangzhen, GuJiYu, WuYuFei, YanSshiXing, LiWangYang, SunYue. A study on the solution of the problem of extracting essential substance of TCM diagnosis and treatment of hypertension based on triple extraction strategy. 2021. biomedRxiv.202107.00015
A study on the solution of the problem of extracting essential substance of TCM diagnosis and treatment of hypertension based on triple extraction strategy
Corresponding author: SunYue, sunyue@nhei.cn
DOI: 10.12201/bmr.202107.00015
-
Abstract: Objective To propose a new entity extraction model based on triplet information extraction strategy to solve the problem of entity dispersion when traditional named entity recognition method is applied to TCM medical entity recognition, and to provide methodological reference for automatic extraction of TCM medical entity. Methods Pre-trained BERT for TCM domain were established in this paper. And then, 2000 cases of TCM hypertension were selected to train and test several relational extraction models: conventional BiGRU model; The joint extraction model based on BERT_TCM; CASREL model based on BERT_TCM and FGM/PGD adversarial training. Results Precision, recall and F1-score were used as evaluation metrics. And CASREL model based on BERT_TCM and PGD work best, with a result of precision 0.8887, recall 0.8841 and F1-score 0.8932. Conclusion Based on triple extraction strategy of hypertension medical entity extraction model can effectively solve the problem; BERT_TCM for TCM specific scene has better performance in TCM hypertension relationship extraction task; The joint extraction model significantly improves the performance of the model compared with BiGRU model. CASREL model has better performance than joint extraction model. Introduction of confrontation training technology can effectively improve model robustness.
Key words: Joint extraction、Casrel extraction、Named entity recognition、hypertension、Cases of traditional Chinese MedicineSubmit time: 26 August 2021
Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity. -
图表
-
HU Haiyang, ZHAO Congpu, Ma Lian, JIANG Huizhen, ZHANG Jing, ZHU Weiguo. Attention Mechanism And Dilated Convolution Neural Networks for Named Entity Recognition. 2021. doi: 10.12201/bmr.202102.00004
GUO ZheTao, SHI Wenli, YANG Tao. Design and implementation of intelligent speech assistant for clinical diagnosis and treatment of traditional Chinese medicine based on speech recognition. 2021. doi: 10.12201/bmr.202110.00013
Liu Zhongyu, Yao Jia, Yu Siwei, Zheng Ziqiang, Lan Lan, Yin Jin. Research on Analysis and Countermeasures of Medical Disputes Based on Knowledge Extraction. 2021. doi: 10.12201/bmr.202110.00022
wuxuehong. A method of recognizing entities from Chinese Electronic Medical Record based on domain word vector combined with word attributes reasoning. 2021. doi: 10.12201/bmr.202109.00016
WangXin. Clinical experience of Yang Zhihong in the treatment of hypertension vertigo with hyperactivity of liver Yang. 2020. doi: 10.12201/bmr.202004.00008
ZHOU Hai, PENG Feiling, WEI Changfa. Research and Practice on the Construction of Syndrome Differentiation Knowledge Graph of Diagnostics of Traditional Chinese Medicine. 2020. doi: 10.12201/bmr.202004.00036
guo xuan zhi, zhou wu jie, shang xin, lian chun hua, zhan kai ming, lin long yong. The Model based on UNILM of question conditional generation in the field of Chinese medicine. 2021. doi: 10.12201/bmr.202110.00036
zhangyiran, LiQiang, 肖勇, 车玉霞, 朱佳卿. Development Course and Prospect of Information Standards of Traditional Chinese Medicine in China. 2021. doi: 10.12201/bmr.202107.00008
xiaoyong. Thinking on the development of Internet accelerated speed plus Traditional Chinese medicine health service based on PEST analysis. 2020. doi: 10.12201/bmr.202011.00005
kangyishuai, shaochenjie. An Algorithm for Generating TCM Document Questions Based on Unified Language Model. 2022. doi: 10.12201/bmr.202110.00044
-
ID Submit time Number Download 1 2021-07-27 bmr.202107.00015V1
Download -
-
Public Anonymous To author only
Get Citation
Article Metrics
- Read: 1022
- Download: 6
- Comment: 0