chenjianqiu, huangxiaofang. Joint extraction of Chinese EMR entity relationship based on bert. 2022. biomedRxiv.202206.00003
Joint extraction of Chinese EMR entity relationship based on bert
Corresponding author: huangxiaofang, 448401501@qq.com
DOI: 10.12201/bmr.202206.00003
-
Abstract: Electronic medical record is some clinical information of patients generated by medical staff in the medical process, including a large number of medical entities related to patients health. How to extract medical information efficiently from unstructured medical record text has become a research hotspot in the field of natural language processing (NLP). At present, the joint entity relationship extraction model mainly identifies entities and then extracts relationships for classification. However, this method will be affected by redundant entities, and can not well capture the internal relationship between entities and relationships. In order to solve these problems, this paper uses a cascade decoder for relationship extraction, First, the head entity is identified by the head entity identification module, and then the tail entity is identified for different relationships by the relationship specific tail entity annotation module. In addition, the characteristics of EMR entities are mainly the high-density distribution of entities and the cross interconnection of relationships between entities. In view of this characteristic, this paper uses the pointer annotation method to solve the problem of entity nesting in EMR documents, and improves the tail entity relationship specific annotator module to solve the problem of cross interconnection of relationships between entities. The comparative experiment selects two mainstream models as the baseline and successively verifies them in the chip2020 data set. The F value of this method has increased by 3 percentage points. Experiments show that the proposed method is very effective for relationship extraction.
Key words: natural language process; Chinese EMR; relation extraction; joint extraction modelSubmit time: 1 June 2022
Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity. -
图表
-
lizihao, Chen Mosha, Ma Zhenxin, Yin Kangping, Tong Yixuan, Tan Chuanqi, Lang ZhenZhen, Tang Buzhou. CMedCausal - A dataset of Chinese medical causal relationship extraction. 2022. doi: 10.12201/bmr.202211.00004
pangzhen, GuJiYu, WuYuFei, YanSshiXing, LiWangYang, SunYue. A study on the solution of the problem of extracting essential substance of TCM diagnosis and treatment of hypertension based on triple extraction strategy. 2021. doi: 10.12201/bmr.202107.00015
Liu Zhongyu, Yao Jia, Yu Siwei, Zheng Ziqiang, Lan Lan, Yin Jin. Research on Analysis and Countermeasures of Medical Disputes Based on Knowledge Extraction. 2021. doi: 10.12201/bmr.202110.00022
wuxuehong. A method of recognizing entities from Chinese Electronic Medical Record based on domain word vector combined with word attributes reasoning. 2021. doi: 10.12201/bmr.202109.00016
Li Wenfeng, 朱威, 王晓玲. Text2DT: Decision rule extraction technology for clinical medical texts. 2022. doi: 10.12201/bmr.202211.00002
You Liping, WangShiyu. Extraction of Adverse Drug Events from Social Media Based on FrameNet Semantic Analysis YOU Liping, WANG Shiyu, LI Chaofan, College of Economics and Management, Shanxi University, Taiyuan 030006, China.. 2022. doi: 10.12201/bmr.202211.00006
Xiang Fei. Research on the influencing factors of nurses protection of patients privacy in EMR. 2020. doi: 10.12201/bmr.202009.00013
Guan Zhihao, Shan Zhiyi, Lin Ziluo, yangxuemei, Tang Xiaoli. Discovery of potential comorbidity relationship based on co-occurrence and citation of entities. 2022. doi: 10.12201/bmr.202203.00003
kangyishuai, shaochenjie. An Algorithm for Generating TCM Document Questions Based on Unified Language Model. 2022. doi: 10.12201/bmr.202110.00044
SUN Chenghao, LIU Fen, ZHAO Feng. Research on electronic Medical Record System based on Block chain technology. 2020. doi: 10.12201/bmr.202007.00012
-
ID Submit time Number Download 1 2022-01-05 bmr.202206.00003V1
Download -
-
Public Anonymous To author only
Get Citation
Article Metrics
- Read: 1157
- Download: 13
- Comment: 0