Abstract
Intent classification and named entity recognition of medical questions are two key subtasks of the natural language understanding module in the question answering system. Most existing methods usually treat medical queries intent classification and named entity recognition as two separate tasks, ignoring the close relationship between the two tasks. In order to optimize the effect of medical queries intent classification and named entity recognition tasks, a multi-task learning model based on ALBERT-BILSTM is proposed for intent classification and named entity recognition of Chinese online medical questions. The multi-task learning model in this paper makes use of encoder parameter sharing, which enables the model’s underlying network to take into account both named entity recognition and intent classification features. The model learns the shared information between the two tasks while maintaining its unique characteristics during the decoding phase. The ALBERT pre-training language model is used to obtain word vectors containing semantic information and the bidirectional LSTM network is used for training. A comparative experiment of different models was conducted on Chinese medical questions dataset. Experimental results show that the proposed multi-task learning method outperforms the benchmark method in terms of precision, recall and F1 value. Compared with the single-task model, the generalization ability of the model has been improved.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Xinjiang, China
Strengthening Plan of National Defense Science and Technology Foundation of China
Reference36 articles.
1. Gerner, M., Nenadic, G., and Bergman, C.M. (2010). LINNAEUS: A species name identification system for biomedical literature. BMC Bioinform., 11.
2. Toward information extraction: Identifying protein names from biological papers;Fukuda;Pac. Symp. Biocomput.,1998
3. Drug name recognition in biomedical texts: A machine-learning-based method;He;Drug Discov. Today,2014
4. Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training;Chen;J. Biomed. Inform.,2019
5. Huang, Z., Xu, W., and Yu, K. (2015). Bidirectional LSTM-CRF Models for Sequence Tag-grog. arXiv.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献