1. Clinical information extraction applications: A literature review;Wang;J. Biomed. Inform.,2018
2. Attention is all you need;Vaswani,2017
3. BERT: Pre-training of deep bidirectional transformers for language understanding;Devlin,2019
4. ELECTRA: Pre-training text encoders as discriminators rather than generators;Clark,2020
5. ALBERT: A lite BERT for self-supervised learning of language representations;Lan,2020