BCDRRLE: A Bidirectional Cross-Dynamic Round Robin Learning Encoder Model for Medical Sentence Similarity (Preprint)

Author:

Huang BoORCID

Abstract

BACKGROUND

In recent years, the construction of medical informatization has achieved remarkable results. A large number of electronic medical records are stored in medical electronic systems. With the continuous progress and development of Natural Language Processing (NLP) technology, researchers are focusing their attention on developing Artificial intelligence (AI) systems using large electronic health records (EHRs). How to efficiently retrieve and recommend similar content from massive EHRs is an urgent problem to be solved.

OBJECTIVE

This paper mainly solves the following problems for medical text similarity—(1)our model can effectively alleviate the problem of insufficient data in supervised learning (2)existing methods cannot generate high-quality sentence vectors (3) how to mine enough knowledge from unsupervised data?

METHODS

We propose a bidirectional cross-dynamic polling learning encoder model. This model uses semi-supervised learning to generate high-quality sentence vectors when there is a small amount of labeled data.

RESULTS

Our proposed Bidirectional Cross-Dynamic Round Robin Learning Encoder(BCDRRLE)structural model outperforms state-of-the-art models on three medical text sentence similarity datasets. Furthermore, our model can also produce higher-quality sentence representations.

CONCLUSIONS

The experimental results demonstrate that our proposed BCDRRLE structure model can still produce very good results in the case of a small amount of data. By introducing contrastive learning and an enhanced version of the denoising autoencoder, our model can efficiently produce high-quality sentence representations. Our proposed dynamic polling learning algorithm helps to further improve the performance of the model. Our approach not only yields better performance but is also more general and can be extended to other related task domains that use text representations.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3