Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach

Author:

Phatak AtharvaORCID,Savage David WORCID,Ohle RobertORCID,Smith JonathanORCID,Mago VijayORCID

Abstract

Background In most cases, the abstracts of articles in the medical domain are publicly available. Although these are accessible by everyone, they are hard to comprehend for a wider audience due to the complex medical vocabulary. Thus, simplifying these complex abstracts is essential to make medical research accessible to the general public. Objective This study aims to develop a deep learning–based text simplification (TS) approach that converts complex medical text into a simpler version while maintaining the quality of the generated text. Methods A TS approach using reinforcement learning and transformer–based language models was developed. Relevance reward, Flesch-Kincaid reward, and lexical simplicity reward were optimized to help simplify jargon-dense complex medical paragraphs to their simpler versions while retaining the quality of the text. The model was trained using 3568 complex-simple medical paragraphs and evaluated on 480 paragraphs via the help of automated metrics and human annotation. Results The proposed method outperformed previous baselines on Flesch-Kincaid scores (11.84) and achieved comparable performance with other baselines when measured using ROUGE-1 (0.39), ROUGE-2 (0.11), and SARI scores (0.40). Manual evaluation showed that percentage agreement between human annotators was more than 70% when factors such as fluency, coherence, and adequacy were considered. Conclusions A unique medical TS approach is successfully developed that leverages reinforcement learning and accurately simplifies complex medical paragraphs, thereby increasing their readability. The proposed TS approach can be applied to automatically generate simplified text for complex medical text data, which would enhance the accessibility of biomedical research to a wider audience.

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

Reference47 articles.

1. CarrollJMinnenGPearceDCanningYDevlinSTaitJSimplifying text for language-impaired readers1999Ninth Conference of the European Chapter of the Association for Computational LinguisticsJune 8-12, 1999Bergen, NorwayNew Brunswick, NJAssociation for Computational Linguistics269270

2. Unsupervised Lexical Simplification for Non-Native Speakers

3. PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification

4. Rebecca ThomasSAndersonSWordNet-Based Lexical Simplification of a DocumentProceedings of the 11th Conference on Natural Language Processing (KONVENS 2012)2012The 11th Conference on Natural Language Processing (KONVENS 2012)September 19-21, 2012Vienna, Austria80

5. Lexical Simplification with Pretrained Encoders

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A review of reinforcement learning for natural language processing and applications in healthcare;Journal of the American Medical Informatics Association;2024-08-29

2. Hybrid Prompt Learning for Generating Justifications of Security Risks in Automation Rules;ACM Transactions on Intelligent Systems and Technology;2024-06-29

3. Diffusion Models for Image Generation to Enhance Health Literacy;2024 IEEE 12th International Conference on Healthcare Informatics (ICHI);2024-06-03

4. Biomedical text readability after hypernym substitution with fine-tuned large language models;PLOS Digital Health;2024-04-16

5. Next-Gen Language Mastery: Exploring Advances in Natural Language Processing Post-transformers;Lecture Notes in Networks and Systems;2024

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3