Improving Access to Medical Information for Multilingual Patients using Pipelined Ensemble Average based Machine Translation

Author:

Banik Debajyoty1,Paul Rahul2,Rathore Rajkumar Singh3,Jhaveri Rutvij H.4

Affiliation:

1. School Computer Science and Artificial Intelligence SR University, India

2. School of Computer Engineering Kalinga Institute of Industrial Technology, India

3. Department of Computer Science Cardiff School of Technologies, Cardiff Metropolitan University, United Kingdom

4. Department of Computer Science and Engineering, School of Technology Pandit Deendayal Energy University, India

Abstract

Machine translation has shown potential in improving access to medical information and healthcare services for multilingual patients. This research aims to enhance machine translation accuracy in the medical field, specifically for translating from Hindi to English. The study introduces a new approach that dynamically allocates decoding parameters using regression models, overcoming the limitations of fixed parameters in the decoder. A comprehensive dataset is created to address limited data availability, enabling regression models to predict optimal pruning parameters. The main motivation for the study is the introduction of a regression method for optimizing pruning parameters, which is a novel approach in this context. The proposed approach outperforms existing methods, achieving improved translation accuracy. Standard metrics such as the BLEU score are used to evaluate translations. Ensemble average and pipeline approaches further enhance performance. The improved performance of the proposed models can be attributed to the ensemble of diverse models (Extra Trees, LightGBM, XGBoost, and Random Forest) that employ various techniques to reduce overfitting, enhance prediction accuracy, and improve translation by correcting prediction errors. The study contributes to facilitating the translation and sharing of medical literature, promoting collaboration and knowledge exchange across languages. The research demonstrates the effectiveness of the regression method for optimizing pruning parameters in machine translation, leading to improved translation accuracy in the medical field. The proposed models offer promising results, paving the way for enhanced machine translation systems and promoting collaboration and knowledge exchange in the medical domain. The source code is available at https://huggingface.co/debajyoty/statistical-regression-Based-MT/tree/main/Statistical-Regression-SMT.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Reference37 articles.

1. Devansh Arpit Huan Wang Yingbo Zhou and Caiming Xiong. 2021. Ensemble of averages: Improving model selection and boosting performance in domain generalization. arXiv preprint arXiv:2110.10832(2021). Devansh Arpit Huan Wang Yingbo Zhou and Caiming Xiong. 2021. Ensemble of averages: Improving model selection and boosting performance in domain generalization. arXiv preprint arXiv:2110.10832(2021).

2. Phrase table re-adjustment for statistical machine translation

3. Debajyoty Banik , Pushpak Bhattacharyya , and Asif Ekbal . 2016 . Rule based hardware approach for machine transliteration: A first thought . In 2016 Sixth International Symposium on Embedded Computing and System Design (ISED). IEEE, 192–195 . Debajyoty Banik, Pushpak Bhattacharyya, and Asif Ekbal. 2016. Rule based hardware approach for machine transliteration: A first thought. In 2016 Sixth International Symposium on Embedded Computing and System Design (ISED). IEEE, 192–195.

4. Debajyoty Banik and Riya Roy Chowdhury . 2016 . Offline signature authentication: A back propagation-neural network approach . In 2016 Sixth International Symposium on Embedded Computing and System Design (ISED). IEEE, 334–339 . Debajyoty Banik and Riya Roy Chowdhury. 2016. Offline signature authentication: A back propagation-neural network approach. In 2016 Sixth International Symposium on Embedded Computing and System Design (ISED). IEEE, 334–339.

5. Machine learning based optimized pruning approach for decoding in statistical machine translation;Banik Debajyoty;IEEE Access,2018

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3