Affiliation:
1. Indian Institute of Information Technology, Nagpur
Abstract
Abstract
In the modern era, the necessity of digitization is increasing in a rapid manner day-to-day. The healthcare industries are working towards operating in a paperless environment. Digitizing the medical lab records help the patients in hassle-free management of their medical data. It may also prove beneficial for insurance companies for designing various medical insurance policies which can be patient-centric rather than being generalized. Optical Character Recognition (OCR) technology is demonstrated its usefulness for such cases and thus, to know the best possible solution for digitizing the medical lab records, there is a need to perform an extensive comparative study on the different OCR techniques available for this purpose. It is observed that the current research is focused mainly on the pre-processing image techniques for OCR development, however, their effects on OCR performance specially for medical report digitization yet not been studied. Herein this work, three OCR Engines viz Tesseract, EasyOCR and DocTR, and 6 pre-processing techniques: image binarization, brightness transformations, gamma correction, sigmoid stretching, bilateral filtering and image sharpening are surveyed in detail. In addition, an extensive comparative study of the performance of the OCR Engines while applying the different combinations of the image pre-processing techniques, and their effect on the OCR accuracy is presented.
Publisher
Research Square Platform LLC
Reference20 articles.
1. Measuring the operational impact of digitized hospital records: a mixed methods study;Scott PJ;BMC medical informatics and decision making,2016
2. Better patient outcomes through mining of biomedical big data;Suter-Crazzolara C;Frontiers in ICT,2018
3. An overview of feature extraction techniques in ocr for indian scripts focused on offline handwriting;Tawde GY;International Journal of Engineering Research and Applications,2013
4. A detailed analysis of optical character recognition technology;Hamad K;International Journal of Applied Mathematics Electronics and Computers,2016
5. Steps involved in text recognition and recent research in OCR; a study;Karthick K;International Journal of Recent Technology and Engineering,2019
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献