Author:
Muthusundari Muthusundari, Velpoorani A, Venkata Kusuma S, L Trisha, Rohini Om.k.
Abstract
Optical character recognition (OCR) is a technique for extracting text from images. The primary goal of an OCR system is to transform existing paper documents or image data into usable documents. An OCR system is built from several algorithms and has two main phases: character detection and word detection. A more sophisticated approach also identifies sentences, which preserves the document's structure. Despite the efforts of numerous researchers, no error-free Bengali OCR has been produced. This work addresses the issue by developing an OCR for the Bengali language using the latest version, 3.03, of the Tesseract OCR engine for Windows.