International Journal of Engineering and Management Research
  • Year: 2024
  • Volume: 14
  • Issue: 3

Image Text to Speech Conversion using Optical Character Recognition Technique in Raspberry PI

  • Author:
  • Mangesh Sarak1,*, S. S. Patil2, Abhijit S. Mali3
  • Total Page Count: 7
  • Page Number: 78 to 84

1Student, Department of Electronics and Telecommunication Engineering, Tatyasaheb Kore Institute of Engineering and Technology (An Autonomous Institute), Warananagar, Kolhapur, 416113, Maharashtra, India

2Professor, Department of Electronics and Telecommunication Engineering, Tatyasaheb Kore Institute of Engineering and Technology (An Autonomous Institute), Warananagar, Kolhapur, 416113, Maharashtra, India

3Professor, Department of Electronics and Telecommunication Engineering, Tatyasaheb Kore Institute of Engineering and Technology (An Autonomous Institute), Warananagar, Kolhapur, 416113, Maharashtra, India

*Corresponding Author: mangeshsarak@gmail.com

Online Published on 16 December, 2024.

Abstract

Optical Character Recognition (OCR) is a subset of artificial intelligence and is a subset of computer vision. Optical Character Recognition (OCR) is the use of Raspberry Pi to convert scanned bitmap images of handwritten or written text into audio performance. OCRs designed for a variety of world languages are now in use. In this method the context subtraction method based on the Gaussian mixture is used to recover the area of the moving object. For text content, the function of text localization and recognition is used. The text localization algorithm and the Tesract algorithm and edge pixel distributions based on the gradient properties of the stroke directions were used to automatically translate text areas from the object in the Ada enhancement model. In the translated text areas text characters are converted to binaries, which OCR software understands. For the blind, known text symbols are strongly pronounced. The potential of the algorithm for the proposed text location. The text file describes the character codes using the Raspberry system, which recognises the characters by using Tesract’s and Python, and the audio output is heard in the recognition step.

Keywords

Image, Text, Speech, PI