In this project we are making document reader for visually impaired people, developed on Raspberry Pi. It uses the Optical character recognition technology for the identification of the printed characters using image sensing devices and computer programming. It converts images of typed, handwri
Raspberry pi based reader for blind people
In this project we are making document reader for visually impaired people, developed on Raspberry Pi. It uses the Optical character recognition technology for the identification of the printed characters using image sensing devices and computer programming. It converts images of typed, handwritten, or printed text into machine encoded text. In this research these images are converted into the audio output (Speech) audio output is achieved.
Character Recognition has become main aspect of Computer vision. Optical Character Recognition is a method where characters are recognized from images digitally. In this project an innovative, efficient and real-time cost beneficial technique that enables user to hear the contents of text images instead of reading through them has been introduced. Text-to-Speech converter is a device that scans and reads English alphabets in the image using OCR technique and changing it to voices. It combines the concept of Optical Character Recognition (OCR) and Text to Speech Synthesizer (TTS) in Raspberry pi. This kind of system helps visually impaired people to interact with computers effectively through vocal interface. Text Extraction from color images is a challenging task in computer vision. This project describes the design, implementation and experimental results of the device. This device consists of two modules, image processing module and voice processing module.
In this system, the printed text is to be placed under the camera view by the blind person to ensure the image of good quality and fewer distortions. Then an applicable blindassistive system, a text localization algorithm might prefer higher recall by sacrificing some precision. When the application starts at first, it checks the availability of all the devices and also for the connection. The GUI displays the status of the image clicked from the camera and a status box for representing the image. The Raspberry Pi has integrated peripheral devices like USB, ADC, Bluetooth and Serial.
The operating system under which the proposed project is executed is Raspbian which is derived from the Debian operating system. The algorithms are written using the python language which is a script language. The functions in algorithm are called from the OpenCV Library. OpenCV is an open source computer vision library, which is written under C and C++ and runs under Linux, Windows and Mac OS X. OpenCV was designed for computational efficiency and with a strong focus on real-time applications. OpenCV is written in optimized C and can take advantage of multi-core processors.The operating system under which the proposed project is executed is Raspbian which is derived from the Debian operating system.As the recognition process is completed, the character codes in the text file are processed using Raspberry Pi device on which recognize a character using Tesseract algorithm and python programming, the audio output listens.
It can save time of users by reading more data at a time than earlier system.
Earlier system concise to specific writing style but proposed system will present 2 to 3 writing styles and fonts.
Blind people can hear more documents in less time efficiently.
In this project an innovative, efficient and real-time cost beneficial technique that enables user to hear the contents of text images instead of reading through them has been introduced.
When capture button is clicked, this system captures the document image placed in front of the camera which is connected to ARM microcontroller through USB .After selecting the process button the captured document image undergoes Optical Character Recognition(OCR) Technology. OCR technology allows the conversion of scanned images of printed text or symbols into text or information that can be understood or edited using a computer program. In our system for OCR technology we are using TESSERACT library. Using Text-to-speech library the data will be converted to audio. Camera acts as main vision in detecting the image of the placed document, then image is processed internally and separates label from image by using open CV library and finally identifies the text which is pronounced through voice. Now the converted text into audio output is listened either by connecting headsets via 3.5mm audio jack or by connecting speakers via Bluetooth.
Raspberry Pi is a low cost, credit card sized computer that plugs into computer monitor or TV and uses standard keyboard and mouse.There are two models of it, Raspberry Pi 2 and Raspberry Pi 3.
The hardware components of the Raspberry Pi include power supply, storage, input, monitor and network. Power Supply Unit is the device that supplies electrical energy to the output loads. It gives a well regulated power supply of +5v with a output current compatibility of 100 mA. Camera feeds its images in real time to a computer or computer network, often via USB, Ethernet or Wi-Fi. HDMI to VGA Converter is used to connect the Raspberry Pi board to the Projectors, Monitors and TV.
SOFTWARE IMPLEMENTATION Operating system: Raspbian (Debian) Language: Python2.7 Platform: Tesseract, OpenCV (Linux-library) Library: OCR engine, TTS engine The operating system under which the proposed project is executed is Raspbian which is derived from the Debian operating system. The algorithms are written using the python language which is a script language. The functions in algorithm are called from the OpenCV Library. OpenCV is an open source computer vision library, which is written under C and C++ and runs under Linux, Windows and Mac OS X. OpenCV was designed for computational efficiency and with a strong focus on real-time applications. OpenCV is written in optimized C and can take advantage of multi-core processors.
The ASCII values of the recognized characters are processed by Raspberry Pi board. Here each of the characters is matched with its corresponding template and saved as normalized text transcription. This transcription is further delivered to the audio output.
| Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs) |
|---|---|---|---|---|
| Raspbery pi 4 | Equipment | 1 | 10500 | 10500 |
| pi v2 camera | Equipment | 1 | 4600 | 4600 |
| hadware model | Equipment | 1 | 2500 | 2500 |
| SD card | Equipment | 2 | 1400 | 2800 |
| VGA cables and connectors | Equipment | 3 | 1000 | 3000 |
| Power adapter | Equipment | 1 | 550 | 550 |
| camera extension wire | Equipment | 1 | 600 | 600 |
| push button | Equipment | 1 | 50 | 50 |
| Pi case | Equipment | 1 | 450 | 450 |
| speaker | Equipment | 1 | 500 | 500 |
| paper printing(thesis) | Miscellaneous | 1 | 1000 | 1000 |
| Transport | Miscellaneous | 1 | 1500 | 1500 |
| Total in (Rs) | 28050 |
In today's world, almost all purposes. They find their application in defense, surve...
Temperature and humidity are two of the most closely studied environmental factors since t...
Tech Mart is an online e-commerce website where you can buy and explore products from diff...
The concept of VR-Mart is to enable users either the buyer or the seller to buy and sell f...