Book Prism

It is an ancient dream to replicate machines to perform human functions, like reading. However, machine learning has grown from a dream to reality, over the last five decades. Now, there are several techniques and algorithms to train a machine in order to perform things like humans. The purpose of o

2025-06-28 16:25:43 - Adil Khan

Project Title

Book Prism

Project Area of Specialization Software EngineeringProject Summary

It is an ancient dream to replicate machines to perform human functions, like reading. However, machine learning has grown from a dream to reality, over the last five decades. Now, there are several techniques and algorithms to train a machine in order to perform things like humans. The purpose of our project is to provide a platform to kids where they can listen to the audio of the storybook with the highlighted text. The whole system is categorized into five modules, the image processing module, voice processing module, removal of garbage text, syncing of text and speech, and video generation. Image processing module converts the image into text, whereas the voice processing module changes the text into sound. However, in removal of garbage text we will remove the garbage text which includes page numbers and title of the book at the top of the page.  Moreover, in the syncing module we will synchronize text and speech and highlight the text. In last module, which is video generation we will generate video of the synchronized text with audio.

Project Objectives

The main goals and objectives of our project are:

To highlight the text in order to make it visible to the user.

Project Implementation Method

Techniques:

Languages:

Tools:

PyCharm with Django Framework: It is an integrated development environment used in computer programming, specifically for the Python language. It provides smart code completion, code inspections, quick-fixes along with automated code refactoring and rich navigation capabilities. PyCharm takes care of creating specific directory structure and files required for a Django application and provides the correct settings.

Benefits of the Project

Our project is very important in today’s world. From this project, we devolved the interest of children in books. When different types of books be read and listened by children, this will develop creative and innovative skills among them. These children are growing youngsters of our country so it is very necessary to build them with right direction and aimful lives.

Technical Details of Final Deliverable

Electronic learning is gaining an educational foothold all over the world. This project aims to develop a web application that enables the user to hear the contents of the book with the highlighted text synchronized with an audio. Moreover, the concepts of Optical Character Recognition (OCR) [3] and Text to Speech (TTS) [4] synthesis will be incorporated in our application.

Our system will allow its users to read story books, listen to the story books, delete story books, download the video, search the story book from the category list, mark the story books as private or public and the user can also rate the story books.

Final Deliverable of the Project Software SystemCore Industry ITOther IndustriesCore Technology Artificial Intelligence(AI)Other TechnologiesSustainable Development Goals Quality EducationRequired Resources
Item Name Type No. of Units Per Unit Cost (in Rs) Total (in Rs)
Total in (Rs) 50000
Resources Equipment50100050000

More Posts