Visual Assistant

2025-06-28 16:36:39 - Adil Khan

Project Title

Visual Assistant

Project Area of Specialization

Artificial Intelligence

Project Summary

Visual Assistant is a mobile application for visually impaired people that identifies objects and their positions in real time using the mobile phone's camera. For example, if a person has a laptop and a chair in front of him, the application identifies them and their locations and informs him by saying, "There's a laptop 8 feet away and a chair 12 feet away, in front of you." Building on this information, it also describes the user's surroundings in the form of speech. For example, if a person is sitting in an office, the application identifies the environment, informs him by saying, "You are in the office environment," and tells him about the objects present around him. Furthermore, it can be used as a service in other systems involving environment analysis where real-time object information and scene analysis are required.
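As a rough illustration of the last step described above, the sketch below turns a list of detections into a spoken-style sentence. The Detection structure, its field names, and the describe function are hypothetical and chosen only for this example; the project does not specify how detections are represented.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """One detected object (hypothetical structure, not taken from the project)."""
    label: str          # e.g. "laptop"
    distance_ft: float  # approximate distance from the camera, in feet
    position: str       # e.g. "in front of you"


def describe(detections: list[Detection]) -> str:
    """Build one sentence per position, listing the nearest objects first."""
    if not detections:
        return "No objects detected."
    # Group detections by position, nearest first within each group.
    groups: dict[str, list[Detection]] = {}
    for det in sorted(detections, key=lambda d: d.distance_ft):
        groups.setdefault(det.position, []).append(det)
    sentences = []
    for position, dets in groups.items():
        parts = [f"{d.label} {round(d.distance_ft)} feet away" for d in dets]
        sentences.append(f"There's {' and '.join(parts)}, {position}.")
    return " ".join(sentences)
```

On the phone, a string like this would be the text handed to the speech engine.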

Project Objectives

The basic purpose of this project is to assist visually impaired people and reduce their dependence on other people for guidance about their environment. It will also approximate the distance of each object from the mobile phone's camera and inform the user about it, so that he can avoid many accidents.
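The proposal does not say how distance will be approximated. One common approach, shown here purely as an assumption, is the pinhole camera model: if the typical real-world height of an object class and the camera's focal length in pixels are known, then distance = real height × focal length / height in pixels. The table of typical heights and the function name below are illustrative guesses, not project data.

```python
# Typical real-world heights in feet (illustrative guesses, not project data).
TYPICAL_HEIGHT_FT = {"chair": 3.0, "laptop": 0.8}


def estimate_distance_ft(label: str, bbox_height_px: float, focal_length_px: float) -> float:
    """Pinhole-model estimate: distance = real_height * focal_length / pixel_height."""
    if bbox_height_px <= 0:
        raise ValueError("bounding-box height must be positive")
    real_height_ft = TYPICAL_HEIGHT_FT[label]  # raises KeyError for unknown classes
    return real_height_ft * focal_length_px / bbox_height_px
```

For instance, a chair whose bounding box is 250 pixels tall, seen by a camera with an assumed focal length of 1000 pixels, would be estimated at 3.0 × 1000 / 250 = 12 feet.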

Project Implementation Method

The project is being implemented with Android Studio and Django REST Framework. The Android-powered mobile phone will capture real-time video, and a frame will be sent to the Django API after each triggering event, such as when the user starts or stops moving, when a user-defined threshold number of seconds is reached, or when objects below the threshold were identified in the previous frame. The processing will be done on the server (in the cloud, once deployed), and the response will be returned to the Android application, which will then use the TextToSpeech API to inform the user about objects and their positions in the form of speech.
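The event-based frame upload described above could be sketched as follows. This is only a sketch of the decision logic, written in Python for brevity rather than as Android code. The class name, the default values, and the reading of the object threshold as a distance in feet are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class FrameTrigger:
    """Decides when a frame should be uploaded (hypothetical helper)."""
    max_interval_s: float = 5.0    # assumed user-defined time threshold
    near_distance_ft: float = 6.0  # assumed: object threshold read as a distance
    last_sent_at: float = float("-inf")
    was_moving: bool = False

    def should_send(self, now_s: float, is_moving: bool,
                    prev_distances_ft: list[float]) -> bool:
        # Event 1: the user started or stopped moving.
        motion_changed = is_moving != self.was_moving
        # Event 2: the time threshold has elapsed since the last upload.
        timed_out = now_s - self.last_sent_at >= self.max_interval_s
        # Event 3: the previous frame contained an object under the threshold.
        object_near = any(d < self.near_distance_ft for d in prev_distances_ft)
        self.was_moving = is_moving
        if motion_changed or timed_out or object_near:
            self.last_sent_at = now_s
            return True
        return False
```

Keeping this decision on the phone means only selected frames reach the server, which limits network traffic and server load compared with streaming every frame.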

Benefits of the Project

The project is of great benefit to visually impaired people and will help reduce their sense of deprivation. They will be able to become somewhat familiar with their environment and the obstacles in their way. Furthermore, it can act as a service for use in other relevant systems.

Technical Details of Final Deliverable

Our final application will use the mobile phone's camera to tell the user about the objects present in his surroundings, along with their distances. It will also tell him about the environment he is currently in.

Final Deliverable of the Project

Software System

Core Industry

IT

Other Industries

Education

Core Technology

Others

Other Technologies

Artificial Intelligence (AI)

Sustainable Development Goals

Good Health and Well-Being for People

Required Resources

Item Name                                          Type           No. of Units  Per Unit Cost (in Rs)  Total (in Rs)
Graphics Processing Unit (GPU, Nvidia GTX 1070)    Equipment      1             52000                  52000
Server Hosting                                     Equipment      1             15000                  15000
Document printing, poster printing, flex printing  Miscellaneous  10            1000                   10000
Total (in Rs)                                                                                          77000
