Visual Assistant
2025-06-28 16:36:39 - Adil Khan
Project Area of Specialization: Artificial Intelligence
Project Summary
Visual Assistant is a mobile application for visually impaired people that identifies objects and their positions in real time using the mobile phone's camera. For example, if a person has a laptop and a chair in front of him, the application identifies each object and its location and informs him by saying "There's laptop 8 feet away and chair 12 feet away, in front of you". Based on this information, it also describes the user's surroundings in the form of speech. For example, if a person is sitting in an office, it identifies the environment, informs him by saying "You are in the office environment", and tells him about the objects present in his surroundings. Furthermore, it can be used as a service in other systems involving environment analysis where real-time object information and scene analysis are required.
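The spoken messages quoted in the summary can be assembled from a list of detections. The following is a minimal sketch, not the project's actual code: the `Detection` class and the functions `describe_objects` and `describe_scene` are hypothetical names introduced for illustration, and the sketch assumes every object is reported with the single direction phrase "in front of you".

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Detection:
    """Hypothetical record for one detected object (not from the project)."""
    label: str
    distance_ft: float


def describe_objects(detections: List[Detection],
                     direction: str = "in front of you") -> str:
    """Build one sentence listing objects, nearest first."""
    if not detections:
        return "No objects detected"
    ordered = sorted(detections, key=lambda d: d.distance_ft)
    parts = [f"{d.label} {round(d.distance_ft)} feet away" for d in ordered]
    if len(parts) == 1:
        listing = parts[0]
    else:
        # Join as "a, b and c" so the sentence reads naturally when spoken.
        listing = ", ".join(parts[:-1]) + " and " + parts[-1]
    return f"There's {listing}, {direction}"


def describe_scene(scene: str) -> str:
    """Build the sentence announcing the recognised environment."""
    return f"You are in the {scene} environment"


if __name__ == "__main__":
    print(describe_objects([Detection("chair", 12), Detection("laptop", 8)]))
    print(describe_scene("office"))
```

Sorting nearest first is a design choice of this sketch: the closest object is usually the most urgent one for the listener.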
Project Objectives
The basic purpose of this project is to assist visually impaired people and reduce their dependence on other people for guidance about their environment. It will also approximate the distance of each object from the mobile phone's camera and inform the user, so that he can avoid accidents that might otherwise occur.
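The proposal does not say how the distance is approximated. One common approach, shown here purely as an assumption, is the pinhole-camera relation: distance = focal length (in pixels) × real object height / object height in the image (in pixels). The function name, the focal length, and the typical heights in the table below are all hypothetical illustration values, not measurements from the project.

```python
# Hypothetical typical object heights in feet (illustrative values only).
TYPICAL_HEIGHT_FT = {
    "chair": 3.0,
    "laptop": 0.75,
}


def estimate_distance_ft(real_height_ft: float,
                         box_height_px: float,
                         focal_length_px: float) -> float:
    """Pinhole-camera estimate: distance = f * H / h.

    real_height_ft  -- assumed physical height of the object
    box_height_px   -- height of the detected bounding box in pixels
    focal_length_px -- camera focal length expressed in pixels (assumed known
                       from calibration)
    """
    if box_height_px <= 0:
        raise ValueError("bounding box height must be positive")
    return focal_length_px * real_height_ft / box_height_px


if __name__ == "__main__":
    # A chair assumed to be 3 ft tall that spans 250 px, with f = 1000 px.
    print(estimate_distance_ft(TYPICAL_HEIGHT_FT["chair"], 250, 1000))
```

An estimate of this kind is only as good as the assumed object height and the camera calibration, so it fits the proposal's wording of an approximation rather than a measurement.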
Project Implementation Method
The project is being implemented with Android Studio and Django REST Framework. The Android-powered mobile phone captures real-time video, and a frame is sent to the Django API after particular events, such as when the user starts or stops moving, when a user-defined threshold number of seconds has elapsed, or when objects nearer than a threshold distance were identified in the previous frame. The processing is done on a server (in the cloud, once deployed), and the response is returned to the Android application, which then uses the TextToSpeech API to inform the user about objects and their positions in the form of speech.
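The frame-sending events described above can be sketched as a small decision helper. This is an illustrative sketch under stated assumptions, not the project's implementation: the class `FrameTrigger`, its fields, and the default thresholds are hypothetical, and the third trigger is interpreted here as "the nearest object in the previous frame was closer than a threshold distance". The logic is written in Python for readability, although in the described architecture it would run in the Android client.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FrameTrigger:
    """Decides when the client should send a frame to the server (hypothetical)."""
    max_interval_s: float = 5.0       # user-defined time threshold (assumed value)
    near_distance_ft: float = 6.0     # distance threshold (assumed value)
    last_sent_at: Optional[float] = None
    was_moving: bool = False
    last_nearest_ft: Optional[float] = None

    def should_send(self, now: float, is_moving: bool) -> bool:
        # Trigger 1: the user started or stopped moving.
        movement_changed = is_moving != self.was_moving
        self.was_moving = is_moving
        # Trigger 2: no frame sent yet, or the time threshold has elapsed.
        interval_elapsed = (self.last_sent_at is None
                            or now - self.last_sent_at >= self.max_interval_s)
        # Trigger 3: the previous response reported a nearby object.
        object_near = (self.last_nearest_ft is not None
                       and self.last_nearest_ft < self.near_distance_ft)
        return movement_changed or interval_elapsed or object_near

    def record_sent(self, now: float, nearest_ft: Optional[float]) -> None:
        """Store the send time and the nearest distance from the server response."""
        self.last_sent_at = now
        self.last_nearest_ft = nearest_ft


if __name__ == "__main__":
    trigger = FrameTrigger()
    print(trigger.should_send(now=0.0, is_moving=False))  # first frame is always sent
```

With this interpretation, frames keep being sent on every check while an object remains inside the distance threshold, which trades bandwidth for fresher warnings about close obstacles.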
Benefits of the Project
The project is of great benefit to visually impaired people and will help reduce their sense of deprivation. They will be able to become somewhat familiar with their environment and the obstacles in their way. Furthermore, it can also act as a service for use in other relevant systems.
Technical Details of Final Deliverable
Our final application will use the mobile phone's camera to tell the user about the objects present in his surroundings, along with their distances. It will also tell him about the environment he is currently in.
Final Deliverable of the Project: Software System
Core Industry: IT
Other Industries: Education
Core Technology: Others
Other Technologies: Artificial Intelligence (AI)
Sustainable Development Goals: Good Health and Well-Being for People
Required Resources

| Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs) |
|---|---|---|---|---|
| Graphics Processing Unit (GPU, Nvidia GTX 1070) | Equipment | 1 | 52000 | 52000 |
| Server Hosting | Equipment | 1 | 15000 | 15000 |
| Document printing, poster printing, flex printing | Miscellaneous | 10 | 1000 | 10000 |
| Total (in Rs) | | | | 77000 |