Adil Khan 10 months ago
AdiKhanOfficial #FYP Ideas

SummItUp Content Summary Generator

In today?s day and age, information is in excess and time is limited. SUMM-IT-UP is a web application that generates accurate, coherent and concise extractive summaries of single/multiple documents, without restrictions of domain specificity, on the basis of concept extraction using an ensemble of N

Project Title

SummItUp Content Summary Generator

Project Area of Specialization

Artificial Intelligence

Project Summary

In today’s day and age, information is in excess and time is limited. SUMM-IT-UP is a web application that generates accurate, coherent and concise extractive summaries of single/multiple documents, without restrictions of domain specificity, on the basis of concept extraction using an ensemble of Natural Language Processing techniques. We’ve managed to achieve accuracy much higher than state-of-the-art approaches in multi document summarization and very comparable results in single document summarization.

Project Objectives

This project was developed keeping in mind the problem of excess information and limited time to process it. The aim of this project was to create an application that provides users with relatively accurate extractive summaries in a short span of time, almost instantaneously in real time. The model developed should be light weight such as to be deployed on web. 

Project Implementation Method

The project consited of two modules, the summarization model and web app. Both modules were created using Python as the primary programing language. Bootstrap, flask and vanila js were used to develop the web app and integrate it with the summarization model.

The model was developed focusing on the content and context of the data. It consists of the following sub modules: data cleaning and preprocessing, coreference resolution, sentence encodation, clustering, ranking and assembling of selected sentences. An ensemble of NLP tools and techniques were used to create it. Libraries such as tensorflow, scikit-learn, genism, NLTK etc were used in the process. 

Benefits of the Project

This project was prove to be very benificial in all walks of life. Regardless of what profession the user belongs to, this application has the ability to summarize any content to produce a concise version of the actual document. As a result saving time, effort and resources. 

Technical Details of Final Deliverable

The project can be divided into two modules: summarization model and web app. Python was the primary programing language for development of both modules. 

The model was developed focusing on the content and context of the data. It consists of the following sub modules:

  • Data Cleaning and Preprocessing
  • Coreference Resolution
  • Sentence Embeddings Generation: For this a skip thoughts encoder and decoders were trained on wikipedia 2016 articles dump.  
  • Clustering 
  • Ranking based on contextual similarity and Assembling of selected sentences

An ensemble of NLP tools and techniques were used to create it. Libraries such as tensorflow, scikit-learn, genism, NLTK etc were used in the process. 

Final Deliverable of the Project

Software System

Type of Industry

IT

Technologies

Artificial Intelligence(AI)

Sustainable Development Goals

Decent Work and Economic Growth

Required Resources

Item Name Type No. of Units Per Unit Cost (in Rs) Total (in Rs)
Printing Miscellaneous 300103000
Poster Miscellaneous 120002000
GPU For Training Equipment16000060000
Total in (Rs) 65000
If you need this project, please contact me on contact@adikhanofficial.com
Multi-Shape Classification Algorithm Development for Pick and Place Ro...

 These days, industries are growing at a very fast pace, and we are required to have...

1675638330.png
Adil Khan
10 months ago
FOUR VIEW HOLOGRAPHIC DISPLAY USING SEMI-REFLECTIVE MIRRORS

Augmented reality is becoming the biggest market in the world, but Pakistan is a country w...

1675638330.png
Adil Khan
10 months ago
Flood detection and monitoring using IOT

Natural disasters around the world cause widespread destruction of property, physical inju...

1675638330.png
Adil Khan
10 months ago
MISSING PEOPLE FACIAL RECOGNITION SYSTEM

The proposed system will be a web-based application. Where people can find their missing l...

1675638330.png
Adil Khan
10 months ago
VIRA Virtual Reality Telepresence Robot

A problem exists in the remote locations of our country that lack qualified medical staff....

1675638330.png
Adil Khan
10 months ago