In today?s day and age, information is in excess and time is limited. SUMM-IT-UP is a web application that generates accurate, coherent and concise extractive summaries of single/multiple documents, without restrictions of domain specificity, on the basis of concept extraction using an ensemble of N
SummItUp Content Summary Generator
In today’s day and age, information is in excess and time is limited. SUMM-IT-UP is a web application that generates accurate, coherent and concise extractive summaries of single/multiple documents, without restrictions of domain specificity, on the basis of concept extraction using an ensemble of Natural Language Processing techniques. We’ve managed to achieve accuracy much higher than state-of-the-art approaches in multi document summarization and very comparable results in single document summarization.
This project was developed keeping in mind the problem of excess information and limited time to process it. The aim of this project was to create an application that provides users with relatively accurate extractive summaries in a short span of time, almost instantaneously in real time. The model developed should be light weight such as to be deployed on web.
The project consited of two modules, the summarization model and web app. Both modules were created using Python as the primary programing language. Bootstrap, flask and vanila js were used to develop the web app and integrate it with the summarization model.
The model was developed focusing on the content and context of the data. It consists of the following sub modules: data cleaning and preprocessing, coreference resolution, sentence encodation, clustering, ranking and assembling of selected sentences. An ensemble of NLP tools and techniques were used to create it. Libraries such as tensorflow, scikit-learn, genism, NLTK etc were used in the process.
This project was prove to be very benificial in all walks of life. Regardless of what profession the user belongs to, this application has the ability to summarize any content to produce a concise version of the actual document. As a result saving time, effort and resources.
The project can be divided into two modules: summarization model and web app. Python was the primary programing language for development of both modules.
The model was developed focusing on the content and context of the data. It consists of the following sub modules:
An ensemble of NLP tools and techniques were used to create it. Libraries such as tensorflow, scikit-learn, genism, NLTK etc were used in the process.
| Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs) |
|---|---|---|---|---|
| Printing | Miscellaneous | 300 | 10 | 3000 |
| Poster | Miscellaneous | 1 | 2000 | 2000 |
| GPU For Training | Equipment | 1 | 60000 | 60000 |
| Total in (Rs) | 65000 |
These days, industries are growing at a very fast pace, and we are required to have...
Augmented reality is becoming the biggest market in the world, but Pakistan is a country w...
Natural disasters around the world cause widespread destruction of property, physical inju...
The proposed system will be a web-based application. Where people can find their missing l...
A problem exists in the remote locations of our country that lack qualified medical staff....