21-01-2013, 10:15 AM
Operating system Windows 7
1Operating system.docx (Size: 37.36 KB / Downloads: 28)
Introduction
In our project we will –In shaallah- make a tool used to generate summaries of electronic documents. This has some applications like summarizing the search-engine results, providing briefs of big documents that do not have an abstract. There are two categories of summarizers, linguistic and statistical. Linguistic summarizers use knowledge about the language (syntax/semantics/usage …) to summarize a document. For our project we concern in Statistical ones that operate by finding the important sentences using statistical methods (like frequency of a particular word). Statistical summarizers normally do not use any linguistic information. In this project, an auto-summarization tool is developed using statistical techniques. The techniques involve finding the frequency of words, scoring the sentences, ranking the sentences. The summary is obtained by selecting a particular number of sentences (specified by the user) from the top of the list. It operates on a single document (but can be made to work on multiple documents by choosing proper algorithms for integration) and provides a summary of the document. The size of the summary can be specified by the user when invoking the tool. Pre-processing interfaces are there to handle the following document types: Plain Text, HTML, Word Document.
constraint :
- It can only process plain text only. whether it “.pdf” or “Ms-word” type.
- Some amount of information is lost while generation of the summary . the amount of information lost depend on the specified number of sentences by the user .
- It can only summarize one document at a time.
project methodology
There are a lot of a design processes which often used in software development processes and one of them is the waterfall that is considered as the basic model in this project.
The Waterfall Model
The waterfall model is a sequential software development model in which development is seen as flowing steadily downwards (like a waterfall) through several phases which are shown in figure