Metrics and Information Retrieval
Introduction:
Primary Concern:
Effectiveness in the Information Retrieval (IR) field.
Information retrieval (IR) is the task of representing, storing, organizing, and offering access to information items.
To evaluate an IR system is to measure how well the system meets the information needs of the users.
This is difficult, given that the same result set might be interpreted differently by distinct users.
To deal with this problem, metrics have been defined that, on average, correlate with the preferences of a group of users.
Without proper retrieval evaluation, one cannot:
determine how well the IR system is performing;
objectively compare the performance of the IR system with that of other systems.
Retrieval evaluation is a critical and integral component of any modern IR system
Retrieval performance evaluation consists of associating a quantitative metric with the results produced by an IR system.
This metric should be directly associated with the relevance of the results to the user.
Usually, its computation requires comparing the results produced by the system with results suggested by humans for the same set of queries.
At present, a total of 44 different metrics are available.
Their classification is based on two factors: relevance and retrieval.
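As a concrete illustration of such a quantitative metric, here is a minimal Python sketch of set-based precision and recall; the document IDs and relevance judgments are invented purely for the example.

```python
def precision(retrieved, relevant):
    """Fraction of retrieved documents that are relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(retrieved) if retrieved else 0.0

def recall(retrieved, relevant):
    """Fraction of relevant documents that were retrieved."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(relevant) if relevant else 0.0

# Hypothetical query: 10 documents retrieved, 5 of them among the 8 judged relevant.
retrieved = ["d1", "d2", "d3", "d4", "d5", "d6", "d7", "d8", "d9", "d10"]
relevant  = ["d1", "d3", "d5", "d7", "d9", "d11", "d12", "d13"]
print(precision(retrieved, relevant))  # 5/10 = 0.5
print(recall(retrieved, relevant))     # 5/8  = 0.625
```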
Precision at N (P@N) and R-Precision:
Precision and recall are not enough for evaluating IR systems.
For example, suppose two systems each retrieve 10 documents, 5 relevant and 5 not relevant; both have precision 0.5. However, a system that ranks the 5 relevant documents first and the 5 irrelevant ones last is much better than one that ranks the 5 irrelevant documents first, because the user of the second system is forced to wade through irrelevant documents before reaching any relevant ones.
Thus, modified measures that combine precision and recall and consider the order of the retrieved documents are needed.
Some good measures are: precision at 5 retrieved documents (P@5), precision at 10 retrieved documents (P@10) or at some other cut-off point N, and R-Precision.
R-Precision is the precision at the R-th position in the ranking of the results for a query that has R known relevant documents. At that position, precision is equal to recall.
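Continuing the two-system example above, the following sketch (with hypothetical document IDs) shows how P@N and R-Precision separate two rankings that plain precision cannot:

```python
def precision_at_n(ranking, relevant, n):
    """Precision over the top-n ranked documents."""
    return sum(1 for d in ranking[:n] if d in relevant) / n

def r_precision(ranking, relevant):
    """Precision at rank R, where R = number of known relevant documents."""
    return precision_at_n(ranking, relevant, len(relevant))

relevant = {"r1", "r2", "r3", "r4", "r5"}
system_a = ["r1", "r2", "r3", "r4", "r5", "x1", "x2", "x3", "x4", "x5"]  # relevant first
system_b = ["x1", "x2", "x3", "x4", "x5", "r1", "r2", "r3", "r4", "r5"]  # relevant last

print(precision_at_n(system_a, relevant, 10))  # 0.5 -- plain precision ties them
print(precision_at_n(system_b, relevant, 10))  # 0.5
print(precision_at_n(system_a, relevant, 5))   # 1.0 -- P@5 separates them
print(precision_at_n(system_b, relevant, 5))   # 0.0
print(r_precision(system_a, relevant))         # 1.0 (here R = 5)
print(r_precision(system_b, relevant))         # 0.0
```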
Other Measures:
Relative recall: the ratio between the number of relevant documents found and the number of relevant documents the user expected to find.
Recall effort: the ratio between the number of relevant documents the user expected to find and the number of documents that had to be examined to find them.
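A minimal sketch of these two ratios, with hypothetical counts for a single search session:

```python
def relative_recall(relevant_found, relevant_expected):
    """Relevant documents found / relevant documents the user expected to find."""
    return relevant_found / relevant_expected

def recall_effort(relevant_expected, documents_examined):
    """Relevant documents expected / documents examined to find them."""
    return relevant_expected / documents_examined

# Hypothetical session: the user expected 4 relevant documents,
# found 3 of them, and had to examine 20 documents along the way.
print(relative_recall(3, 4))   # 0.75
print(recall_effort(4, 20))    # 0.2
```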
Q-measure and R-measure:
The following are some properties of Q-measure and R-measure (a computational sketch of both follows this list):
Q-measure is equal to one iff the system output is an ideal one.
R-measure is equal to one iff all of the top R documents are (at least partially) relevant. That is, it cannot tell the difference between an ideal ranked output and, say, an output that has all B-relevant (partially relevant) documents at the very top, followed by all A-relevant (relevant) ones, followed by all S-relevant (highly relevant) ones.
In a binary relevance environment, Q-measure = Average Precision (AveP) holds iff there is no relevant document below rank R, and Q-measure > AveP holds otherwise.
In a binary relevance environment, R-measure = R-Prec.
With small gain values, Q-measure behaves like AveP.
With small gain values, R-measure behaves like R-prec.
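Here is a Python sketch of both metrics, based on Sakai's published formulation of the blended ratio of cumulative gain and relevant-document count. The post gives no formulas, so this reconstruction, the S=3/A=2/B=1 gain values, and the example judgments are all assumptions for illustration.

```python
def q_and_r_measure(run_gains, all_gains):
    """Compute Q-measure and R-measure for one query.

    run_gains : gain of each document in the system's ranked output
                (0 for a non-relevant document).
    all_gains : gains of ALL judged relevant documents for the query,
                used to build the ideal ranked output.
    The run is assumed to contain at least R = len(all_gains) documents.
    """
    R = len(all_gains)
    ideal = sorted(all_gains, reverse=True)    # ideal ranked output
    cg = cg_ideal = count = 0
    blended_sum = 0.0
    r_measure = 0.0
    for r, g in enumerate(run_gains, start=1):
        cg += g                                # cumulative gain of the run
        if r <= R:
            cg_ideal += ideal[r - 1]           # cumulative gain of the ideal output
        if g > 0:                              # rank r holds a relevant document
            count += 1
            blended_sum += (cg + count) / (cg_ideal + r)   # blended ratio
        if r == R:
            r_measure = (cg + count) / (cg_ideal + r)
    return blended_sum / R, r_measure          # Q-measure, R-measure

# Hypothetical query with three relevant documents: one S (gain 3),
# one A (gain 2), one B (gain 1). The system returns them at ranks 1, 3 and 5.
q, r = q_and_r_measure([3, 0, 2, 0, 1], [3, 2, 1])
print(round(q, 3), round(r, 3))                # 0.865 0.778
```

With all gains set to 1, the blended ratio at a relevant rank r ≤ R reduces to count/r, i.e. precision at r, which is how the binary-relevance properties above (Q-measure = AveP, R-measure = R-Prec) fall out of the same code.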
IR metrics based on graded relevance are required to:
Prefer systems that return highly relevant documents to those that return partially relevant documents;
Prefer systems that have relevant documents near the top of the ranked list to those that have relevant documents near the bottom.
Q-measure is an averageable graded-relevance metric.
It is very highly correlated with AveP and is at least as stable and discriminative as AveP.
Q-measure uses recall as its basis, whereas normalized Discounted Cumulative Gain (nDCG) is rank-based.
Q-measure is more flexible than nDCG and generalized Average Precision.
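For contrast with Q-measure's recall basis, here is one common formulation of nDCG applied to the same hypothetical run as in the sketch above; nDCG has several variants, and the log2 rank discount used here is an assumption.

```python
import math

def ndcg(run_gains, all_gains, cutoff=None):
    """nDCG: DCG of the run divided by DCG of the ideal ranked output.
    Uses the common log2 rank discount; rank 1 is not discounted."""
    ideal = sorted(all_gains, reverse=True)
    if cutoff is not None:
        run_gains, ideal = run_gains[:cutoff], ideal[:cutoff]
    def dcg(gains):
        return sum(g / math.log2(r + 1) for r, g in enumerate(gains, start=1))
    return dcg(run_gains) / dcg(ideal)

# Same hypothetical run and judgments as in the Q-measure sketch.
print(round(ndcg([3, 0, 2, 0, 1], [3, 2, 1]), 3))   # 0.921
```

Note the structural difference: nDCG weights each gain by the document's rank alone, while Q-measure normalizes each relevant rank against the ideal cumulative gain and divides by R, the total number of relevant documents, which is what ties it to recall.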