Abstract on Incremental Information Extraction Using Relational Databases

**project girl** · 13-11-2012, 01:53 PM

Incremental Information Extraction Using Relational Databases

Abstract

Information extraction systems are traditionally
implemented as a pipeline of special-purpose processing
modules targeting
the extraction of a particular kind of information. A
major drawback of such an approach is that whenever a
new extraction goal emerges or a module is improved,
extraction has to be reapplied from scratch to the entire
text corpus even though only a small part of the corpus
might be affected. In this paper, we describe a novel
approach for information extraction in which extraction
needs are expressed in the form of database queries,
which are evaluated and optimized by database systems.
Using database queries for information extraction
enables generic extraction and minimizes reprocessing
of data by performing incremental extraction to identify
which part of the data is affected by the change of
components or goals. Furthermore, our approach
provides automated query generation components so
that casual users do not have to learn the query language
in order to perform extraction. To demonstrate the
feasibility of our incremental extraction approach, we
performed experiments to highlight two important
aspects of an information extraction system: efficiency
and quality of extraction results. Our experiments show
that in the event of deployment of a new module, our
incremental extraction approach reduces the processing
time by 89.64 percent as compared to a traditional
pipeline approach. By applying our methods to a corpus
of 17 million biomedical abstracts, our experiments
show that the query performance is efficient for realtime
applications. Our experiments also revealed that
our approach achieves high quality extraction results.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	A Novel Data Embedding Method Using Adaptive Pixel Pair Matching Report	project girl	3	4,489	15-01-2018, 01:56 PM Last Post: dhanabhagya
	Detecting False Data in Wireless Sensor Network using Efficient Becan Scheme	seminar tips	1	3,235	20-09-2017, 01:03 PM Last Post: jaseela123
	MANAGEMENT INFORMATION SYSTEMS	seminar ideas	1	2,282	19-09-2017, 04:45 PM Last Post: jaseela123
	Color Image Indexing Using BTC	seminar tips	1	1,436	19-09-2017, 02:52 PM Last Post: jaseela123
	Mobile Messenger Using Ad-hoc Networks	seminar code	1	682	19-09-2017, 02:50 PM Last Post: jaseela123
	District Collector Office Information Integration	project report helper	7	18,713,174	18-09-2017, 04:41 PM Last Post: jaseela123
	System Analysis (Modeling of the Existing and Proposed System using OOD)	seminar flower	1	2,459	15-09-2017, 03:39 PM Last Post: jaseela123
	DESIGN AND PERFORMANCE ANALYSIS OF OPTICAL CDMA SYSTEM USING NEWLY DESIGNED MULTIWAVE	project girl	1	1,270	15-09-2017, 01:34 PM Last Post: jaseela123
	INTERNATIONAL JOURNAL OF ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY PROJECT	project maker	1	708	14-09-2017, 01:15 PM Last Post: jaseela123
	Secure Online Examination Management using XML full report	seminar class	1	294,875	14-09-2017, 12:51 PM Last Post: jaseela123

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.