Discovering Conditional Functional Dependencies Abstract

**project girl** · 12-12-2012, 03:01 PM

Discovering Conditional Functional Dependencies

.docx

Discovering Conditional.docx (Size: 55.87 KB / Downloads: 21)

Abstract:

This paper investigates the discovery of conditional functional dependencies (CFDs). CFDs are a recent extension of functional dependencies (FDs) by supporting patterns of semantically related constants, and can be used as rules for cleaning relational data. However, finding CFDs is an expensive process that involves intensive manual effort. To effectively identify data cleaning rules, we develop techniques for discovering CFDs from sample relations. We provide three methods for CFD discovery. The first, referred to as CFDMiner, is based on techniques for mining closed itemsets, and is used to discover constant CFDs, namely, CFDs with constant patterns only. The other two algorithms are developed for discovering general CFDs. The first algorithm, referred to as CTANE, is a levelwise algorithm that extends TANE, a well-known algorithm for mining FDs. The other, referred to as FastCFD, is based on the depthfirst approach used in FastFD, a method for discovering FDs. It leverages closed-itemset mining to reduce search space. Our experimental results demonstrate the following. (a) CFDMiner can be multiple orders of magnitude faster than CTANE and FastCFD for constant CFD discovery. (b) CTANE works well when a given sample relation is large, but it does not scale well with the arity of the relation. © FastCFD is far more efficient than CTANE when the arity of the relation is large.

Existing System:

As remarked earlier, constant CFDs are particularly important for object identification, and thus deserve a separate treatment. One wants efficient methods to discover constant CFDs alone, without paying the price of discovering all CFDs. Indeed, as will be seen later, constant CFD discovery is often several orders of magnitude faster than general CFD discovery.
Levelwise algorithms may not perform well on sample relations of large arity, given their inherent exponential complexity. More effective methods have to be in place to deal with datasets with a large arity. A host of techniques have been developed for (non-redundant) association rule mining, and it is only natural to capitalize on these for CFD discovery. As we shall see, these techniques can not only be readily used in constant CFD discovery, but also significantly speed up general CFD discovery. To our knowledge, no previous work has considered these issues for CFD discovery.

Proposed System:

In light of these considerations we provide three algorithms for CFD discovery: one for discovering constant CFDs, and the other two for general CFDs.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	SMART NOTE TAKER ABSTRACT	study tips	1	1,109	18-09-2017, 03:59 PM Last Post: jaseela123
	Abstract on ZigBee report	project girl	1	836	14-09-2017, 03:45 PM Last Post: jaseela123
	PHONE ROOTING SEMINAR ABSTRACT	project girl	1	1,032	07-09-2017, 11:42 AM Last Post: jaseela123
	FIREWALLS ABSTRACT	study tips	1	954	02-09-2017, 10:27 AM Last Post: jaseela123
	BRAIN GATE SYSTEM ABSTRACT	study tips	1	1,010	31-08-2017, 03:57 PM Last Post: jaseela123
	delay tolerant network (Download Full Report And Abstract)	computer science crazy	0	21,138,274	25-08-2017, 09:32 PM Last Post: computer science crazy
	Conditional Access System	Computer Science Clay	0	8,669,255	25-08-2017, 09:32 PM Last Post: Computer Science Clay
	Resilent Packet Ring Networks (Download Full Report And Abstract)	computer science crazy	0	12,487,392	25-08-2017, 09:32 PM Last Post: computer science crazy
	Computer Science Seminar Abstract And Report 1	computer science crazy	0	11,845,795	25-08-2017, 09:32 PM Last Post: computer science crazy
	Computer Science Seminar Abstract And Report 4	computer science crazy	0	21,942,923	25-08-2017, 09:32 PM Last Post: computer science crazy

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.