10-08-2012, 03:32 PM
Decision Trees for Uncertain Data
Decision Trees for Uncertain Data.pptx (Size: 1.11 MB / Downloads: 46)
Abstract
Traditional decision tree classifiers work with data whose values are known and precise. We extend such classifiers to handle data with uncertain information.
With uncertainty, the value of a data item is often represented not by one single value, but by multiple values forming a probability distribution.
Since processing pdfs (probability density functions) is computationally more costly than processing single values (e.g., averages), decision tree construction on uncertain data is more CPU demanding than for certain data.
To tackle this problem, we propose a series of pruning techniques that can greatly improve construction efficiency.
Existing System
In traditional decision-tree classification, a feature (an attribute) of a tuple is either categorical or numerical.
For the latter, a precise and definite point value is usually assumed.
In many applications, however, data uncertainty is common.
Although the previous techniques can improve efficiency, they do not consider the spatial relationship among cluster representatives when performing pruning in batch.
Proposed System
A simple way to handle data uncertainty is to abstract probability distributions by summary statistics such as means and variances. We call this approach Averaging.
Another approach is to consider the complete information carried by the probability distributions to build a decision tree. We call this approach Distribution-based.
We study the problem of constructing decision tree classifiers on data with uncertain numerical attributes.
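The difference between the two approaches above can be sketched in a few lines. This is a minimal illustration, not the paper's algorithm: all names (`averaging`, `prob_left`) and the sample-based pdf representation are assumptions made here for clarity. Averaging collapses each tuple's pdf to its mean and sends the tuple wholly to one side of a split point, while the distribution-based approach splits the tuple fractionally by the probability mass on each side.

```python
import random

random.seed(0)

def averaging(samples):
    """Averaging: collapse each pdf (here, a list of samples) to its mean."""
    return sum(samples) / len(samples)

def prob_left(samples, split):
    """Distribution-based: probability mass of the pdf at or below the split."""
    return sum(1 for s in samples if s <= split) / len(samples)

# Two tuples whose true attribute values are uncertain, each represented
# by 1000 samples drawn from a Gaussian pdf (an illustrative choice).
tuple_a = [random.gauss(2.0, 0.5) for _ in range(1000)]
tuple_b = [random.gauss(3.0, 0.5) for _ in range(1000)]

split = 2.5
# Averaging: each tuple goes entirely left or entirely right of the split.
left_avg = [averaging(t) <= split for t in (tuple_a, tuple_b)]
# Distribution-based: each tuple contributes a fractional weight to each side.
left_frac = [prob_left(t, split) for t in (tuple_a, tuple_b)]
```

Note how the distribution-based view preserves information that Averaging discards: a tuple whose pdf straddles the split contributes partial weight to both branches, which is exactly the extra information a distribution-based tree exploits.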
Data Insertion
In many applications, however, data uncertainty is common. The value of a feature/attribute is thus best captured not by a single point value, but by a range of values giving rise to a probability distribution.
With uncertainty, the value of a data item is often represented not by one single value, but by multiple values forming a probability distribution.
This uncertain data is inserted by the user.
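One simple way to capture such an uncertain attribute at insertion time is a discretized pdf: a list of (value, probability) pairs rather than a single point value. The sketch below is only illustrative; the function names and the discretized representation are assumptions, not the system's actual storage format.

```python
def make_uncertain_value(points, weights):
    """Normalize weights so the (value, probability) pairs form a valid pdf."""
    total = sum(weights)
    return [(v, w / total) for v, w in zip(points, weights)]

def expected_value(dist):
    """The single point value the Averaging approach would keep instead."""
    return sum(v * p for v, p in dist)

# A user inserts a tuple whose sensor reading is uncertain: it is 20.0
# with probability 0.5, and 19.0 or 21.0 with probability 0.25 each.
reading = make_uncertain_value([19.0, 20.0, 21.0], [1, 2, 1])

# The distribution-based approach stores `reading` intact;
# Averaging would reduce it to its mean.
mean = expected_value(reading)
```

Keeping the full pair list is what makes later tree construction more expensive: every split evaluation must iterate over the distribution instead of comparing one number.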