11-05-2012, 01:09 PM
Decision Trees
lecture11-DecisionTrees.ppt (Size: 455 KB / Downloads: 50)
Classification Decision Trees
Nonmetric data: each pattern is described by a list of binary attributes.
Apple is red and round. Banana is long and yellow.
Difficult to apply nearest neighbor and other techniques.
Decision Tree: based on the game of twenty questions.
Apply a series of tests to the input pattern
Each test asks a question: e.g. “is the pattern yellow?”
The answer is “yes” or “no”.
The answers give the classification, e.g. the pattern is yellow, not round, and long, so it is a banana.
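A minimal sketch (not from the lecture) of this twenty-questions idea: a hand-built chain of yes/no tests on binary attributes. The attribute names and classes are illustrative.

```python
def classify(pattern):
    """Classify a pattern (dict of binary attributes) with hand-built tests."""
    if pattern["yellow"]:          # test 1: "is the pattern yellow?"
        if pattern["long"]:        # test 2: "is the pattern long?"
            return "banana"
        return "lemon"
    if pattern["red"] and pattern["round"]:
        return "apple"
    return "unknown"

print(classify({"yellow": True, "long": True, "red": False, "round": False}))
# prints "banana"
```

Each internal node of a decision tree is one such test; the path of answers leads to a leaf carrying the class label.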
CART: Design Decision Tree
CART is a general framework for designing decision trees.
Basic issues:
(1) Should we restrict ourselves to binary questions?
(2) Which attributes should be tested at each node?
(3) When should a node be declared a leaf?
(4) How can we prune a large tree?
(5) How do we assign a class label to a leaf node?
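For issue (2), a common CART-style answer is to test, at each node, the attribute that most reduces an impurity measure. A sketch, assuming entropy impurity; the data and attribute names are illustrative.

```python
import math

def entropy(labels):
    """Entropy impurity of a list of class labels."""
    n = len(labels)
    probs = [labels.count(c) / n for c in set(labels)]
    return -sum(p * math.log2(p) for p in probs)

def impurity_drop(data, labels, attr):
    """Reduction in impurity from splitting on a binary attribute."""
    yes = [y for x, y in zip(data, labels) if x[attr]]
    no = [y for x, y in zip(data, labels) if not x[attr]]
    n = len(labels)
    remainder = (len(yes) / n) * entropy(yes) + (len(no) / n) * entropy(no)
    return entropy(labels) - remainder

# Pick the attribute to test at this node: the one with the largest drop.
data = [{"yellow": True, "long": True}, {"yellow": True, "long": False},
        {"yellow": False, "long": False}, {"yellow": False, "long": False}]
labels = ["banana", "lemon", "apple", "apple"]
best = max(data[0], key=lambda a: impurity_drop(data, labels, a))
print(best)  # prints "yellow"
```

Here "yellow" wins because splitting on it sends both apples to one branch, leaving less impurity than splitting on "long".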
When to Stop
Generalization versus memorization is a key issue in learning.
We don't want a decision tree that simply memorizes the training data. The tree should generalize, i.e. give good classifications on data it has not been trained on.
The decision tree gives a rule for classifying data. Its empirical risk is the error the rule makes on the training data; an empirical risk of zero may just mean the tree has memorized that data.
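The empirical risk can be computed directly; a minimal sketch (names are illustrative), taking it as the fraction of training examples the rule gets wrong.

```python
def empirical_risk(rule, data, labels):
    """Fraction of examples misclassified by the rule."""
    errors = sum(1 for x, y in zip(data, labels) if rule(x) != y)
    return errors / len(labels)

# A toy rule and training set: one of three examples is misclassified.
rule = lambda x: "banana" if x["long"] else "apple"
data = [{"long": True}, {"long": False}, {"long": True}]
labels = ["banana", "apple", "apple"]
print(empirical_risk(rule, data, labels))
```

A tree grown until every leaf is pure drives this number to zero on the training set, which is exactly the memorization we want to avoid.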
Cross Validation
A general principle for testing whether you are generalizing or memorizing: combine learning with validation or cross-validation.
Validation – learn the decision tree on part of the dataset and evaluate performance on the other part.
Cross-validation – split the dataset into several subsets. For each subset, learn on the remaining subsets and evaluate on the held-out one, then average the results.
E.g. learn decision trees with different impurity thresholds beta.
Select the tree, and hence the beta, which has the best validation performance.
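The selection procedure can be sketched as follows. This is not from the lecture: k_fold_splits, fit_tree, and err are illustrative names, and fit_tree is a trivial majority-class stand-in (it ignores beta) just to make the cross-validation logic runnable.

```python
def k_fold_splits(n, k):
    """Split indices 0..n-1 into k folds; yield (train, test) index pairs."""
    folds = [list(range(i, n, k)) for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, test

def cross_val_error(fit, error, data, labels, k):
    """Average held-out error of a learner over k folds."""
    total = 0.0
    for train, test in k_fold_splits(len(data), k):
        model = fit([data[i] for i in train], [labels[i] for i in train])
        total += error(model, [data[i] for i in test],
                       [labels[i] for i in test])
    return total / k

# Stand-in for learning a tree pruned at impurity threshold beta:
# here just a majority-class rule (hypothetical, ignores beta).
def fit_tree(X, y, beta):
    majority = max(set(y), key=y.count)
    return lambda x: majority

def err(model, X, y):
    return sum(model(x) != t for x, t in zip(X, y)) / len(y)

data = [0, 1, 2, 3, 4, 5]
labels = ["a", "a", "a", "a", "a", "b"]
best_beta = min([0.1, 0.2, 0.4], key=lambda b: cross_val_error(
    lambda X, y: fit_tree(X, y, b), err, data, labels, k=3))
```

With a real tree learner, each candidate beta gets its own cross-validated error, and the beta with the lowest average held-out error is selected.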