30-01-2012, 02:34 PM
Data Mining
Data Mining.pdf (Size: 536.5 KB / Downloads: 125)
What Is Data Mining?
Data mining (knowledge discovery from data)
Extraction of interesting patterns or knowledge from huge amounts of
data
Alternative names
Knowledge discovery (mining) in databases (KDD), knowledge
extraction, data/pattern analysis, information harvesting, business
intelligence, etc.
Predictive Modeling
Model is developed using a supervised
learning approach, which has two phases:
training and testing.
– Training builds a model using a large sample of
historical data called a training set.
– Testing involves trying out the model on new,
previously unseen data to determine its
accuracy and physical performance
characteristics.
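The two phases above can be sketched in a few lines of Python. This is only an illustration, not the method of any particular tool: the data set, the helper names (split_train_test, train, predict, accuracy) and the simple threshold model are all assumptions made for the example.

```python
import random

def split_train_test(records, test_fraction=0.25, seed=0):
    """Shuffle the records and hold out a fraction as the unseen test set."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

def train(training_set):
    """Training phase: learn a threshold halfway between the two class means."""
    pos = [x for x, label in training_set if label == 1]
    neg = [x for x, label in training_set if label == 0]
    return (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2

def predict(threshold, x):
    """Assign class 1 to values above the learned threshold, else class 0."""
    return 1 if x > threshold else 0

def accuracy(threshold, test_set):
    """Testing phase: fraction of held-out records classified correctly."""
    correct = sum(1 for x, label in test_set if predict(threshold, x) == label)
    return correct / len(test_set)

# Hypothetical historical data: (value, class label)
records = [(x, 0) for x in range(0, 20)] + [(x, 1) for x in range(30, 50)]

training_set, test_set = split_train_test(records)
model = train(training_set)
print("accuracy on unseen data:", accuracy(model, test_set))
```

The test set is never shown to train(), so the reported accuracy reflects how the model behaves on data it has not seen.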
Predictive Modeling - Classification
Used to establish a specific predetermined
class for each record in a database from a
finite set of possible class values.
Two specializations of classification: tree
induction and neural induction.
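As a rough illustration of tree induction, the sketch below learns a one-level decision tree (a "stump") from labelled records. It is a simplification: practical tree induction builds the tree recursively and usually scores splits with a measure such as information gain or the Gini index, whereas this sketch just counts misclassifications. The records, the feature meanings (age, income) and the function names are hypothetical.

```python
from collections import Counter

def majority(labels):
    """Most common class label in a list."""
    return Counter(labels).most_common(1)[0][0]

def induce_stump(records):
    """Pick the single feature/threshold split with the fewest training errors.

    records: list of (features_tuple, class_label).
    Returns (feature_index, threshold, left_class, right_class).
    Assumes at least one feature takes two or more distinct values.
    """
    best = None
    n_features = len(records[0][0])
    for f in range(n_features):
        values = sorted({feats[f] for feats, _ in records})
        # Candidate thresholds lie midway between adjacent distinct values.
        for lo, hi in zip(values, values[1:]):
            threshold = (lo + hi) / 2
            left = [lab for feats, lab in records if feats[f] <= threshold]
            right = [lab for feats, lab in records if feats[f] > threshold]
            left_class, right_class = majority(left), majority(right)
            errors = (sum(lab != left_class for lab in left)
                      + sum(lab != right_class for lab in right))
            if best is None or errors < best[0]:
                best = (errors, f, threshold, left_class, right_class)
    return best[1:]

def classify(stump, feats):
    """Assign one of the predetermined classes to a new record."""
    f, threshold, left_class, right_class = stump
    return left_class if feats[f] <= threshold else right_class

# Hypothetical records: (age, income in thousands) -> class
records = [
    ((25, 20), "reject"), ((30, 22), "reject"), ((40, 21), "reject"),
    ((35, 58), "approve"), ((45, 60), "approve"), ((50, 65), "approve"),
]

stump = induce_stump(records)
print(stump)
print(classify(stump, (28, 70)))
```

On this toy data the split falls on the income feature, since it separates the two classes without error while no age threshold does.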
Predictive Modeling - Value Prediction
Used to estimate a continuous numeric value that
is associated with a database record.
Uses the traditional statistical techniques of linear
regression and nonlinear regression.
Relatively easy to use and understand.
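A minimal sketch of value prediction with simple linear regression, fitted by ordinary least squares. The data points and the function names (fit_line, predict_value) are made up for the example; a real application would typically use a statistics package and more than one input variable.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def predict_value(slope, intercept, x):
    """Estimate the continuous value for a new record."""
    return slope * x + intercept

# Hypothetical records: input attribute x and the numeric value y to estimate
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]

slope, intercept = fit_line(xs, ys)
print(slope, intercept)
print(predict_value(slope, intercept, 5))
```

Because the sample points lie exactly on the line y = 2x + 1, the fit recovers that slope and intercept; with noisy data the line would only approximate the points.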