09-05-2012, 12:53 PM
Data Mining
Data_Mining_Oct06.pdf (Size: 565.88 KB / Downloads: 161)
Moore’s Law:
The information density on silicon-integrated circuits doubles every 18 to 24 months.
Parkinson’s Law:
Work expands to fill the time available for its completion.
Solving the Data Puzzle -a Step-by-Step Approach
Data collection
•Transactional systems
•Customer information systems
Data organization -data warehousing
Data analysis -data mining
Reporting
Action
Data Collection and Data Organization
What data has been collected and where is it?
How do I combine legacy systems with current data systems?
•Customer Story
What is the meaning of some of these data values?
Modeling Issues and Data Difficulties
Data Preparation
Rare or Unknown Targets
•Over Sampling
Undercoverage
Dirty Data
•Errors
•Missing Values
Dimension Reduction (Variable Selection)
Under and Over Fitting
Temporal Infidelity
Model Evaluation
Data_Mining_Oct06.pdf (Size: 565.88 KB / Downloads: 161)
Moore’s Law:
The information density on silicon-integrated circuits doubles every 18 to 24 months.
Parkinson’s Law:
Work expands to fill the time available for its completion.
Solving the Data Puzzle -a Step-by-Step Approach
Data collection
•Transactional systems
•Customer information systems
Data organization -data warehousing
Data analysis -data mining
Reporting
Action
Data Collection and Data Organization
What data has been collected and where is it?
How do I combine legacy systems with current data systems?
•Customer Story
What is the meaning of some of these data values?
Modeling Issues and Data Difficulties
Data Preparation
Rare or Unknown Targets
•Over Sampling
Undercoverage
Dirty Data
•Errors
•Missing Values
Dimension Reduction (Variable Selection)
Under and Over Fitting
Temporal Infidelity
Model Evaluation