DATA QUALITY MINING USING GENETIC ALGORITHM PPT

**project girl** · 03-12-2012, 05:43 PM

DATA QUALITY MINING USING GENETIC ALGORITHM

.pptx

DATA QUALITY MINING.pptx (Size: 73.9 KB / Downloads: 29)

ABSTRACT

Data Quality Mining (DQM) is a new data mining approach from the business point of view.
People use information attribute as a tool for accessing data quality.
The goal of DQM is to employ data mining methods in order to detect, quantify, explain and correct data qualify deficiencies in very large databases.

GENETIC ALGORITHM

GA process is an iteration manner by generating new populations of strings from old ones.
Standard GA apply genetic operators such selection, crossover and mutation on an initially random population in order to compute a whole generation of new strings.
Selection deals with the probabilistic survival of the fittest, in that more fit chromosomes are chosen to survive. Where fitness is a comparable measure of how well a chromosome solves the problem at hand.
Crossover takes individual chromosomes from population combines them to form new ones.
Mutation alters the new solutions so as to add stochasticity in the search for better solutions.

METHODOLOGY

Steps in methodology
1. Load a sample of records from the database that fits in the memory.
2. Generate N chromosomes randomly.
3. Decode them to get the values of the different attributes.
4. Scan the loaded sample to find the support of antecedent part, consequent part and the rule.
5. Find the confidence, comprehensibility, completeness and interestingness values.
6. Rank the chromosomes depending on the non-dominance property.
7. Assign fitness to the chromosomes using the ranks, as mentioned earlier.

CONCLUSION

In this present work, we have used a Pareto based genetic algorithm to solve the multi-objective rule mining problem using four measures––completeness, comprehensibility, interestingness and the predictive accuracy
This approach may not work properly in the given dataset and it is not homogeneous as this is applied on a sample of dataset.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	data mining full report	project report tiger	37	374,184,749	16-03-2019, 05:22 PM Last Post: TitkinWY
	A Novel Data Embedding Method Using Adaptive Pixel Pair Matching Report	project girl	3	4,489	15-01-2018, 01:56 PM Last Post: dhanabhagya
	Detecting False Data in Wireless Sensor Network using Efficient Becan Scheme	seminar tips	1	3,235	20-09-2017, 01:03 PM Last Post: jaseela123
	Different Initialization Data and the Performance by the BFM	seminar flower	1	680	20-09-2017, 12:44 PM Last Post: jaseela123
	Color Image Indexing Using BTC	seminar tips	1	1,436	19-09-2017, 02:52 PM Last Post: jaseela123
	Mobile Messenger Using Ad-hoc Networks	seminar code	1	682	19-09-2017, 02:50 PM Last Post: jaseela123
	Wide Area Mobile Data Services	seminar ideas	1	2,373	19-09-2017, 02:35 PM Last Post: jaseela123
	ppt on ONLINE AUCTION	project girl	1	1,881	19-09-2017, 09:49 AM Last Post: jaseela123
	System Analysis (Modeling of the Existing and Proposed System using OOD)	seminar flower	1	2,459	15-09-2017, 03:39 PM Last Post: jaseela123
	Integrating and Designing the Data Mining Technique System Based on Customer	seminar projects maker	1	782	15-09-2017, 02:45 PM Last Post: jaseela123

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.