Generalization-Based Mining of Plan Databases by Divide-and-Conquer

**study tips** · 13-05-2013, 04:27 PM

Generalization-Based Mining of Plan Databases by Divide-and-Conquer

.docx

Generalization-Based Mining.docx (Size: 487.46 KB / Downloads: 26)

INTRODUCTION

To show how generalization can play an important role in mining complex databases,
we examine a case of mining significant patterns of successful actions in a plan database
using a divide-and-conquer strategy.
A plan consists of a variable sequence of actions. A plan database, or simply a
planbase, is a large collection of plans. Plan mining is the task of mining significant
patterns or knowledge from a planbase. Plan mining can be used to discover travel
patterns of business passengers in an air flight database or to find significant patterns
from the sequences of actions in the repair of automobiles. Plan mining is different
from sequential pattern mining, where a large number of frequently occurring
sequences are mined at a very detailed level. Instead, plan mining is the extraction
of important or significant generalized (sequential) patterns from a planbase.
Let’s examine the plan mining process using an air travel example.
Example 10.4 An air flight planbase. Suppose that the air travel planbase shown in Table 10.1 stores
customer flight sequences, where each record corresponds to an action in a sequential
database, and a sequence of records sharing the same plan number is considered as one
plan with a sequence of actions. The columns departure and arrival specify the codes of
the airports involved. Table 10.2 stores information about each airport.
There could be many patterns mined from a planbase like Table 10.1. For example,
we may discover that most flights fromcities in the Atlantic United States toMidwestern
cities have a stopover at ORD in Chicago, which could be because ORD is the principal
hub for several major airlines. Notice that the airports that act as airline hubs (such
as LAX in Los Angeles, ORD in Chicago, and JFK in New York) can easily be derived
from Table 10.2 based on airport size. However, there could be hundreds of hubs in a
travel database. Indiscriminate mining may result in a large number of “rules” that lack
substantial support, without providing a clear overall picture.

SEQUENTIAL DATA MINING:

Sequence mining is a topic of data mining concerned with finding statistically relevant patterns between data examples where the values are delivered in a sequence.[1] It is usually presumed that the values are discrete, and thus time series mining is closely related, but usually considered a different activity. Sequence mining is a special case of structured data mining.
There are several key traditional computational problems addressed within this field. These include building efficient databases and indexes for sequence information, extracting the frequently occurring patterns, comparing sequences for similarity, and recovering missing sequence members. In general, sequence mining problems can be classified as string mining which is typically based on string processing algorithms and itemset mining which is typically based on association rule learning.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Design and Analysis Of Algorithms : Seminar Report and PPT	seminar projects maker	1	1,315	21-09-2017, 12:04 PM Last Post: jaseela123
	Data Mining: What is Data Mining? Report	project girl	1	2,262	21-09-2017, 11:47 AM Last Post: jaseela123
	A TECHNICAL SEMINOR REPORT ON EYE-MOVEMENT BASED HUMAN-COMPUTER INTERACTION	study tips	1	1,101	14-09-2017, 09:49 AM Last Post: jaseela123
	INCREMENTAL MINING USING FREQUENT PATTERN TREE	project topics	1	10,061,816	13-09-2017, 09:40 AM Last Post: jaseela123
	Case Based Reasoning System	presentation Abstract	1	653	06-09-2017, 03:15 PM Last Post: jaseela123
	Computer-Based Information System	seminar tips	1	1,021	06-09-2017, 01:00 PM Last Post: jaseela123
	Report on Data Mining Technique	study tips	1	986	31-08-2017, 12:45 PM Last Post: jaseela123
	OBJECT ORIENTED ANALYSIS AND DESIGN TWO MARK AND SIXTEEN MARK Q and A	seminar ideas	1	1,982	29-08-2017, 11:23 AM Last Post: jaseela123
	Attendance System Applied in Classroom Based on Face Image	dhanabhagya	0	640	25-08-2017, 09:32 PM Last Post: dhanabhagya
	Seminar Report On ASSOCIATION MINING	Computer Science Clay	0	13,214,371	25-08-2017, 09:32 PM Last Post: Computer Science Clay

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.