20-03-2014, 12:22 PM
DATA MINING PROCESS
INTRODUCTION
Dramatic advances in data capture, processing power, data transmission, and
storage capabilities are enabling organizations to integrate their various databases
into data warehouses. Data warehousing is defined as a process of centralized data
management and retrieval. Data warehousing, like data mining, is a relatively new
term although the concept itself has been around for years. Data warehousing
represents an ideal vision of maintaining a central repository of all organizational
data. Centralization of data is needed to maximize user access and analysis.
Dramatic technological advances are making this vision a reality for many
companies, and equally dramatic advances in data analysis software are allowing
users to access this data freely. It is this data analysis software that supports
data mining.
NECESSITY OF DATA MINING
Data mining is primarily used today by companies with a strong consumer focus -
retail, financial, communication, and marketing organizations. It enables these
companies to determine relationships among "internal" factors such as price,
product positioning, or staff skills, and "external" factors such as economic
indicators, competition, and customer demographics. And, it enables them to
determine the impact on sales, customer satisfaction, and corporate profits. Finally,
it enables them to "drill down" into summary information to view detail
transactional data.
STATISTICA TOOL
STATISTICA is a statistics and analytics software package developed by StatSoft.
STATISTICA provides data analysis, data management, data mining, and data
visualization procedures. STATISTICA product categories include Enterprise (for use
across a site or organization), Web-Based (for use with a server and web browser),
Concurrent Network Desktop, and Single-User Desktop.
Different packages of analytical techniques are available in six product lines:
(1) Desktop, (2) Data Mining, (3) Enterprise, (4) Web-Based, (5) Connectivity
and Data Integration Solutions, and (6) Power Solutions.
DATA SET
A data set (or dataset) is a collection of data, usually presented in tabular form.
Each column represents a particular variable. Each row corresponds to a given
member of the data set in question. It lists values for each of the variables, such as
height and weight of an object. Each value is known as a datum. The data set may
comprise data for one or more members, corresponding to the number of rows.
Non-tabular data sets can take the form of marked-up strings of characters, such
as an XML file.
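The row/column structure described above can be sketched in a few lines. The variable names (name, height_cm, weight_kg) and values here are illustrative assumptions, not taken from the text:

```python
# A minimal sketch of a tabular data set: each column is a variable,
# each row is one member of the data set, and each value is a datum.
import csv
import io

raw = """name,height_cm,weight_kg
alpha,172,68
beta,165,59
gamma,180,81
"""

# Parse the table into rows; every row maps variable names to values.
rows = list(csv.DictReader(io.StringIO(raw)))

for row in rows:
    print(row["name"], row["height_cm"], row["weight_kg"])

# The data set comprises as many members as it has rows.
print("members:", len(rows))  # members: 3
```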
DATA DEDUPLICATION
In computing, data deduplication is a specialized data compression technique for
eliminating coarse-grained redundant data. The technique is used to improve storage
utilization and can also be applied to network data transfers to reduce the number of
bytes that must be sent across a link. In the deduplication process, unique chunks of
data, or byte patterns, are identified and stored during a process of analysis. As the
analysis continues, other chunks are compared to the stored copies and, whenever a
match occurs, the redundant chunk is replaced with a small reference that points to
the stored chunk.
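The chunk-and-reference process described above can be sketched as follows. The fixed 8-byte chunk size and the SHA-256 digest used as the reference are illustrative assumptions; real systems typically use much larger (and often variable-sized) chunks:

```python
# Sketch of chunk-level deduplication: split the data into chunks, store
# each unique chunk once, and represent the stream as a list of small
# references (content hashes) into the chunk store.
import hashlib

CHUNK_SIZE = 8  # bytes; tiny on purpose, for illustration only

def deduplicate(data: bytes):
    store = {}       # digest -> unique chunk bytes (each stored once)
    references = []  # ordered digests that reconstruct the original data
    for i in range(0, len(data), CHUNK_SIZE):
        chunk = data[i:i + CHUNK_SIZE]
        digest = hashlib.sha256(chunk).hexdigest()
        if digest not in store:
            store[digest] = chunk      # first occurrence: keep the chunk
        references.append(digest)      # redundant chunks become references
    return store, references

def reconstruct(store, references):
    # Follow each reference back to its stored chunk.
    return b"".join(store[d] for d in references)

data = b"ABCDEFGH" * 4 + b"12345678"   # highly redundant input (40 bytes)
store, refs = deduplicate(data)
print(len(refs), "chunks referenced,", len(store), "stored uniquely")
# -> 5 chunks referenced, 2 stored uniquely
assert reconstruct(store, refs) == data
```

Storage improves because the four identical "ABCDEFGH" chunks are kept once and referenced four times; the same idea applied to a network link means only previously unseen chunks need to cross the wire.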