To design a fuzzy similarity based self-constructing feature clustering algorithm

**study tips** · 15-02-2013, 11:12 AM

INTRODUCTION

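In text classification, the dimensionality of the feature vector is usually huge, as in two real-world data sets: 20 Newsgroups and Reuters21578.
Such high dimensionality is a severe obstacle for classification.
Feature reduction lowers the dimensionality before classification.
Two major approaches to feature reduction:
Feature selection (keeping a subset of the original words)
Feature extraction (combining the original words into a smaller set of new features)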

EXISTING SYSTEM

The parameter K, indicating the desired number of extracted features, has to be specified in advance. This places a burden on the user, since trial and error is needed until an appropriate number of extracted features is found.
When calculating similarities, the variance of the underlying cluster is not considered. Intuitively, the distribution of the data in a cluster is an important factor in the calculation of similarity.
All words in a cluster have the same degree of contribution to the resulting extracted feature.

A FUZZY SELF-CONSTRUCTING FEATURE CLUSTERING ALGORITHM FOR TEXT CLASSIFICATION

The proposed method is a fuzzy similarity based self-constructing algorithm for feature clustering.
The words in the feature vector of a document set are grouped into clusters based on a similarity test.
Each cluster is characterized by a membership function with statistical mean and deviation.
The extracted feature corresponding to a cluster is a weighted combination of the words contained in the cluster.
The derived membership functions match closely with and describe properly the real distribution of the training data.
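
The clustering step described above can be sketched roughly as follows. This is only a minimal illustration, not the authors' implementation: the Gaussian form of the membership function, the threshold `rho`, the initial deviation `sigma0`, the function names, and the sample word patterns are all assumptions introduced here. A word pattern joins the cluster with the highest membership if it passes the similarity test; otherwise a new cluster is created, so the number of clusters does not have to be given in advance.

```python
import math

def membership(x, mean, dev):
    """Assumed Gaussian membership: product over dimensions of exp(-((x-m)/s)^2)."""
    return math.prod(
        math.exp(-((xi - mi) / di) ** 2) for xi, mi, di in zip(x, mean, dev)
    )

def self_construct(patterns, rho=0.5, sigma0=0.25):
    """Group word patterns into clusters; rho and sigma0 are hypothetical settings."""
    clusters = []  # each cluster stores its members, mean and deviation
    for x in patterns:
        scores = [membership(x, c["mean"], c["dev"]) for c in clusters]
        best = max(range(len(scores)), key=scores.__getitem__, default=None)
        if best is None or scores[best] < rho:
            # Similarity test failed for every cluster: start a new one.
            clusters.append(
                {"members": [x], "mean": list(x), "dev": [sigma0] * len(x)}
            )
            continue
        # Similarity test passed: add x and recompute mean and deviation.
        c = clusters[best]
        c["members"].append(x)
        n = len(c["members"])
        columns = list(zip(*c["members"]))
        c["mean"] = [sum(col) / n for col in columns]
        c["dev"] = [
            math.sqrt(sum((v - m) ** 2 for v in col) / n) + sigma0
            for col, m in zip(columns, c["mean"])
        ]
    return clusters

# Made-up two-dimensional word patterns, for illustration only.
patterns = [(0.9, 0.1), (0.85, 0.15), (0.1, 0.9), (0.12, 0.88)]
clusters = self_construct(patterns)
for c in clusters:
    print(c["mean"], c["dev"], len(c["members"]))
```

Recomputing the mean and deviation from the stored members keeps the sketch short; an incremental update would avoid storing every member.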

FEATURE EXTRACTION

Feature extraction can be expressed as D' = DT, where D is the original document set, T is the weighting matrix, and D' is the reduced document set.
Once the word patterns have been grouped into clusters, T is built using one of three weighting approaches:
1. In the hard-weighting approach, each word is allowed to belong to only one cluster, and so it contributes to only one new extracted feature.
2. In the soft-weighting approach, each word is allowed to contribute to all new extracted features.
3. The mixed-weighting approach is a combination of the hard-weighting approach and the soft-weighting approach.
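
As a rough illustration of the three weighting approaches and of D' = DT, the sketch below builds a weighting matrix T from cluster memberships and multiplies it with a small document matrix. Everything specific here is an assumption made for the example: the Gaussian membership form, the mixing parameter `gamma`, the function names, and the toy data. It is a sketch under those assumptions, not the method as published.

```python
import math

def membership(x, mean, dev):
    """Assumed Gaussian membership of word pattern x in a cluster."""
    return math.prod(
        math.exp(-((xi - mi) / di) ** 2) for xi, mi, di in zip(x, mean, dev)
    )

def weighting_matrix(patterns, clusters, mode="hard", gamma=0.5):
    """Return T with one row per word and one column per cluster.

    clusters is a list of (mean, dev) pairs; gamma is a hypothetical
    mixing weight used only by the mixed approach.
    """
    T = []
    for x in patterns:
        mu = [membership(x, mean, dev) for mean, dev in clusters]
        winner = max(range(len(mu)), key=mu.__getitem__)
        hard = [1.0 if j == winner else 0.0 for j in range(len(mu))]
        if mode == "hard":      # word contributes to exactly one feature
            row = hard
        elif mode == "soft":    # word contributes to every feature
            row = mu
        elif mode == "mixed":   # blend of the two
            row = [gamma * h + (1 - gamma) * s for h, s in zip(hard, mu)]
        else:
            raise ValueError(f"unknown mode: {mode}")
        T.append(row)
    return T

def extract(D, T):
    """Compute D' = D T for D (documents x words) and T (words x clusters)."""
    k = len(T[0])
    return [[sum(d[i] * T[i][j] for i in range(len(T))) for j in range(k)] for d in D]

# Toy data: three words, two clusters, two documents (word counts).
patterns = [(0.9, 0.1), (0.8, 0.2), (0.1, 0.9)]
clusters = [((0.9, 0.1), (0.25, 0.25)), ((0.1, 0.9), (0.25, 0.25))]
D = [[2, 1, 0], [0, 0, 3]]

T_hard = weighting_matrix(patterns, clusters, mode="hard")
T_soft = weighting_matrix(patterns, clusters, mode="soft")
T_mixed = weighting_matrix(patterns, clusters, mode="mixed", gamma=0.5)
D_reduced = extract(D, T_hard)
print(D_reduced)
```

With hard weighting each row of T has a single nonzero entry, so each document's reduced features are simply sums of the word counts in each cluster; soft and mixed weighting spread each word's count across clusters according to its membership values.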
