k-means++

**seminar ideas** · 19-04-2012, 12:42 PM


The k-means Problem

Given an integer k and n data points in $\mathbb{R}^d$
Partition points into k “similar” clusters

More formally:
Choose k centers $c_i$ so as to minimize the potential function:
$\phi = \sum_{x \in X} \min_{i} \lVert x - c_i \rVert^2$, where X is the set of data points
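
As a rough illustration, the potential can be evaluated directly. This sketch assumes the potential is the standard k-means objective (the sum over all points of the squared Euclidean distance to the nearest center); the names `squared_distance` and `potential` and the toy data are illustrative, not from the slides.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def potential(points, centers):
    """Sum over all points of the squared distance to the nearest center.

    Assumption: this is the standard k-means objective.
    """
    return sum(min(squared_distance(x, c) for c in centers) for x in points)


# Toy data: two natural clusters, one near x = 0 and one near x = 10.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
good = [(0.0, 1.0), (10.0, 1.0)]  # one center per natural cluster
bad = [(0.0, 0.0), (0.0, 2.0)]    # both centers in the same cluster

print(potential(points, good))
print(potential(points, bad))
```

On this toy data the well-spread centers give a much smaller potential than the two centers crowded into one cluster.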

Lloyd’s Algorithm

Often just called the “k-means method”

Guess k initial centers $c_i$:
How? Uniformly at random from the data points

Repeat until stable:
Assign each point to the closest $c_i$
Set each $c_i$ to be the center of mass of the points assigned to it
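
The steps above can be sketched as follows. This is a minimal pure-Python sketch, not a tuned implementation; the function name `lloyd`, the `max_iter` cap, and the choice to leave an empty cluster's center where it is are assumptions made for illustration.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def lloyd(points, k, rng=None, max_iter=100):
    """Lloyd's method with uniform random seeding from the data points."""
    rng = rng or random.Random()
    centers = rng.sample(points, k)  # guess k initial centers
    for _ in range(max_iter):
        # Assignment step: each point goes to its closest center.
        clusters = [[] for _ in range(k)]
        for x in points:
            j = min(range(k), key=lambda i: squared_distance(x, centers[i]))
            clusters[j].append(x)
        # Update step: move each center to the center of mass of its points.
        new_centers = []
        for j, cluster in enumerate(clusters):
            if cluster:
                dim = len(cluster[0])
                mean = tuple(sum(p[d] for p in cluster) / len(cluster)
                             for d in range(dim))
                new_centers.append(mean)
            else:
                # Assumption: an empty cluster keeps its previous center.
                new_centers.append(centers[j])
        if new_centers == centers:  # stable: no center moved
            break
        centers = new_centers
    return centers


points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
print(lloyd(points, 2, rng=random.Random(0)))
```

Note that the result depends on the initial guess: if both starting centers fall in the same natural cluster, the loop can stabilize at a poor local optimum.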

Approximation Algorithms

Lots of approximation algorithms available:
Har-Peled and Mazumdar ’04:
$(1 + \varepsilon)$-approximation in $O(n + k^{k+2} \varepsilon^{-2dk} \log^k(n/\varepsilon))$ time
Kanungo, Mount, et al. ’04:
$(9 + \varepsilon)$-approximation in $O(n^3 / \varepsilon^d)$ time
Facility location with “relaxed” metric spaces

The Intuition

Easy way to fix the mistake of badly placed starting centers:
Make centers far away from each other

The k-means++ way:
Choose starting centers iteratively
Let D(x) be the distance from x to the nearest existing center
Take x as a new center with probability proportional to $D(x)^2$
Run standard Lloyd’s method with these centers
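
The seeding step can be sketched as below. The slide does not say how the first center is picked; this sketch assumes it is drawn uniformly at random from the data, as in the usual description of k-means++. The name `kmeanspp_seed` is illustrative, and the sketch assumes the data contains at least k distinct points (otherwise every weight becomes zero and `random.choices` raises an error).

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeanspp_seed(points, k, rng=None):
    """Pick k starting centers by D(x)^2-weighted sampling."""
    rng = rng or random.Random()
    # Assumption: the first center is chosen uniformly at random.
    centers = [rng.choice(points)]
    while len(centers) < k:
        # D(x)^2: squared distance from each point to its nearest chosen center.
        weights = [min(squared_distance(x, c) for c in centers) for x in points]
        # Sample the next center with probability proportional to D(x)^2.
        # Points already chosen have weight 0 and are not picked again.
        centers.append(rng.choices(points, weights=weights, k=1)[0])
    return centers


points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
print(kmeanspp_seed(points, 2, rng=random.Random(0)))
```

The returned centers would then replace the uniform random guess as the starting point for Lloyd's iterations. Because far-away points get large weights, the second center here is most likely to land in the cluster that the first center missed.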

Conclusion

Friends don’t let friends use k-means!
