How does the shape of clusters create a challenge when implementing a clustering algorithm? How would...

Question

Question

How does the shape of clusters create a challenge when implementing a clustering algorithm? How would...

How does the shape of clusters create a challenge when implementing a clustering algorithm?

How would you pick k when using the k-Means algorithm? Explain your reasoning.

engineering Computer-Science

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

Assuming the number of clusters k is predefined, and consequently, we get k clusters as an output, this may lead to addition of those data points that share no similarity with any cluster in a wrong cluster. These additional data points may change the shape of the cluster, which they are part of, leading to less accurate interpretation of the resulting clusters.

For example, in the graph below, for k=2, clearly 2 different clusters are formed, but there lies a point x almost equidistant from both the clusters. Whichever cluster x becomes a part of affects the shape of that cluster and also reduces the similarity among that cluster data points.

We can pick the most accurate k using elbow method. We plot the graph between the values of k and wcss (within cluster sum of squares, which is sum of squares of all centroids of each cluster for the k clusters). The value of k after which the wcss starts decreasing the most linearly is the optimal number of k. This is because minimising the wcss increases the distance between different clusters, leading to more similar data points being grouped together in each cluster.

For example, for the below elbow plot, k=y is the most optimal value for k.

Add a comment

Answer 2

How does the shape of clusters create a challenge when implementing a clustering algorithm? How would...

Homework Answers

Add Answer to:
How does the shape of clusters create a challenge when implementing a clustering algorithm? How would...

Post as a guest

Earn Coins

Hierarchical clustering is sometimes used to generate K clusters, K > 1 by taking the clusters...

Suppose you have been building a model using the k-means clustering algorithm and you keep finding...

a) Why is implementing a K-means clustering algorithm multiple times with a fixed K important to do? 119 b) Why is cross-validation preferred over resubstituting as a method to measure classificat...

a) How does PAM (K-medoids) form clusters; how does DBSCAN form clusters? b) Assume you apply...

1) For the following set of two-dimensional points, draw a sketch of how they would be split into...

c++ question. implementing linked list how would you create the bellman-ford algorithm for weighted graphs?

1. Implement the K-means algorithm using these two as a reference. 2.Use Matlab’s implementation of kmeans...

Business Analytics, Assignment on Clustering As part of the quarterly reviews, the manager of a r...

In C++ program a simple k-means clustering algorithm, kmeans, using the Euclidean distance for 2-dimensional numerical...

Data clustering and the k means algorithm. However, I'm not able to list all of the...

How does the shape of clusters create a challenge when implementing a clustering algorithm? How would...

Homework Answers

Add Answer to: How does the shape of clusters create a challenge when implementing a clustering algorithm? How would...

Post as a guest

Earn Coins

Add Answer to:
How does the shape of clusters create a challenge when implementing a clustering algorithm? How would...