You have performed an unsupervised k-means clustering on a data set with two attributes and the...

Question

Question

You have performed an unsupervised k-means clustering on a data set with two attributes and the...

You have performed an unsupervised k-means clustering on a data set with two attributes and the results indicate a k of 2. Later, you determine the class values for each data instance (there are four class values) and a supervised clustering results in a k of 4. Provide a possible explanation for why the two clustering methods disagree on a k value and a draw a sketch of the two clusterings to go along with your explanation.

engineering Computer-Science

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

The below-shown scenario is an example of what can make such a clustering case,

Supervised Clustering is the problem of training a clustering algorithm to produce desirable clusterings: given sets of items and complete clusterings over these sets, we learn how to cluster future sets of items. Clustering algorithms accept a set of items and produce a partitioning of that set.

Whereas in the case of Unsupervised algorithm, we don't have any guidance we have a change possible, the reason is that we have the goal of this unsupervised machine learning technique is to find similarities in the data point and group similar data points together.

Now when we have a set of items let say of four categories in the supervised clustering then we compare the similarities, of each unclassified element to all the four clusters and thus we have 4 clusters.

Whereas as you can see in the picture we can have the same set of elements without any guidance, the key thing is here there is no particular way of separating the two clusters like blue and red similarly light blue and black.

So when we, have the case of the different algorithms we can have different clusters, actually, even with two unsupervised algorithms we can have the separate number of cluster and that is because it all depends on the similarity and difference measure.

If there is any doubt, you can ask in comments.

Add a comment

Answer 2

You have performed an unsupervised k-means clustering on a data set with two attributes and the...

Homework Answers

Add Answer to:
You have performed an unsupervised k-means clustering on a data set with two attributes and the...

Post as a guest

Earn Coins

1. apply k-means clustering to a dataset Task Consider the following set of two-dimensional records: RID...

Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...

You are given the follow information: You need to apply k-means clustering,Your dataset has 1,000 observations,...

1) For the following set of two-dimensional points, draw a sketch of how they would be split into...

Data clustering and the k means algorithm. However, I'm not able to list all of the...

K-means clustering Problem 1. (10 pts) Suppose that we have the gene expression values for 5...

1. Implement the K-means algorithm using these two as a reference. 2.Use Matlab’s implementation of kmeans...

K-means clustering K-means clustering is a very well-known method of clustering unlabeled data. The simplicity of...

Q1. In a digital classification process “training” a computer can be performed with supervised or unsupervised...

Question: Use the data file DemoKTC file to conduct the following analysis. (a) Use k-means clustering...

You have performed an unsupervised k-means clustering on a data set with two attributes and the...

Homework Answers

Add Answer to: You have performed an unsupervised k-means clustering on a data set with two attributes and the...

Post as a guest

Earn Coins

Add Answer to:
You have performed an unsupervised k-means clustering on a data set with two attributes and the...