Clustering

Question 1
Marks : +2 | -2
Pass Ratio : 33%
Which of the following is required by K-means clustering?
defined distance metric
number of clusters
initial guess as to cluster centroids
all of the mentioned
Explanation:
K-means clustering follows partitioning approach.
Question 2
Marks : +2 | -2
Pass Ratio : 33%
Hierarchical clustering should be primarily used for exploration.
True
False
Explanation:
Hierarchical clustering is deterministic.
Question 3
Marks : +2 | -2
Pass Ratio : 33%
K-means is not deterministic and it also consists of number of iterations.
True
False
Explanation:
K-means clustering produces the final estimate of cluster centroids.
Question 4
Marks : +2 | -2
Pass Ratio : 33%
Which of the following clustering requires merging approach?
Partitional
Hierarchical
Naive Bayes
None of the mentioned
Explanation:
Hierarchical clustering requires a defined distance as well.
Question 5
Marks : +2 | -2
Pass Ratio : 33%
Point out the wrong statement.
k-means clustering is a method of vector quantization
k-means clustering aims to partition n observations into k clusters
k-nearest neighbor is same as k-means
none of the mentioned
Explanation:
k-nearest neighbor has nothing to do with k-means.
Question 6
Marks : +2 | -2
Pass Ratio : 33%
Which of the following is finally produced by Hierarchical Clustering?
final estimate of cluster centroids
tree showing how close things are to each other
assignment of each point to clusters
all of the mentioned
Explanation:
Hierarchical clustering is an agglomerative approach.
Question 7
Marks : +2 | -2
Pass Ratio : 33%
Point out the correct statement.
The choice of an appropriate metric will influence the shape of the clusters
Hierarchical clustering is also called HCA
In general, the merges and splits are determined in a greedy manner
All of the mentioned
Explanation:
Some elements may be close to one another according to one distance and farther away according to another.
Question 8
Marks : +2 | -2
Pass Ratio : 33%
Which of the following function is used for k-means clustering?
k-means
k-mean
heatmap
none of the mentioned
Explanation:
K-means requires a number of clusters.
Question 9
Marks : +2 | -2
Pass Ratio : 33%
Which of the following clustering type has characteristic shown in the below figure?
Partitional
Hierarchical
Naive bayes
None of the mentioned
Explanation:
Hierarchical clustering groups data over a variety of scales by creating a cluster tree or dendrogram.
Question 10
Marks : +2 | -2
Pass Ratio : 33%
Which of the following combination is incorrect?
Continuous – euclidean distance
Continuous – correlation similarity
Binary – manhattan distance
None of the mentioned
Explanation:
You should choose a distance/similarity that makes sense for your problem.