MMiDS 1.3: Self-Assessment Quiz

Which of these is NOT a property of a valid partition \( C_1, \ldots, C_k \) in the context of \( k \)-means?

In the \( k \)-means objective function, what does the variable \( \boldsymbol{\mu}_i \) represent?

The \( k \)-means objective function is a measure of what?

How does the \( k \)-means algorithm update the partition in each iteration?

Which of the following statements about the \( k \)-means algorithm is TRUE?

What is a key property of the sequence of objective function values produced by the \( k \)-means algorithm?

Which mathematical property is essential in the equivalence of minimizing \( \|\mathbf{x}_j - \boldsymbol{\mu}_i\|^2 \) and \( \|\mathbf{x}_j - \boldsymbol{\mu}_i\| \)?

What is the interpretation of the matrix \( Z \) in the matrix formulation of \( k \)-means?

What property must a matrix \( Z \) representing cluster assignments satisfy?

Why is standardization often applied to data before running k-means clustering?

What is the significance of the Frobenius norm in the context of \( k \)-means clustering?