Given the set of n data points xi, i =1,n (m-dimensional vectors (xi,1, ... , xi,m) ), k - means clustering partitions them into set C = {C1, ..., Ck} of k clusters, so that each data point xi belongs to the j cluster Cj with the nearest mean μj (cluster centers or cluster centroids): Cj = argminC
k
∑
j=1
∑
‖ xi - μj ‖2
xi ∈ Cj
, where μj = 1| Cj |
∑
x
x ∈ Cj
BR> Please Note:
We have limited the dimension m in the "Manual Entry" tab to 10 due to the limited space on the screen.
Rows (data points) which have all of the cells empty will be disregarded in calculations.