Distance based k-means clustering algorithm for determining number of clusters for high dimensional data
To improve the clustering task on high dimensional data sets, the distance based k-means algorithm is proposed. The proposed algorithm is tested using eighteen sets of normal and non-normal multivariate simulation data under various combinations.