%0 Journal Article %T Clustering Algorithms for Categorical Data: A Monte Carlo Study %J International Journal of Statistics and Applications %@ 2168-5215 %D 2012 %I %R 10.5923/j.statistics.20120204.01 %X In this paper the clustering algorithms: average linkage, ROCK, k-modes, fuzzy k-modes and k-populations were compared by means of Monte Carlo simulation. Data were simulated from Beta and Uniform distributions considering factors such as clusters overlapping, number of groups, variables and categories. A total of 64 population structures of clusters were simulated considering smaller and higher degree of overlapping, number of clusters, variables and categories. The results showed that overlapping was the factor with major impact in the algorithm¡¯s accuracy which decreases as the number of clusters increases. In general, ROCK presented the best performance considering overlapping and non-overlapping cases followed by k-modes and fuzzy k-Modes. The k-populations algorithm showed better accuracy only in cases where there was a small degree of overlapping with performance similar to the average linkage. The superiority of k-populations algorithm over k-modes and fuzzy k-modes presented in previous studies, which were based only in benchmark data, was not confirmed in this simulation study. %K Clustering %K Categorical Data %K Monte Carlo Simulation %U http://article.sapub.org/10.5923.j.statistics.20120204.01.html