Article Details

Title An evaluation of the IFCS Cluster Benchmarking Data Analysis Challenge
Authors Hennig, Christian
Year 2019
Volume Archives of Data Science, Series B 1(1) / 2019
Abstract Eight clusterings of a lower back pain dataset were submitted to the IFCS Benchmarking Cluster Analysis Challenge. The aim of the challenge was to find clusterings of the 112 baseline variables that help with predicting 9 outcome variables. These clusterings are compared here, using data visualisation (multidimensional scaling and discriminant coordinates on both baseline and outcome variables), outcome means and uncertainty intervals, and four cluster validation indices, namely the Average SilhouetteWidth, the Pearson correlation version of Hubert’s Γ, the Calinski/Harabasz index, and the Adjusted Rand Index. The different comparison approaches give quite different assessments of the clustering quality.