Your browser doesn't support javascript.
loading
Clusters that are not there: An R tutorial and a Shiny app to quantify a priori inferential risks when using clustering methods.
Toffalini, Enrico; Gambarota, Filippo; Perugini, Ambra; Girardi, Paolo; Tobia, Valentina; Altoè, Gianmarco; Giofrè, David; Feraco, Tommaso.
Afiliación
  • Toffalini E; Department of General Psychology, University of Padova, Padova, Italy.
  • Gambarota F; Department of Developmental Psychology and Socialization, University of Padova, Padova, Italy.
  • Perugini A; Department of Developmental Psychology and Socialization, University of Padova, Padova, Italy.
  • Girardi P; Department of Environmental Sciences, Informatics and Statistics-University Ca' Foscari, Venice, Italy.
  • Tobia V; Department of Psychology, University Vita-Salute San Raffaele, Milan, Italy.
  • Altoè G; Department of Developmental Psychology and Socialization, University of Padova, Padova, Italy.
  • Giofrè D; DISFOR-University of Genova, Italy.
  • Feraco T; Department of General Psychology, University of Padova, Padova, Italy.
Int J Psychol ; 2024 Sep 19.
Article en En | MEDLINE | ID: mdl-39300789
ABSTRACT
Clustering methods are increasingly used in social science research. Generally, researchers use them to infer the existence of qualitatively different types of individuals within a larger population, thus unveiling previously "hidden" heterogeneity. Depending on the clustering technique, however, valid inference requires some conditions and assumptions. Common risks include not only failing to detect existing clusters due to a lack of power but also revealing clusters that do not exist in the population. Simple data simulations suggest that under conditions of sample size, number, correlation and skewness of indicators that are frequently encountered in applied psychological research, commonly used clustering methods are at a high risk of detecting clusters that are not there. Generally, this is due to some violations of assumptions that are not usually considered critical in psychology. The present article illustrates a simple R tutorial and a Shiny app (for those who are not familiar with R) that allow researchers to quantify a priori inferential risks when performing clustering methods on their own data. Doing so is suggested as a much-needed preliminary sanity check, because conditions that inflate the number of detected clusters are very common in applied psychological research scenarios.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Int J Psychol Año: 2024 Tipo del documento: Article País de afiliación: Italia Pais de publicación: Reino Unido

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Int J Psychol Año: 2024 Tipo del documento: Article País de afiliación: Italia Pais de publicación: Reino Unido