Principal Cluster Axes: A Projection Pursuit Index for the Preservation of Cluster Structures in the Presence of Data Reduction

Type: Article

Publication Date: 2012-06-15

Citations: 13

DOI: https://doi.org/10.1080/00273171.2012.673952

Abstract

A measure of "clusterability" serves as the basis of a new methodology designed to preserve cluster structure in a reduced dimensional space. Similar to principal component analysis, which finds the direction of maximal variance in multivariate space, principal cluster axes find the direction of maximum clusterability in multivariate space. As such, the principal cluster axes approach falls into the class of projection pursuit techniques. Comparisons are made with existing methodologies in both a simulation study and analyses of real-world data sets. An analysis of Supreme Court voting data demonstrates how to interpret the principal cluster axes, and similarities to the interpretation of competing procedures (e.g., factor analysis and principal component analysis) are noted. In addition to the Supreme Court analysis, we analyze several data sets often used to test cluster analysis procedures, including Fisher's Iris data, Agresti's Crab data, and a data set on glass fragments. Finally, we discuss when the proposed procedure will be most beneficial to the researcher.
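
The contrast the abstract draws between a maximal-variance direction and a maximal-clusterability direction can be illustrated with a small sketch. The abstract does not define the article's clusterability index, so the code below substitutes a hypothetical stand-in (`two_group_index`, the best share of one-dimensional variance explained by a single cut point) and a naive random search over directions. The function names, the synthetic data, and the search strategy are all assumptions made for illustration, not the authors' procedure.

```python
import numpy as np


def first_principal_axis(X):
    """Unit vector of maximal variance (first PCA direction), via SVD."""
    Xc = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return vt[0]


def two_group_index(z):
    """Stand-in index (an assumption, not the article's measure).

    Returns the largest fraction of the variance of the 1-D scores z
    that a single cut point can explain as between-group variance.
    The ratio lies in [0, 1] and does not depend on the scale of z.
    """
    z = np.sort(np.asarray(z, dtype=float))
    n = z.size
    mean = z.mean()
    total = np.sum((z - mean) ** 2)
    if total == 0.0:
        return 0.0
    csum = np.cumsum(z)
    k = np.arange(1, n)                      # size of the left group
    left_mean = csum[:-1] / k
    right_mean = (csum[-1] - csum[:-1]) / (n - k)
    between = k * (left_mean - mean) ** 2 + (n - k) * (right_mean - mean) ** 2
    return float(between.max() / total)


def pursue_axis(X, index, n_candidates=2000, seed=0):
    """Naive projection pursuit: score random unit directions, keep the best."""
    rng = np.random.default_rng(seed)
    Xc = X - X.mean(axis=0)
    dirs = rng.normal(size=(n_candidates, X.shape[1]))
    dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
    scores = np.array([index(Xc @ d) for d in dirs])
    best = int(np.argmax(scores))
    return dirs[best], float(scores[best])


# Synthetic data: two clusters separated along x, with large noise along y.
rng = np.random.default_rng(1)
centers = np.repeat([[-2.0, 0.0], [2.0, 0.0]], 200, axis=0)
X = centers + rng.normal(scale=[0.3, 10.0], size=(400, 2))

pca_axis = first_principal_axis(X)            # expected to follow the noisy y axis
pp_axis, pp_score = pursue_axis(X, two_group_index)  # expected to follow x

print("PCA axis:", np.round(pca_axis, 3))
print("Pursuit axis:", np.round(pp_axis, 3), "index:", round(pp_score, 3))
```

In this toy setting the variance-maximizing axis is dominated by the noisy coordinate, while the index-maximizing axis lines up with the coordinate that separates the two groups. A random search is used only for brevity; a practical projection pursuit would optimize the index with a proper numerical optimizer.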

Locations

  • Europe PMC (PubMed Central)
  • PubMed Central
  • PubMed
  • Multivariate Behavioral Research

Similar Works

  • Projection Pursuit Clustering for Exploratory Data Analysis (2003). Richard J. Bolton, W. J. Krzanowski
  • A Genetic Clustering Algorithm by Monomial Projection Pursuit (2012). Mihaela Breaban, Henri Luchian, Dan A. Simovici
  • Supervised projection pursuit – A dimensionality reduction technique optimized for probabilistic classification (2019). Andrei Barcaru
  • Exploratory Projection Pursuit (1987). Jerome H. Friedman
  • Projection pursuit (2009). J. Rodney Jee
  • Principal Component Analysis (2021). Felipe L. Gewers, Gustavo Rodrigues Ferreira, Henrique Ferraz de Arruda, Filipi N. Silva, César H. Comin, Diego R. Amancio, Luciano da Fontoura Costa
  • REPPlab: An R package for detecting clusters and outliers using exploratory projection pursuit (2019). Daniel Fischer, Alain Berro, Klaus Nordhausen, Anne Ruiz‐Gazen
  • Principal Component Analysis: A Natural Approach to Data Exploration (2018). Felipe L. Gewers, Gustavo Rodrigues Ferreira, Henrique Ferraz de Arruda, Filipi N. Silva, César H. Comin, Diego R. Amancio, Luciano da Fontoura Costa
  • From projection pursuit to other unsupervised chemometric techniques (2007). M. Daszykowski
  • Guided projections for analysing the structure of high-dimensional data (2017). Thomas Ortner, Peter Filzmoser, Maia Zaharieva, Christian Breiteneder, Šárka Brodinová
  • REPPlab: An R package for detecting clusters and outliers using exploratory projection pursuit (2016). Daniel Fischer, Alain Berro, Klaus Nordhausen, Anne Ruiz‐Gazen
  • Guided Projections for Analyzing the Structure of High-Dimensional Data (2018). Thomas Ortner, Peter Filzmoser, Maia Rohm, Christian Breiteneder, Šárka Brodinová
  • Robust principal components: A generalized variance perspective (2008). Rand R. Wilcox
  • Sequential Projection Pursuit Using Genetic Algorithms for Data Mining of Analytical Data (2000). Qian Guo, W. Wu, Frederik Questier, D. L. Massart, C. Boucon, S. De Jong
  • Projection Clustering Unfolding: A New Algorithm for Clustering Individuals or Items in a Preference Matrix (2020). Mariangela Sciandra, Antonio D’Ambrosio, Antonella Plaia
  • Principal Component Analysis (2010). Tormod Næs, Per B. Brockhoff, Oliver Tomić
  • Assessment of Projection Pursuit Index for Classifying High Dimension Low Sample Size Data in R (2023). Zhaoxing Wu, Chunming Zhang