Go to:
Logótipo
You are in:: Start > Publications > View > Robust Clustering Method for the Detection of Outliers: Using AIC to Select the Number of Clusters
Map of Premises
FC6 - Departamento de Ciência de Computadores FC5 - Edifício Central FC4 - Departamento de Biologia FC3 - Departamento de Física e Astronomia e Departamento GAOT FC2 - Departamento de Química e Bioquímica FC1 - Departamento de Matemática
Publication

Robust Clustering Method for the Detection of Outliers: Using AIC to Select the Number of Clusters

Title
Robust Clustering Method for the Detection of Outliers: Using AIC to Select the Number of Clusters
Type
Chapter or Part of a Book
Year
2013
Authors
Carla Santos Pereira
(Author)
FEUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications Without AUTHENTICUS Without ORCID
Ana M. Pires
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Scientific classification
FOS: Natural sciences
Other information
Abstract (EN): In [14] we proposed a method to detect outliers in multivariate data based on clustering and robust estimators. To implement this method in practice it is necessary to choose a clustering method, a pair of location and scatter estimators, and the number of clusters, k. After several simulation experiments it was possible to give a number of guidelines regarding the first two choices. However the choice of the number of clusters depends entirely on the structure of the particular data set under study. Our suggestion is to try several values of k (e.g. from 1 to a maximum reasonable k which depends on the number of observations and on the number of variables) and select k minimizing an adapted AIC. In this paper we analyze this AIC based criterion for choosing the number of clusters k (and also the clustering method and the location and scatter estimators) by applying it to several simulated data sets with and without outliers.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 8
License type: Click to view license CC BY-NC
Documents
File name Description Size
csp196807_revised 96.57 KB
Related Publications

Of the same authors

Classificação com dúvidas: compromisso erro/rejeição (2000)
Poster in a National Conference
Carla Santos-Pereira; Ana M. Pires
Algumas questões em aberto na análise discriminante para três grupos. (1998)
Poster in a National Conference
Carla Santos-Pereira; Ana M. Pires
Robustness of AIC based criterion for selecting the number of clusters (2008)
Poster in an International Conference
Carla Santos-Pereira; Ana M. Pires
Using Clustering and Robust Estimators to Detect Outliers in Multivariate Data. (2005)
Summary of Presentation in an International Conference
Ana M. Pires; Carla Santos-Pereira

See all (10)

Of the same book

Scaling exponents in heart rate variability (2013)
Chapter or Part of a Book
Argentina Leite; Maria Eduarda Silva; Ana Paula Rocha
Asymptotic distribution of the maximum for a chaotic economic model. (2013)
Chapter or Part of a Book
Ana Cristina Moreira Freitas
Adaptive Choice of Thresholds and the Bootstrap Methodology: An Empirical Study (2013)
Chapter or Part of a Book
M. Ivette Gomes; Fernanda Figueiredo; M. Manuela Neves
Recommend this page Top
Copyright 1996-2025 © Faculdade de Ciências da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z  I Guest Book
Page created on: 2025-06-27 at 19:27:23 | Acceptable Use Policy | Data Protection Policy | Complaint Portal