Go to:
Logótipo
Você está em: Start » Publications » View » The search of conditional outliers
Publication

The search of conditional outliers

Title
The search of conditional outliers
Type
Article in International Scientific Journal
Year
2019
Authors
Portela, E
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Rita Ribeiro
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
João Gama
(Author)
FEP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Journal
Vol. 23
Pages: 23-39
ISSN: 1088-467X
Publisher: IOS PRESS
Other information
Authenticus ID: P-00Q-917
Abstract (EN): There is no standard definition of outliers, but most authors agree that outliers are points far from other data points. Several outlier detection techniques have been developed mainly for two different purposes. On one hand, outliers are considered error measurement observations that should be removed from the analysis, e.g. robust statistics. On the other hand, outliers are the interesting observations, like in fraud detection, and should be modelled by some learning method. In this work, we start from the observation that outliers are affected by the so-called simpson paradox: a trend that appears in different groups of data but disappears or reverses when these groups are combined. Given a data set, we learn a regression tree. The tree grows by partitioning the data into groups more and more homogeneous of the target variable. At each partition defined by the tree, we apply a box plot on the target variable to detect outliers. We would expect that the deeper nodes of the tree would contain less and less outliers. We observe that some points previously signalled as outliers are no more signalled as such, but new outliers appear.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 17
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Outliers and the Simpson's Paradox (2017)
Article in International Conference Proceedings Book
Portela, E; Rita Ribeiro; João Gama

Of the same journal

Ubiquitous Knowledge Discovery Introduction (2011)
Another Publication in an International Scientific Journal
João Gama; May, M
Mining official data (2003)
Another Publication in an International Scientific Journal
brito, p; malerba, d
Knowledge discovery from data streams (2008)
Another Publication in an International Scientific Journal
João Gama; Aguilar Ruiz, J; Klinkenberg, R
Knowledge discovery from data streams (2007)
Another Publication in an International Scientific Journal
João Gama; Aguilar Ruiz, J
Incremental learning and concept drift: Editor's introduction (2004)
Another Publication in an International Scientific Journal
Kubat, M; João Gama; Utgoff, P

See all (39)

Recommend this page Top
Copyright 1996-2024 © Faculdade de Medicina da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z  I Guest Book
Page created on: 2024-08-27 at 18:13:59
Acceptable Use Policy | Data Protection Policy | Complaint Portal | Política de Captação e Difusão da Imagem Pessoal em Suporte Digital