Abstract (EN):
There is no single standard definition of an outlier, but most authors agree that outliers are points that lie far from the other data points. Several outlier detection techniques have been developed, mainly for two different purposes. On one hand, outliers are regarded as measurement-error observations that should be removed from the analysis, e.g. in robust statistics. On the other hand, outliers are the interesting observations, as in fraud detection, and should be modelled by some learning method. In this work, we start from the observation that outliers are affected by the so-called Simpson's paradox: a trend that appears in different groups of data but disappears or reverses when these groups are combined. Given a data set, we learn a regression tree. The tree grows by partitioning the data into groups that are increasingly homogeneous with respect to the target variable. At each partition defined by the tree, we apply the box-plot rule to the target variable to detect outliers. We would expect the deeper nodes of the tree to contain fewer and fewer outliers. We observe that some points previously signalled as outliers are no longer signalled as such, while new outliers appear.
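To make the procedure concrete, below is a minimal sketch, not the authors' implementation: it grows a regression tree with scikit-learn's DecisionTreeRegressor and applies the standard 1.5×IQR box-plot whisker rule to the target values inside each leaf partition. The synthetic data, the `boxplot_outliers` helper, and all parameter values are illustrative assumptions.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def boxplot_outliers(y):
    """Boolean mask of points outside the 1.5*IQR box-plot whiskers."""
    q1, q3 = np.percentile(y, [25, 75])
    iqr = q3 - q1
    return (y < q1 - 1.5 * iqr) | (y > q3 + 1.5 * iqr)

# Illustrative data: a strong global trend plus a few injected outliers.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(500, 1))
y = 3.0 * X[:, 0] + rng.normal(0, 1, size=500)
y[rng.choice(500, 10, replace=False)] += 15

# Box-plot rule on the whole data set (the root partition).
root_mask = boxplot_outliers(y)

# Partition the data with a shallow regression tree and re-apply
# the box-plot rule within each leaf (deeper, more homogeneous groups).
tree = DecisionTreeRegressor(max_depth=3, min_samples_leaf=30).fit(X, y)
leaf_ids = tree.apply(X)
leaf_mask = np.zeros_like(root_mask)
for leaf in np.unique(leaf_ids):
    idx = leaf_ids == leaf
    leaf_mask[idx] = boxplot_outliers(y[idx])

# Points flagged at the root but not in the leaves (and vice versa)
# illustrate how partitioning changes which observations look anomalous.
print("flagged at root only:", np.sum(root_mask & ~leaf_mask))
print("flagged in leaf only:", np.sum(~root_mask & leaf_mask))
print("flagged in both:     ", np.sum(root_mask & leaf_mask))
```

Within a homogeneous leaf, points that follow the global trend may stop being flagged, while points that deviate locally become visible, which is the effect the abstract attributes to Simpson's paradox.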
Language:
English
Type (Faculty Evaluation):
Scientific
Number of pages:
17