Você está em: Início > Publicações > Visualização > Exploring the effects of data distribution in missing data imputation

Mapa das Instalações

Publicação

Pesquisa de Publicações

Publicações

Exploring the effects of data distribution in missing data imputation

Título

Exploring the effects of data distribution in missing data imputationExportar publicação no formato APA Exportar publicação no formato EXCEL Exportar publicação no formato RIS

Tipo

Artigo em Livro de Atas de Conferência Internacional

Data

2018

Título

Exploring the effects of data distribution in missing data imputation

Tipo

Artigo em Livro de Atas de Conferência Internacional

Ano

2018

Autores

Pompeu Soares, J

(Autor)

Outra

A pessoa não pertence à instituição. A pessoa não pertence à instituição. A pessoa não pertence à instituição. Sem AUTHENTICUS Sem ORCID

Seoane Santos, M

(Autor)

Outra

Ver página pessoal Sem permissões para visualizar e-mail institucional Pesquisar Publicações do Participante Ver página do Authenticus Sem ORCID

Pedro Henriques Abreu

(Autor)

Outra

Ver página pessoal Enviar mensagem Pesquisar Publicações do Participante Ver página do Authenticus Sem ORCID

Araújo, H

(Autor)

Outra

A pessoa não pertence à instituição. A pessoa não pertence à instituição. A pessoa não pertence à instituição. Sem AUTHENTICUS Sem ORCID

Santos, J

(Autor)

Outra

A pessoa não pertence à instituição. A pessoa não pertence à instituição. A pessoa não pertence à instituição. Ver página do Authenticus Sem ORCID

Ata de Conferência Internacional

Título: Advances in Intelligent Data Analysis XVII - 17th International Symposium, IDA 2018, 's-Hertogenbosch, The Netherlands, October 24-26, 2018, Proceedings Pesquisar Publicações da Ata de Conferência

Páginas: 251-263

17th International Symposium on Intelligent Data Analysis, IDA 2018

24 October 2018 through 26 October 2018

Indexação

Scopus - 7 Citações

Outras Informações

ID Authenticus: P-00P-TN0

DOI: 10.1007/978-3-030-01768-2_21

Abstract (EN): In data imputation problems, researchers typically use several techniques, individually or in combination, in order to find the one that presents the best performance over all the features comprised in the dataset. This strategy, however, neglects the nature of data (data distribution) and makes impractical the generalisation of the findings, since for new datasets, a huge number of new, time consuming experiments need to be performed. To overcome this issue, this work aims to understand the relationship between data distribution and the performance of standard imputation techniques, providing a heuristic on the choice of proper imputation methods and avoiding the needs to test a large set of methods. To this end, several datasets were selected considering different sample sizes, number of features, distributions and contexts and missing values were inserted at different percentages and scenarios. Then, different imputation methods were evaluated in terms of predictive and distributional accuracy. Our findings show that there is a relationship between features¿ distribution and algorithms¿ performance, and that their performance seems to be affected by the combination of missing rate and scenario at state and also other less obvious factors such as sample size, goodness-of-fit of features and the ratio between the number of features and the different distributions comprised in the dataset. © Springer Nature Switzerland AG 2018.

Idioma: Inglês

Tipo (Avaliação Docente): Científica

Documentos

Não foi encontrado nenhum documento associado à publicação.

Recomendar Página Voltar ao Topo

Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto I Termos e Condições I Acessibilidade I Índice A-Z
Página gerada em: 2025-09-17 às 04:40:32 | Política de Privacidade | Política de Proteção de Dados Pessoais | Denúncias | Livro Amarelo Eletrónico