Go to:
Logótipo
Você está em: Start > Publications > View > Benchmark of Encoders of Nominal Features for Regression
Map of Premises
Principal
Publication

Benchmark of Encoders of Nominal Features for Regression

Title
Benchmark of Encoders of Nominal Features for Regression
Type
Article in International Conference Proceedings Book
Year
2021
Authors
Diogo Seca
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
João Mendes Moreira
(Author)
FEUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Conference proceedings International
Pages: 146-155
World Conference on Information Systems and Technologies, WorldCIST 2021
1 April 2021 through 2 April 2021
Indexing
Other information
Authenticus ID: P-00T-XXJ
Resumo (PT):
Abstract (EN): Mixed-type data is common in the real world. However, supervised learning algorithms such as support vector machines or neural networks can only process numerical features. One may choose to drop qualitative features, at the expense of possible loss of information. A better alternative is to encode them as new numerical features. Under the constraints of time, budget, and computational resources, we were motivated to search for a general-purpose encoder but found the existing benchmarks to be limited. We review these limitations and present an alternative. Our benchmark tests 16 encoding methods, on 15 regression datasets, using 7 distinct predictive models. The top general-purpose encoders were found to be Catboost, LeaveOneOut, and Target. © 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 10
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Estimating the Likelihood of Financial Behaviours Using Nearest Neighbors A case study on market sensitivities (2024)
Article in International Scientific Journal
Tiago Mendes-Neves; Diogo Seca; Ricardo Sousa; Claúdia Ribeiro; João Mendes-Moreira
Recommend this page Top
Copyright 1996-2025 © Faculdade de Medicina Dentária da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-08 at 22:24:20 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book