help

Você está em: Start > Publications > View > Benchmark of Encoders of Nominal Features for Regression

Map of Premises

Publication

Publication Search

Benchmark of Encoders of Nominal Features for Regression

Title

Benchmark of Encoders of Nominal Features for RegressionExport publication in the APA format Export publication in the EXCEL format Export publication in the RIS format

Type

Article in International Conference Proceedings Book

Date

2021

Title

Benchmark of Encoders of Nominal Features for Regression

Type

Article in International Conference Proceedings Book

Year

2021

Authors

Diogo Seca

(Author)

Other

The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID

João Mendes Moreira

(Author)

FEUP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID

Conference proceedings International

Title: Trends and Applications in Information Systems and Technologies - Volume 1, WorldCIST 2021, Terceira Island, Azores, Portugal, 30 March - 2 April, 2021. Search for Conference Proceedings Publications

Pages: 146-155

World Conference on Information Systems and Technologies, WorldCIST 2021

1 April 2021 through 2 April 2021

Indexing

Scopus - 4 Citations

Other information

Authenticus ID: P-00T-XXJ

DOI: 10.1007/978-3-030-72657-7_14

Resumo (PT):

Abstract (EN): Mixed-type data is common in the real world. However, supervised learning algorithms such as support vector machines or neural networks can only process numerical features. One may choose to drop qualitative features, at the expense of possible loss of information. A better alternative is to encode them as new numerical features. Under the constraints of time, budget, and computational resources, we were motivated to search for a general-purpose encoder but found the existing benchmarks to be limited. We review these limitations and present an alternative. Our benchmark tests 16 encoding methods, on 15 regression datasets, using 7 distinct predictive models. The top general-purpose encoders were found to be Catboost, LeaveOneOut, and Target. © 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Language: English

Type (Professor's evaluation): Scientific

No. of pages: 10

Documents

We could not find any documents associated to the publication.

Related Publications

Of the same authors

Estimating the Likelihood of Financial Behaviours Using Nearest Neighbors A case study on market sensitivities (2024)
Article in International Scientific Journal
Tiago Mendes-Neves; Diogo Seca; Ricardo Sousa; Claúdia Ribeiro; João Mendes-Moreira

Recommend this page Top

Copyright 1996-2025 © Faculdade de Medicina Dentária da Universidade do Porto I Terms and Conditions I Acessibility I Index A-Z
Page created on: 2025-10-16 at 01:40:48 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book