Go to:
Logótipo
Você está em: Start > Publications > View > Pycol: A Python package for dataset complexity measures
Publication

Pycol: A Python package for dataset complexity measures

Title
Pycol: A Python package for dataset complexity measures
Type
Article in International Scientific Journal
Year
2025
Authors
Apóstolo, D
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Santos, MS
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Lorena, AC
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Journal
Title: NeurocomputingImported from Authenticus Search for Journal Publications
Vol. 640
ISSN: 0925-2312
Publisher: Elsevier
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citations
Publicação em Scopus Scopus - 0 Citations
Other information
Authenticus ID: P-018-SHG
Abstract (EN): Class overlap presents a significant challenge to machine learning algorithms, especially when class imbalance is present. These factors contribute substantially to the complexity of classification tasks, particularly in realworld scenarios. As a result, measuring overlap is crucial, yet it remains difficult to quantify due to its intricate nature, since it can manifest and be measured in multiple ways. To help mitigate this, recent research has conceptualized a new taxonomy of class overlap measures, divided into multiple families, which allows researchers to obtain a more complete overview of the complexity of the datasets. In line with recent research, we introduce a new Python package for class overlap measurement named pycol. This package implements 29 overlap measures, divided into four overlap families specifically designed to capture class overlap in imbalanced real-world scenarios. This makes pycol an essential tool for researchers dealing with complex classification problems, providing robust solutions to quantify the joint-effect of class overlap and class imbalance effectively.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 10
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same journal

The vitality of pattern recognition and image analysis (2015)
Another Publication in an International Scientific Journal
Luisa Mico; Joao M Sanches; Jaime S Cardoso
ydata-profiling: Accelerating data-centric AI with high-quality data (2023)
Article in International Scientific Journal
Clemente, F; Ribeiro, GM; Quemy, A; Santos, MS; Pereira, RC; Barros, A
The vitality of pattern recognition and image analysis (2015)
Article in International Scientific Journal
Micó, L; Sanches, JM; Jaime S Cardoso
QIDLEARNINGLIB: A Python library for quasi-identifier recognition and evaluation (2025)
Article in International Scientific Journal
Simoes, SA; João P. Vilela; Santos, MS; Abreu, PH
Pre-processing approaches for imbalanced distributions in regression (2019)
Article in International Scientific Journal
Branco, P; Torgo, L; Rita Ribeiro

See all (25)

Recommend this page Top
Copyright 1996-2026 © Faculdade de Farmácia da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2026-02-15 at 20:45:07 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book