Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Community-Based Topic Modeling with Contextual Outlier Handling
Publication

Publications

Community-Based Topic Modeling with Contextual Outlier Handling

Title
Community-Based Topic Modeling with Contextual Outlier Handling
Type
Article in International Conference Proceedings Book
Year
2024
Authors
Andrade, C
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Rita Ribeiro
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
João Gama
(Author)
FEP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Conference proceedings International
Pages: 173-183
20th Conference of the Spanish Association for Artificial Intelligence (CAEPIA)
A Coruna, SPAIN, JUN 19-21, 2024
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citations
Publicação em Scopus Scopus - 0 Citations
Other information
Authenticus ID: P-010-NBP
Abstract (EN): E-commerce has become an essential aspect of modern life, providing consumers globally with convenience and accessibility. However, the high volume of short and noisy product descriptions in text streams of massive e-commerce platforms translates into an increased number of clusters, presenting challenges for standard model-based stream clustering algorithms. Standard LDA-based methods often lead to clusters dominated by single elements, effectively failing to manage datasets with varied cluster sizes. Our proposed Community-Based Topic Modeling with Contextual Outlier Handling (CB-TMCOH) algorithm introduces an approach to outlier detection in text data using transformer models for similarity calculations and graph-based clustering. This method efficiently separates outliers and improves clustering in large text datasets, demonstrating its utility not only in e-commerce applications but also proving effective for news and tweets datasets.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 11
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Topic Model with Contextual Outlier Handling: a Study on Electronic Invoice Product Descriptions (2023)
Article in International Conference Proceedings Book
Andrade, C; Rita Ribeiro; João Gama
Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-17 at 20:35:51 | Privacy Policy | Personal Data Protection Policy | Whistleblowing