Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Clustering and Classifying Text Documents - A Revisit to Tagging Integration Methods
Publication

Publications

Clustering and Classifying Text Documents - A Revisit to Tagging Integration Methods

Title
Clustering and Classifying Text Documents - A Revisit to Tagging Integration Methods
Type
Article in International Conference Proceedings Book
Year
2013
Authors
Cunha, E
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Figueira, A
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Mealha, O
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. View Authenticus page Without ORCID
Conference proceedings International
Pages: 160-168
5th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2013 and the 5th International Conference on Knowledge Management and Information Sharing, KMIS 2013
Vilamoura, Algarve, 19 September 2013 through 22 September 2013
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge
Other information
Authenticus ID: P-008-GVR
Abstract (EN): In this paper we analyze and discuss two methods that are based on the traditional k-means for document clustering and that feature integration of social tags in the process. The first one allows the integration of tags directly into a Vector Space Model, and the second one proposes the integration of tags in order to select the initial seeds. We created a predictive model for the impact of the tags' integration in both models, and compared the two methods using the traditional k-means++ and the novel k-C algorithm. To compare the results, we propose a new internal measure, allowing the computation of the cluster compactness. The experimental results indicate that the careful selection of seeds on the k-C algorithm present better results to those obtained with the k-means++, with and without integration of tags.
Language: English
Type (Professor's evaluation): Scientific
Documents
We could not find any documents associated to the publication.
Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-17 at 05:33:52 | Privacy Policy | Personal Data Protection Policy | Whistleblowing