help

Você está em: Start > Publications > View > Improving the offline clustering stage of data stream algorithms in scenarios with variable number of clusters

Map of Premises

Publication

Publication Search

Improving the offline clustering stage of data stream algorithms in scenarios with variable number of clusters

Title

Improving the offline clustering stage of data stream algorithms in scenarios with variable number of clustersExport publication in the APA format Export publication in the EXCEL format Export publication in the RIS format

Type

Article in International Conference Proceedings Book

Date

2012

Title

Improving the offline clustering stage of data stream algorithms in scenarios with variable number of clusters

Type

Article in International Conference Proceedings Book

Year

2012

Authors

Faria, ER

(Author)

Other

The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID

Barros, RC

(Author)

Other

The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID

João Gama

(Author)

FEP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page

Carvalho, ACPLF

(Author)

Other

The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID

Conference proceedings International

Title: Proceedings of the ACM Symposium on Applied Computing Search for Conference Proceedings Publications

Pages: 829-830

27th Annual ACM Symposium on Applied Computing, SAC 2012

Trento, 26 March 2012 through 30 March 2012

Indexing

Scopus - 3 Citations

Other information

Authenticus ID: P-008-4SN

DOI: 10.1145/2245276.2245437

Abstract (EN): Many data stream clustering algorithms operate in two well-defined steps: (i) online statistical data collection stage; and (ii) offline macro-clustering stage. The well-known k-means algorithm is often employed for performing the offline macro-clustering step. The conventional k-means algorithm assumes that the number of clusters (k) is defined a priori by the user. Given the difficulty of defining the value of k a priori in real-world problems, we describe a new approach that allows estimating k dynamically from streams with variable number of clusters, which is a common scenario in data with a non-stationary distribution. In addition, we combine our dynamic approach with two different strategies for initializing the centroids during the offline clustering. Analysis of results suggest that, using the dynamic approach, the method k-means++ for centroids initialization present better results. © 2012 Authors.

Language: English

Type (Professor's evaluation): Scientific

Documents

We could not find any documents associated to the publication.

Recommend this page Top

Copyright 1996-2025 © Faculdade de Medicina Dentária da Universidade do Porto I Terms and Conditions I Acessibility I Index A-Z
Page created on: 2025-08-08 at 17:55:21 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book