Publication

Term frequency dynamics in collaborative articles

Title
Term frequency dynamics in collaborative articles
Type
Article in International Conference Proceedings Book
Year
2010
Authors
Sérgio Nunes
(Author)
FEUP
Cristina Ribeiro
(Author)
FEUP
Conference proceedings International
Pages: 267-270
10th ACM Symposium on Document Engineering
Manchester, England, September 21-24, 2010
Indexing
ISI Proceedings
Scopus (0 citations)
COMPENDEX
Scientific classification
FOS: Engineering and technology > Electrical engineering, Electronic engineering, Information engineering
Other information
Authenticus ID: P-003-AKN
Abstract (EN): Documents on the World Wide Web are dynamic entities. Mainstream information retrieval systems and techniques focus primarily on the latest version of a document, generally ignoring its evolution over time. In this work, we study the term frequency dynamics in web documents over their lifespan. We use Wikipedia as a document collection because it is a broad and public resource and, more importantly, because it provides access to the complete revision history of each document. We investigate the progression of similarity values over two projection variables, namely revision order and revision date. Based on this investigation, we find that term frequency in encyclopedic documents (i.e., documents that are comprehensive and focused on a single topic) exhibits a rapid and steady progression towards the document's current version. The content in early versions quickly becomes very similar to the present version of the document.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 4
License type: CC BY-NC
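Illustrative sketch (not part of the publication record): the abstract describes comparing each revision of a document with its current version and following the similarity values along the revision order. The abstract does not name the similarity measure or the tokenization, so the sketch below assumes cosine similarity over raw term counts and a simple word tokenizer. The function names and the toy revision history are hypothetical.

```python
import math
import re
from collections import Counter


def term_frequencies(text):
    """Count lowercase word tokens in a text (assumed tokenization)."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine_similarity(tf_a, tf_b):
    """Cosine similarity between two term-frequency Counters (assumed measure)."""
    dot = sum(count * tf_b[term] for term, count in tf_a.items())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_to_current(revisions):
    """Similarity of every revision (oldest first) to the last revision."""
    current = term_frequencies(revisions[-1])
    return [cosine_similarity(term_frequencies(r), current) for r in revisions]


# Hypothetical toy revision history, oldest first.
revisions = [
    "porto is a city",
    "porto is a city in portugal",
    "porto is a coastal city in northwest portugal",
]
scores = similarity_to_current(revisions)
for order, score in enumerate(scores, start=1):
    print(f"revision {order}: similarity to current = {score:.3f}")
```

Projecting onto revision date instead of revision order would only require pairing each score with that revision's timestamp rather than its position in the list.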
Documents
File name: p267-nunes
Size: 362.03 KB
Related Publications

Of the same authors

Information Retrieval on Time-Dependent Collections (2010)
Thesis
Sérgio Nunes; Cristina Ribeiro; Gabriel David
The impact of time in link-based Web ranking (2013)
Article in International Scientific Journal
Sérgio Nunes; Cristina Ribeiro; Gabriel David
Term Weighting Based on Document Revision History (2011)
Article in International Scientific Journal
Sérgio Nunes; Cristina Ribeiro; Gabriel David
Improving Web user experience with document activity sparklines (2009)
Article in International Conference Proceedings Book
Sérgio Nunes; Cristina Ribeiro; Gabriel David
WikiChanges : exposing Wikipedia revision activity (2008)
Article in International Conference Proceedings Book
Sérgio Sobral Nunes; Maria Cristina de Carvalho Alves Ribeiro; Gabriel de Sousa Torcato David
