Using neighbors to date web documents

Title
Using neighbors to date web documents
Type
Article in International Conference Proceedings Book
Year
2007
Authors
Sérgio Nunes
(Author)
FEUP
Cristina Ribeiro
(Author)
FEUP
Conference proceedings International
Indexing
Scientific classification
CORDIS: Technological sciences > Technology > Information technology
Other information
Authenticus ID: P-007-T02
Abstract (EN): Time has been successfully used as a feature in web information retrieval tasks. In this context, estimating a document's inception date or last update date is a necessary task. Classic approaches have used HTTP header fields to estimate a document's last update time. The main problem with this approach is that it is applicable to a small part of web documents. In this work, we evaluate an alternative strategy based on a document's neighborhood. Using a random sample containing 10,000 URLs from the Yahoo! Directory, we study each document's links and media assets to determine its age. If we only consider isolated documents, we are able to date 52% of them. Including the document's neighborhood, we are able to estimate the date of more than 85% of the same sample. Also, we find that estimates differ significantly according to the type of neighbors used. The most reliable estimates are based on the document's media assets, while the worst estimates are based on incoming links. These results are experimentally evaluated with a real world application using different datasets.
Language: Portuguese
Type (Professor's evaluation): Scientific
No. of pages: 7
License type: CC BY-NC
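
The neighborhood idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the only evidence available is the `Last-Modified` HTTP header of the document and of its neighbors. The function names, the `NEIGHBOR_ORDER` tuple, the dictionary keys, and the use of the median as the aggregate are all hypothetical choices made for illustration. The abstract states only that media assets gave the most reliable estimates and incoming links the worst; placing outgoing links between them is an assumption.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from statistics import median
from typing import Dict, List, Optional, Tuple

# Assumed fallback order. Only the first and last positions come from the
# abstract; the middle position for outgoing links is a guess.
NEIGHBOR_ORDER = ("assets", "out_links", "in_links")


def parse_last_modified(value: Optional[str]) -> Optional[datetime]:
    """Parse an HTTP Last-Modified value; return None if missing or invalid."""
    if not value:
        return None
    try:
        parsed = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        # Older Python versions raise TypeError, newer ones ValueError.
        return None
    if parsed.tzinfo is None:
        # Treat a date without a usable offset as UTC (illustrative choice).
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


def estimate_date(
    own_header: Optional[str],
    neighbor_headers: Dict[str, List[str]],
) -> Optional[Tuple[datetime, str]]:
    """Return (estimated date, evidence source), or None if nothing is datable."""
    own = parse_last_modified(own_header)
    if own is not None:
        return own, "document"

    # Fall back to neighbors, one neighbor type at a time.
    for kind in NEIGHBOR_ORDER:
        parsed = [parse_last_modified(h) for h in neighbor_headers.get(kind, [])]
        stamps = [p.timestamp() for p in parsed if p is not None]
        if stamps:
            # Median of the neighbor dates (an assumed aggregation rule).
            return datetime.fromtimestamp(median(stamps), tz=timezone.utc), kind
    return None
```

Calling `estimate_date(None, {"assets": [...], "in_links": [...]})` for a document that has no usable header of its own returns an estimate taken from the first neighbor type that has any parseable dates, together with a label naming that type.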
Related Publications

Of the same authors

Information Retrieval on Time-Dependent Collections (2010)
Thesis
Sérgio Nunes; Cristina Ribeiro; Gabriel David
The impact of time in link-based Web ranking (2013)
Article in International Scientific Journal
Sérgio Nunes; Cristina Ribeiro; Gabriel David
Term Weighting Based on Document Revision History (2011)
Article in International Scientific Journal
Sérgio Nunes; Cristina Ribeiro; Gabriel David
Improving Web user experience with document activity sparklines (2009)
Article in International Conference Proceedings Book
Sérgio Nunes; Cristina Ribeiro; Gabriel David
WikiChanges : exposing Wikipedia revision activity (2008)
Article in International Conference Proceedings Book
Sérgio Sobral Nunes; Maria Cristina de Carvalho Alves Ribeiro; Gabriel de Sousa Torcato David

Of the same scientific areas

O Repositório Aberto da Universidade do Porto (2009)
Academic Work
Maria Eugénia Matos Fernandes; Lígia Maria Ribeiro
Deepfakes: uma nova ameaça à segurança e à confiança na informação (2024)
Academic Work
Filipa Lopes; Inês Aparício; Sara Esteves
Plano de Intervenção Estrutural do Sector Cultural no Horizonte 2007-2013 (2006)
Technical Report
Elisa Pérez Babo; Filipa César; José Portugal; Paula Guerra; Pedro Costa
Os Repositórios da Dados Científicos: Estado da Arte (2010)
Technical Report
Cristina Ribeiro; Eloy Rodrigues; Maria Eugénia Matos Fernandes; Ricardo Saraiva
Formulação de Políticas Públicas no Horizonte 2013 relativas ao tema Sociedade da Informação (2005)
Technical Report
Artur Pimenta Alves; Carlos José Rodrigues; Eduardo Anselmo Castro; Flávio Nunes; Gonçalo Alves Santinho; Jorge Bateira; José Carlos Caldeira; José Manuel Mendonça; Maria José Marques; Maria Teresa Pinto; Mário Jorge Leitão; Paula Guerra; Paulo Monteiro; Pedro Guedes de Oliveira; Teresa Sá Marques
