Publication

Evaluating deterministic motif significance measures in protein databases

Title
Evaluating deterministic motif significance measures in protein databases
Type
Article in International Scientific Journal
Year
2007
Authors
Azevedo, PJ
(Author)
Journal
Vol. 2
Final page: 16
ISSN: 1748-7188
Publisher: Springer Nature
Scientific classification
CORDIS: Physical sciences > Computer science > Informatics
FOS: Natural sciences > Computer and information sciences
Other information
Authenticus ID: P-004-5Q7
Abstract (EN): Background: Assessing the outcome of motif mining algorithms is an essential task, as the number of reported motifs can be very large. Significance measures play a central role in automatically ranking those motifs, thereby reducing the analysis work. Spotting the most interesting and relevant motifs therefore depends on choosing the right measures. The combined use of several measures may provide more robust results. However, caution is needed to avoid spurious evaluations.

Results: The experiments showed that several of the selected significance measures behave very similarly in a wide range of situations and therefore provide redundant information. Some measures proved more appropriate for ranking highly conserved motifs, while others are more appropriate for weakly conserved ones. Support appears to be a very important feature to consider for correct motif ranking. We observed that not all the measures are suitable for situations with poorly balanced class information, for instance when positive examples are far fewer than negative ones. Finally, a visualization scheme was proposed that, when several measures are applied, enables easy identification of high-scoring motifs.

Conclusion: In this work we surveyed and categorized 14 significance measures for pattern evaluation. Their ability to rank three types of deterministic motifs was evaluated. The measures were applied under different testing conditions, and relations among them were identified. This study provides some pertinent insights into the choice of the right set of significance measures for evaluating deterministic motifs extracted from protein databases.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 20
Related Publications

Of the same authors

Deterministic pattern mining on genetic sequences (2009)
Chapter or Part of a Book
Ferreira, PG; Azevedo, PJ
Deterministic motif mining in protein databases (2007)
Chapter or Part of a Book
Ferreira, PG; Azevedo, PJ
Deterministic Motif Mining in Protein Databases (2009)
Chapter or Part of a Book
Ferreira, PG; Azevedo, PJ
Protein sequence pattern mining with constraints (2005)
Article in International Scientific Journal
Ferreira, PG; Azevedo, PJ
