Abstract (EN):
We present a new, efficient and scalable tool, named BIORED, for pattern discovery in proteomic and genomic sequences. It uses a genetic algorithm to find interesting patterns in the form of regular expressions, and a new efficient pattern matching procedure to count pattern occurrences. We studied the performance, scalability and usefulness of BIORED using several databases of biosequences. The results show that BIORED was successful in finding previously known patterns, thus an excellent indicator for its potential. BIORED is available for download under the GNU Public License at http://www.dcc.fc.up.pt/bi-ored/. An online demo is available at the same address.
Language:
English
Type (Professor's evaluation):
Scientific
Contact:
pdr@dcc.fc.up.pt; fds@dcc.fc.up.pt; nf@ibmc.up.pt
No. of pages:
10