Indexing strategies for rapid searches of short words in genome sequences.

Details

Ressource 1Download: BIB_DD138EBA55CC.P001.pdf (249.05 [Ko])
State: Public
Version: author
Serval ID
serval:BIB_DD138EBA55CC
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Indexing strategies for rapid searches of short words in genome sequences.
Journal
PLoS ONE
Author(s)
Iseli C., Ambrosini G., Bucher P., Jongeneel C.V.
ISSN
1932-6203
Publication state
Published
Issued date
2007
Peer-reviewed
Oui
Volume
2
Number
6
Pages
e579
Language
english
Notes
Publication types: Journal Article
Abstract
Searching for matches between large collections of short (14-30 nucleotides) words and sequence databases comprising full genomes or transcriptomes is a common task in biological sequence analysis. We investigated the performance of simple indexing strategies for handling such tasks and developed two programs, fetchGWI and tagger, that index either the database or the query set. Either strategy outperforms megablast for searches with more than 10,000 probes. FetchGWI is shown to be a versatile tool for rapidly searching multiple genomes, whose performance is limited in most cases by the speed of access to the filesystem. We have made publicly available a Web interface for searching the human, mouse, and several other genomes and transcriptomes with oligonucleotide queries.
Pubmed
Web of science
Open Access
Yes
Create date
24/01/2008 16:39
Last modification date
20/08/2019 17:01
Usage data