Briefings in Functional Genomics and Proteomics Advance Access originally published online on September 24, 2009
Briefings in Functional Genomics and Proteomics 2009 8(6):451-460; doi:10.1093/bfgp/elp035
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
This article appears in the following Briefings in Functional Genomics and Proteomics issue: Special Issue: Non-Coding RNAs [View the issue table of contents]
Special Issue Papers |
Customized strategies for discovering distant ncRNA homologs
Corresponding author. Peter F. Stadler, Chair of Bioinformatics, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany. Tel: +49 341 97 16690, Fax: +49 341 97 16709, E-mail: studla{at}bioinf.uni-leipzig.de
A large fraction of non-coding RNAs is short and/or poorly conserved in sequence. Most of the longer examples, furthermore, consist of a collection of conserved structural motifs rather than a coherent globally conserved secondary structure. As a consequence, the conceptually simple problem of homology search becomes a complex and technically demanding task. Despite the best efforts of databases such as Rfam, the situation is complicated further by the sparsity of information in many—in particular prokaryotic—RNA families. In this contribution, we review recent efforts to customize sequence-based search tools for ncRNA applications. In particular, semi-global alignments and the development of methods for fragmented pattern search have brought significant practical advances. Current developments in this area focus on the integration of fragmented sequence pattern search with search algorithms for secondary structure patterns. We focus here, in particular, on strategies that can be successful in the twilight zone where generic approaches from blast to infernal to start to fail.
Keywords: non-coding RNA, homology search, fragmented patterns, telomerase RNA, vault RNA