TY - GEN
T1 - Local alignment of RNA sequences with arbitrary scoring schemes
AU - Backofen, Rolf
AU - Hermelin, Danny
AU - Landau, Gad M.
AU - Weimann, Oren
PY - 2006
Y1 - 2006
N2 - Local similarity is an important tool in comparative analysis of biological sequences, and is therefore well studied. In particular, the Smith-Waterman technique and its normalized version are two established metrics for measuring local similarity in strings. In RNA sequences however, where one must consider not only sequential but also structural features of the inspected molecules, the concept of local similarity becomes more complicated. First, even in global similarity, computing global sequence-structure alignments is more difficult than computing standard sequence alignments due to the bi-dimensionality of information. Second, one can view locality in two different ways, in the sequential or structural sense, leading to different problem formulations. In this paper we introduce two sequentially-local similarity metrics for comparing RNA sequences. These metrics combine the global RNA alignment metric of Shasha and Zhang [16] with the Smith-Waterman metric [17] and its normalized version [2] used in strings. We generalize the familiar alignment graph used in string comparison to apply also for RNA sequences, and then utilize this generalization to devise two algorithms for computing local similarity according to our two suggested metrics. Our algorithms run in O(m2n lg n) and O(m 2n lg n+n2m) time respectively, where m ≤ n are the lengths of the two given RNAs. Both algorithms can work with any arbitrary scoring scheme.
AB - Local similarity is an important tool in comparative analysis of biological sequences, and is therefore well studied. In particular, the Smith-Waterman technique and its normalized version are two established metrics for measuring local similarity in strings. In RNA sequences however, where one must consider not only sequential but also structural features of the inspected molecules, the concept of local similarity becomes more complicated. First, even in global similarity, computing global sequence-structure alignments is more difficult than computing standard sequence alignments due to the bi-dimensionality of information. Second, one can view locality in two different ways, in the sequential or structural sense, leading to different problem formulations. In this paper we introduce two sequentially-local similarity metrics for comparing RNA sequences. These metrics combine the global RNA alignment metric of Shasha and Zhang [16] with the Smith-Waterman metric [17] and its normalized version [2] used in strings. We generalize the familiar alignment graph used in string comparison to apply also for RNA sequences, and then utilize this generalization to devise two algorithms for computing local similarity according to our two suggested metrics. Our algorithms run in O(m2n lg n) and O(m 2n lg n+n2m) time respectively, where m ≤ n are the lengths of the two given RNAs. Both algorithms can work with any arbitrary scoring scheme.
UR - http://www.scopus.com/inward/record.url?scp=33746063235&partnerID=8YFLogxK
U2 - 10.1007/11780441_23
DO - 10.1007/11780441_23
M3 - Conference contribution
AN - SCOPUS:33746063235
SN - 3540354557
SN - 9783540354550
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 246
EP - 257
BT - Combinatorial Pattern Matching - 17th Annual Symposium, CPM 2006, Proceedings
PB - Springer Verlag
T2 - 17th Annual Symposium on Combinatorial Pattern Matching, CPM 2006
Y2 - 5 July 2006 through 7 July 2006
ER -