A rapid method for detecting interconnections between functionally and/or evolutionary close biological sequences
Gusev, V.D.; Nemytikova, L.A.; Chuzhanova, N.A.
Molekuliarnaia Biologiia 35(6): 1015-1022
2001
ISSN/ISBN: 0026-8984 PMID: 11771125 Document Number: 527749
It is now common practice to retrieve, by key words, highly specialized selections of sequences from general-purpose databases such as EMBL, GenBank, etc. The sequences included in a selection are often interconnected, which means that there are duplications, embeddings, intersections, homology, common structural elements. Knowledge of these interconnections is necessary for further processing of the sequences. We propose a rapid (single scan) method for identification of such interconnections by means of complexity analysis that generalizes the Lempel-Ziv approach. Analysis of a selection of 5'-flanking regions of vertebrate growth hormone genes from EMBL is presented as an example.