Abstract
The association of nucleotide composition of genome sequences with their functional characteristics is widely known, among the most studied characteristics correlated with GC content are gene density and expression and recombination rate. Previously, we found that similar in nucleotide composition genomic regions may exhibit considerable differences in sequence organization and hypothesized that organizationally different regions may also exhibit functional and evolutionary heterogeneity. Here we examine this hypothesis by classifying 100 kbp segments of human genome into 14 compositionally homogeneous groups according to their GC content and differentiating the segments within each group by organization patterns (OP) using oligonucleotide (k-mer) counting, referred to as Compositional Spectra (CS) Analysis. We identified 141 groups of segments different in their CS organization and found that obtained compositionally similar OP groups (OPG) differ significantly in their recombination rate. This conclusion was robust with respect to the selected window size (confirmed by independent analysis for 50 kb and 200 kb segments). We further performed a test of contribution of specific k-mers in clustering of 100 kbp segments to OPGs with contrast levels of recombination rates. Eight k-mers, which demonstrated highest importance for such clustering, allowed correct classification at least 76% of segments in all 14 OPG pairs. Moreover, these k-mers proved similar with five previously described patterns related to recombination hotspots including the most known 13 bp recombination motif CCNCCNTNNCCNC.
Original language | English |
---|---|
Title of host publication | Proceedings - 2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016 |
Editors | Anatoly Lisnianski, Ilia Frenkel |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 414-420 |
Number of pages | 7 |
ISBN (Electronic) | 9781467399418 |
DOIs | |
State | Published - 11 Mar 2016 |
Event | 2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016 - Beer Sheva, Israel Duration: 15 Feb 2016 → 18 Feb 2016 |
Publication series
Name | Proceedings - 2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016 |
---|
Conference
Conference | 2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016 |
---|---|
Country/Territory | Israel |
City | Beer Sheva |
Period | 15/02/16 → 18/02/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
Keywords
- Compositional spectra analysis
- Genome evolution
- Genome features
- Genome heterogeneity
- Repetitive sequences
- Sequence organization pattern
ASJC Scopus subject areas
- Safety, Risk, Reliability and Quality
- Modeling and Simulation
- Statistics and Probability
- Agricultural and Biological Sciences (miscellaneous)
- Strategy and Management