Organizational heterogeneity of the human genome: significant variation of recombination rate of 100 kbp sequences within GC ranges

Svetlana Frenkel, Valery Kirzhner, Zeev Frenkel, Abraham B. Korol

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The association of nucleotide composition of genome sequences with their functional characteristics is widely known, among the most studied characteristics correlated with GC content are gene density and expression and recombination rate. Previously, we found that similar in nucleotide composition genomic regions may exhibit considerable differences in sequence organization and hypothesized that organizationally different regions may also exhibit functional and evolutionary heterogeneity. Here we examine this hypothesis by classifying 100 kbp segments of human genome into 14 compositionally homogeneous groups according to their GC content and differentiating the segments within each group by organization patterns (OP) using oligonucleotide (k-mer) counting, referred to as Compositional Spectra (CS) Analysis. We identified 141 groups of segments different in their CS organization and found that obtained compositionally similar OP groups (OPG) differ significantly in their recombination rate. This conclusion was robust with respect to the selected window size (confirmed by independent analysis for 50 kb and 200 kb segments). We further performed a test of contribution of specific k-mers in clustering of 100 kbp segments to OPGs with contrast levels of recombination rates. Eight k-mers, which demonstrated highest importance for such clustering, allowed correct classification at least 76% of segments in all 14 OPG pairs. Moreover, these k-mers proved similar with five previously described patterns related to recombination hotspots including the most known 13 bp recombination motif CCNCCNTNNCCNC.

Original languageEnglish
Title of host publicationProceedings - 2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016
EditorsAnatoly Lisnianski, Ilia Frenkel
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages414-420
Number of pages7
ISBN (Electronic)9781467399418
DOIs
StatePublished - 11 Mar 2016
Event2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016 - Beer Sheva, Israel
Duration: 15 Feb 201618 Feb 2016

Publication series

NameProceedings - 2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016

Conference

Conference2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management, SMRLO 2016
Country/TerritoryIsrael
CityBeer Sheva
Period15/02/1618/02/16

Bibliographical note

Publisher Copyright:
© 2016 IEEE.

Keywords

  • Compositional spectra analysis
  • Genome evolution
  • Genome features
  • Genome heterogeneity
  • Repetitive sequences
  • Sequence organization pattern

ASJC Scopus subject areas

  • Safety, Risk, Reliability and Quality
  • Modeling and Simulation
  • Statistics and Probability
  • Agricultural and Biological Sciences (miscellaneous)
  • Strategy and Management

Fingerprint

Dive into the research topics of 'Organizational heterogeneity of the human genome: significant variation of recombination rate of 100 kbp sequences within GC ranges'. Together they form a unique fingerprint.

Cite this