Lengths of orthologous prokaryotic proteins are affected by evolutionary factors

Tatiana Tatarinova, Bilal Salih, Jennifer Dien Bard, Irit Cohen, Alexander Bolshoy

Research output: Contribution to journalArticlepeer-review

Abstract

Proteins of the same functional family (for example, kinases) may have significantly different lengths. It is an open question whether such variation in length is random or it appears as a response to some unknown evolutionary driving factors. The main purpose of this paper is to demonstrate existence of factors affecting prokaryotic gene lengths. We believe that the ranking of genomes according to lengths of their genes, followed by the calculation of coefficients of association between genome rank and genome property, is a reasonable approach in revealing such evolutionary driving factors. As we demonstrated earlier, our chosen approach, Bubble-sort, combines stability, accuracy, and computational efficiency as compared to other ranking methods. Application of Bubble Sort to the set of 1390 prokaryotic genomes confirmed that genes of Archaeal species are generally shorter than Bacterial ones. We observed that gene lengths are affected by various factors: within each domain, different phyla have preferences for short or long genes; thermophiles tend to have shorter genes than the soil-dwellers; halophiles tend to have longer genes. We also found that species with overrepresentation of cytosines and guanines in the third position of the codon (GC3 content) tend to have longer genes than species with low GC3 content.

Original languageEnglish
Article number786861
JournalBioMed Research International
Volume2015
DOIs
StatePublished - 2015

Bibliographical note

Publisher Copyright:
© 2015 Tatiana Tatarinova et al.

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology

Fingerprint

Dive into the research topics of 'Lengths of orthologous prokaryotic proteins are affected by evolutionary factors'. Together they form a unique fingerprint.

Cite this