SplittingHeirs: Inferring haplotypes by optimizing resultant dense graphs

Sharlee Climer, Alan R. Templeton, Weixiong Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Phasing genotype data to identify the composite haplotype pairs is a widely-studied problem due to its value for understanding genetic contributions to diseases, population genetics research, and other significant endeavors. The accuracy of the phasing is crucial as identification of haplotypes is frequently the first step of expensive and vitally important studies. We present a combinatorial approach to this problem which we call SplittingHeirs. This approach is biologically motivated as it is based on three widely accepted principles: there tend to be relatively few unique haplotypes within a population, there tend to be clusters of haplotypes that are similar to each other, and some haplotypes are relatively common. We have tested SplittingHeirs, along with several popular existing phasing methods including PHASE, HAP, EM, and Pure Parsimony, on seven sets of haplotype data for which the true phase is known. Our method yields the highest accuracy obtainable by these methods in all cases. Furthermore, SplittingHeirs is robust and had higher accuracy than any of the other approaches for the two datasets with high recombination rates. The success of SplittingHeirs validates the assumptions made by the dense graph model and highlights the benefits of finding globally optimal solutions.

Original languageEnglish
Title of host publication2010 ACM International Conference on Bioinformatics and Computational Biology, ACM-BCB 2010
Pages127-136
Number of pages10
DOIs
StatePublished - 2010
Externally publishedYes
Event2010 ACM International Conference on Bioinformatics and Computational Biology, ACM-BCB 2010 - Niagara Falls, NY, United States
Duration: 2 Aug 20104 Aug 2010

Publication series

Name2010 ACM International Conference on Bioinformatics and Computational Biology, ACM-BCB 2010

Conference

Conference2010 ACM International Conference on Bioinformatics and Computational Biology, ACM-BCB 2010
Country/TerritoryUnited States
CityNiagara Falls, NY
Period2/08/104/08/10

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Information Management

Fingerprint

Dive into the research topics of 'SplittingHeirs: Inferring haplotypes by optimizing resultant dense graphs'. Together they form a unique fingerprint.

Cite this