Multi-SpaM: A maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees

Thomas Dencker, Chris André Leimeister, Michael Gerth, Christoph Bleidorn, Sagi Snir, Burkhard Morgenstern

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Word-based or ‘alignment-free’ methods for phylogeny reconstruction are much faster than traditional, alignment-based approaches, but they are generally less accurate. Most alignment-free methods calculate pairwise distances for a set of input sequences, for example from word frequencies, from so-called spaced-word matches or from the average length of common substrings. In this paper, we propose the first word-based phylogeny approach that is based on multiple sequence comparison and Maximum Likelihood. Our algorithm first samples small, gap-free alignments involving four taxa each. For each of these alignments, it then calculates a quartet tree and, finally, the program Quartet MaxCut is used to infer a super tree for the full set of input taxa from the calculated quartet trees. Experimental results show that trees calculated with our approach are of high quality.

Original languageEnglish
Title of host publicationComparative Genomics - 16th International Conference, RECOMB-CG 2018, Proceedings
EditorsAïda Ouangraoua, Mathieu Blanchette
PublisherSpringer Verlag
Pages227-241
Number of pages15
ISBN (Print)9783030008338
DOIs
StatePublished - 2018
Event16th International Conference on Comparative Genomics, RECOMB-CG 2018 - Magog-Orford, Canada
Duration: 9 Oct 201812 Oct 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11183 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Comparative Genomics, RECOMB-CG 2018
Country/TerritoryCanada
CityMagog-Orford
Period9/10/1812/10/18

Bibliographical note

Publisher Copyright:
© Springer Nature Switzerland AG 2018.

Keywords

  • Alignment-free
  • Likelihood
  • Phylogeny
  • Spaced words

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Multi-SpaM: A maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees'. Together they form a unique fingerprint.

Cite this