Reconstructing ancient literary texts from noisy manuscripts

Moshe Koppel, Moty Michaely, Alex Tal

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Given multiple corrupted versions of the same text, as is common with ancient manuscripts, we wish to reconstruct the original text from which the extant corrupted versions were copied (typically via latent intermediary versions). This is a challenge of cardinal importance in the humanities. We use a variant of expectation-maximization (EM), to solve this problem. We prove the efficacy of our method on both synthetic and real-world data.

    Original languageEnglish
    Title of host publicationProceedings of the 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics
    Subtitle of host publicationHuman Language Technologies, NAACL-HLT 2016
    EditorsAnna Feldman, Anna Kazantseva, Stan Szpakowicz
    PublisherAssociation for Computational Linguistics (ACL)
    Pages40-46
    Number of pages7
    ISBN (Electronic)9781941643808
    StatePublished - 2016
    Event5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016 - San Diego, United States
    Duration: 16 Jun 2016 → …

    Publication series

    NameProceedings of the 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016

    Conference

    Conference5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016
    Country/TerritoryUnited States
    CitySan Diego
    Period16/06/16 → …

    Bibliographical note

    Publisher Copyright:
    © 2016 Proceedings of the 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016. All rights reserved.

    ASJC Scopus subject areas

    • Computer Science Applications
    • Language and Linguistics
    • Linguistics and Language
    • Artificial Intelligence

    Fingerprint

    Dive into the research topics of 'Reconstructing ancient literary texts from noisy manuscripts'. Together they form a unique fingerprint.

    Cite this