Abstract
Given multiple corrupted versions of the same text, as is common with ancient manuscripts, we wish to reconstruct the original text from which the extant corrupted versions were copied (typically via latent intermediary versions). This is a challenge of cardinal importance in the humanities. We use a variant of expectation-maximization (EM), to solve this problem. We prove the efficacy of our method on both synthetic and real-world data.
Original language | English |
---|---|
Title of host publication | Proceedings of the 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics |
Subtitle of host publication | Human Language Technologies, NAACL-HLT 2016 |
Editors | Anna Feldman, Anna Kazantseva, Stan Szpakowicz |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 40-46 |
Number of pages | 7 |
ISBN (Electronic) | 9781941643808 |
State | Published - 2016 |
Event | 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016 - San Diego, United States Duration: 16 Jun 2016 → … |
Publication series
Name | Proceedings of the 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016 |
---|
Conference
Conference | 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016 |
---|---|
Country/Territory | United States |
City | San Diego |
Period | 16/06/16 → … |
Bibliographical note
Publisher Copyright:© 2016 Proceedings of the 5th Workshop on Computational Linguistics for Literature, CLfL 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016. All rights reserved.
ASJC Scopus subject areas
- Computer Science Applications
- Language and Linguistics
- Linguistics and Language
- Artificial Intelligence