Abstract
Transcribing historical handwritten documents is difficult, in part because it is a tedious task normally performed by experts. Some newer techniques rely on crowdsourcing manual transcription. Crowdsourcing helps speed up the transcription process, but it is still limited and brings new challenges of its own. Although crowdsourced transcription may appear to be a repetitive task carried out by a large group of users, there is in fact room for personalization. This paper reports insights for future personalization gathered from the "Tikkoun Sofrim" project, which implements a framework that combines automatic handwritten text recognition with crowdsourcing to transcribe complete handwritten manuscripts. The Hebrew "Midrash Tanhuma" manuscripts were selected as a case study.
Original language | English |
---|---|
Title of host publication | UMAP 2020 Adjunct - Adjunct Publication of the 28th ACM Conference on User Modeling, Adaptation and Personalization |
Publisher | Association for Computing Machinery, Inc |
Pages | 373-375 |
Number of pages | 3 |
ISBN (Electronic) | 9781450367110 |
State | Published - 14 Jul 2020 |
Event | 28th ACM International Conference on User Modeling, Adaptation, and Personalization, UMAP 2020 - Genoa, Italy. Duration: 14 Jul 2020 → 17 Jul 2020 |
Publication series
Name | UMAP 2020 Adjunct - Adjunct Publication of the 28th ACM Conference on User Modeling, Adaptation and Personalization |
---|---|
Conference
Conference | 28th ACM International Conference on User Modeling, Adaptation, and Personalization, UMAP 2020 |
---|---|
Country/Territory | Italy |
City | Genoa |
Period | 14/07/20 → 17/07/20 |
Bibliographical note
Publisher Copyright: © 2020 ACM.
Keywords
- computer assisted transcription for text images
- crowdsourcing
- handwritten text recognition
ASJC Scopus subject areas
- Software