Abstract
We present a corpus of transcribed spoken Hebrew that forms an integral part of a comprehensive data system that has been developed to suit the specific needs and interests of child language researchers: CHILDES (Child Language Data Exchange System). We introduce a dedicated transcription scheme for the spoken Hebrew data that is aware both of the phonology and of the standard orthography of the language. We also introduce a morphological analyzer that was specifically developed for this corpus.
Original language | English |
---|---|
Title of host publication | Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010 |
Editors | Daniel Tapias, Irene Russo, Olivier Hamon, Stelios Piperidis, Nicoletta Calzolari, Khalid Choukri, Joseph Mariani, Helene Mazo, Bente Maegaard, Jan Odijk, Mike Rosner |
Publisher | European Language Resources Association (ELRA) |
Pages | 1487-1490 |
Number of pages | 4 |
ISBN (Electronic) | 2951740867, 9782951740860 |
State | Published - 2010 |
Event | 7th International Conference on Language Resources and Evaluation, LREC 2010 - Valletta, Malta Duration: 17 May 2010 → 23 May 2010 |
Publication series
Name | Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010 |
---|
Conference
Conference | 7th International Conference on Language Resources and Evaluation, LREC 2010 |
---|---|
Country/Territory | Malta |
City | Valletta |
Period | 17/05/10 → 23/05/10 |
Bibliographical note
Funding Information:This research was supported by a grant from the United States-Israel Binational Science Foundation (BSF), Jerusalem, Israel. We are grateful to Aviad Albert and Hadas Zaidenberg for their help with the annotation. The morphological analyzer was originally created by Bracha Nir and Sigal Uziel-Karl.
ASJC Scopus subject areas
- Education
- Library and Information Sciences
- Linguistics and Language
- Language and Linguistics