The Sign Language Dataset Compendium: Creating an Overview of Digital Linguistic Resources
Kopf, Maria
| Schulder, Marc
| Hanke, Thomas 
- Volume:
- Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources
- Venue:
- Marseille, France
- Date:
- 25 June 2022
- Pages:
- 102–109
- Publisher:
- European Language Resources Association (ELRA)
- License:
- CC BY-NC 4.0
- sign-lang ID:
- 22025
- ACL ID:
- 2022.signlang-1.16
- ISBN:
- 979-10-95546-86-3
Content Categories
- Projects:
- DGS-Korpus project, EASIER
- Other Tools:
- SLDC
Abstract
One of the challenges that sign language researchers face is the identification of suitable language datasets, particularly for cross-lingual studies. There is no single source of information on what sign language corpora and lexical resources exist or how they compare. Instead, they have to be found through extensive literature review or word-of-mouth. The amount of information available on individual datasets can also vary widely and may be distributed across different publications, data repositories and (potentially defunct) project websites. This article introduces the Sign Language Dataset Compendium, an extensive overview of linguistic resources for sign languages. It covers existing corpora and lexical resources, as well as commonly used data collection tasks. Special attention is paid to covering resources for many different languages from around the globe. All information is provided in a standardised format to make entries comparable, but kept flexible enough to allow for differences in content. The compendium is intended as a growing resource that will be updated regularly.Document Download
Paper PDF Poster BibTeX File + Abstract
Cite as
Citation in ACL Citation Format
Maria Kopf, Marc Schulder, Thomas Hanke. 2022. The Sign Language Dataset Compendium: Creating an Overview of Digital Linguistic Resources. In Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources, pages 102–109, Marseille, France. European Language Resources Association (ELRA).BibTeX Export
@inproceedings{kopf:22025:sign-lang:lrec, author = {Kopf, Maria and Schulder, Marc and Hanke, Thomas}, title = {The {Sign} {Language} {Dataset} {Compendium}: Creating an Overview of Digital Linguistic Resources}, pages = {102--109}, editor = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna and Schulder, Marc}, booktitle = {Proceedings of the {LREC2022} 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources}, maintitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)}, publisher = {{European Language Resources Association (ELRA)}}, address = {Marseille, France}, day = {25}, month = jun, year = {2022}, isbn = {979-10-95546-86-3}, language = {english}, url = {https://www.sign-lang.uni-hamburg.de/lrec/pub/22025.pdf} }