sign-lang@LREC Anthology

The Sign Language Dataset Compendium: Creating an Overview of Digital Linguistic Resources

Kopf, Maria ORCID button Kopf, Maria | Schulder, Marc ORCID button Schulder, Marc | Hanke, Thomas ORCID button Hanke, Thomas


Volume:
Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources
Venue:
Marseille, France
Date:
25 June 2022
Pages:
102–109
Publisher:
European Language Resources Association (ELRA)
License:
CC BY-NC 4.0
sign-lang ID:
22025
ACL ID:
2022.signlang-1.16
ISBN:
979-10-95546-86-3

Content Categories

Projects:
DGS Corpus project, EASIER
Other Tools:
SLDC

Abstract

One of the challenges that sign language researchers face is the identification of suitable language datasets, particularly for cross-lingual studies. There is no single source of information on what sign language corpora and lexical resources exist or how they compare. Instead, they have to be found through extensive literature review or word-of-mouth. The amount of information available on individual datasets can also vary widely and may be distributed across different publications, data repositories and (potentially defunct) project websites. This article introduces the Sign Language Dataset Compendium, an extensive overview of linguistic resources for sign languages. It covers existing corpora and lexical resources, as well as commonly used data collection tasks. Special attention is paid to covering resources for many different languages from around the globe. All information is provided in a standardised format to make entries comparable, but kept flexible enough to allow for differences in content. The compendium is intended as a growing resource that will be updated regularly.

Document Download

Paper PDF Poster BibTeX File+ Abstract

BibTeX Export

@inproceedings{kopf:22025:sign-lang:lrec,
  author    = {Kopf, Maria and Schulder, Marc and Hanke, Thomas},
  title     = {The {Sign} {Language} {Dataset} {Compendium}: Creating an Overview of Digital Linguistic Resources},
  pages     = {102--109},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna and Schulder, Marc},
  booktitle = {Proceedings of the {LREC2022} 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources},
  maintitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {25},
  month     = jun,
  year      = {2022},
  isbn      = {979-10-95546-86-3},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/22025.pdf}
}
Something missing or wrong?