Resources for Computer-Based Sign Recognition from Video, and the Criticality of Consistency of Gloss Labeling across Multiple Large ASL Video Corpora

Neidle, Carol | Opoku, Augustine | Ballard, Carey M. | Dafnis, Konstantinos M. | Chroni, Evgenia | Metaxas, Dimitris

Volume:: Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources
Venue:: Marseille, France
Date:: 25 June 2022
Pages:: 165–172
Publisher:: European Language Resources Association (ELRA)
Licence:: CC BY-NC 4.0
sign-lang ID:: 22037
ACL ID:: 2022.signlang-1.26
ISBN:: 979-10-95546-86-3

Content Categories

Projects:: ASLLRP
Languages:: American Sign Language
Corpora:: ASLLRP, WLASL

Abstract

The WLASL purports to be “the largest video dataset for Word-Level American Sign Language (ASL) recognition.” It brings together various publicly shared video collections that could be quite valuable for sign recognition research, and it has been used extensively for such research. However, a critical problem with the accompanying annotations has heretofore not been recognized by the authors, nor by those who have exploited these data: There is no 1-1 correspondence between sign productions and gloss labels. Here we describe a large (and recently expanded and enhanced), linguistically annotated, downloadable, video corpus of citation-form ASL signs shared by the American Sign Language Linguistic Research Project (ASLLRP)—with 23,452 sign tokens and an online Sign Bank—in which such correspondences are enforced. We furthermore provide annotations for 19,672 of the WLASL video examples consistent with ASLLRP glossing conventions. For those wishing to use WLASL videos, this provides a set of annotations that makes it possible: (1) to use those data reliably for computational research; and/or (2) to combine the WLASL and ASLLRP datasets, creating a combined resource that is larger and richer than either of those datasets individually, with consistent gloss labeling for all signs. We also offer a summary of our own sign recognition research to date that exploits these data resources.
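The labeling problem described in the abstract can be illustrated with a small check. The following is a minimal sketch, not the authors' tooling: it assumes annotations are available as (sign identifier, gloss label) pairs, and the function name `find_label_conflicts`, the identifiers `sign_A` to `sign_D`, and the labels `GLOSS_1` to `GLOSS_4` are hypothetical placeholders rather than entries from WLASL or the ASLLRP Sign Bank. It reports gloss labels attached to more than one distinct sign, and signs that carry more than one gloss label, which are the two ways a 1-1 correspondence can fail.

```python
from collections import defaultdict


def find_label_conflicts(annotations):
    """Report violations of a 1-1 sign/gloss correspondence.

    `annotations` is an iterable of (sign_id, gloss) pairs (an assumed,
    simplified layout). Returns two dicts:
      - glosses that label more than one distinct sign
      - signs that carry more than one distinct gloss
    """
    signs_by_gloss = defaultdict(set)
    glosses_by_sign = defaultdict(set)
    for sign_id, gloss in annotations:
        signs_by_gloss[gloss].add(sign_id)
        glosses_by_sign[sign_id].add(gloss)

    ambiguous_glosses = {
        gloss: sorted(signs)
        for gloss, signs in signs_by_gloss.items()
        if len(signs) > 1
    }
    split_signs = {
        sign_id: sorted(glosses)
        for sign_id, glosses in glosses_by_sign.items()
        if len(glosses) > 1
    }
    return ambiguous_glosses, split_signs


# Hypothetical toy annotations: one gloss shared by two signs,
# and one sign labeled with two different glosses.
toy = [
    ("sign_A", "GLOSS_1"),
    ("sign_B", "GLOSS_1"),
    ("sign_C", "GLOSS_2"),
    ("sign_C", "GLOSS_3"),
    ("sign_D", "GLOSS_4"),
]

ambiguous, split = find_label_conflicts(toy)
print("Glosses used for several signs:", ambiguous)
print("Signs with several glosses:", split)
```

A dataset passes the check when both returned dictionaries are empty. Under the same assumptions, relabeling every video with a single agreed gloss per sign is what would allow two collections to be merged without such conflicts.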

Document Download

Paper PDF | Poster | BibTeX File + Abstract

Video Presentation

Languages:: International Sign, English
Subtitle:: English

Cite as

Citation in ACL Citation Format

Carol Neidle, Augustine Opoku, Carey M. Ballard, Konstantinos M. Dafnis, Evgenia Chroni, and Dimitris Metaxas. 2022. Resources for Computer-Based Sign Recognition from Video, and the Criticality of Consistency of Gloss Labeling across Multiple Large ASL Video Corpora. In Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources, pages 165–172, Marseille, France. European Language Resources Association (ELRA).

BibTeX Export

@inproceedings{neidle:22037:sign-lang:lrec,
  author    = {Neidle, Carol and Opoku, Augustine and Ballard, Carey M. and Dafnis, Konstantinos M. and Chroni, Evgenia and Metaxas, Dimitris},
  title     = {Resources for Computer-Based Sign Recognition from Video, and the Criticality of Consistency of Gloss Labeling across Multiple Large {ASL} Video Corpora},
  pages     = {165--172},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna and Schulder, Marc},
  booktitle = {Proceedings of the {LREC2022} 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources},
  maintitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {25},
  month     = jun,
  year      = {2022},
  isbn      = {979-10-95546-86-3},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/22037.html}
}