sign-lang@LREC Anthology

Towards Large-Scale Data Mining for Data-Driven Analysis of Sign Languages

Mocialov, Boris | Turner, Graham | Hastie, Helen


Volume:
Proceedings of the LREC2020 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives
Venue:
Marseille, France
Date:
16 May 2020
Pages:
145–150
Publisher:
European Language Resources Association (ELRA)
License:
CC BY-NC 4.0
sign-lang ID:
20003
ACL ID:
2020.signlang-1.24
ISBN:
979-10-95546-54-2

Content Categories

Languages:
American Sign Language, Libras
Corpora:
Interpreted Songs Corpus

Abstract

Access to sign language data is far from adequate. We show that it is possible to collect the data from social networking services such as TikTok, Instagram, and YouTube by applying data filtering to enforce quality standards and by discovering patterns in the filtered data, making it easier to analyse and model. Using our data collection pipeline, we collect and examine the interpretation of songs in both the American Sign Language (ASL) and the Brazilian Sign Language (Libras). We explore their differences and similarities by looking at the co-dependence of the orientation and location phonological parameters.

Keywords

Document Download

Paper PDF BibTeX File+ Abstract

BibTeX Export

@inproceedings{mocialov:20003:sign-lang:lrec,
  author    = {Mocialov, Boris and Turner, Graham and Hastie, Helen},
  title     = {Towards Large-Scale Data Mining for Data-Driven Analysis of Sign Languages},
  pages     = {145--150},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna},
  booktitle = {Proceedings of the {LREC2020} 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives},
  maintitle = {12th International Conference on Language Resources and Evaluation ({LREC} 2020)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {16},
  month     = may,
  year      = {2020},
  isbn      = {979-10-95546-54-2},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/20003.pdf}
}
Something missing or wrong?