sign-lang@LREC Anthology

Crowdsourcing Kazakh-Russian Sign Language: FluentSigners-50

Mukushev, Medet ORCID button Mukushev, Medet | Ubingazhibov, Aidyn | Kydyrbekova, Aigerim | Imashev, Alfarabi | Kimmelman, Vadim ORCID button Kimmelman, Vadim | Sandygulova, Anara ORCID button Sandygulova, Anara


Volume:
Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022)
Venue:
Marseille, France
Date:
20 to 25 June 2022
Pages:
2541–2547
Publisher:
European Language Resources Association (ELRA)
License:
CC BY-NC 4.0
ACL ID:
2022.lrec-1.271
ISBN:
979-10-95546-72-6

Content Categories

Projects:
K-SLARS
Languages:
Kazakh-Russian Sign Language
Corpora:
FluentSigners-50

Abstract

This paper presents the methodology we used to crowdsource a data collection of a new large-scale signer independent dataset for Kazakh-Russian Sign Language (KRSL) created for Sign Language Processing. By involving the Deaf community throughout the research process, we firstly designed a research protocol and then performed an efficient crowdsourcing campaign that resulted in a new FluentSigners-50 dataset. The FluentSigners-50 dataset consists of 173 sentences performed by 50 KRSL signers for 43,250 video samples. Dataset contributors recorded videos in real-life settings on various backgrounds using various devices such as smartphones and web cameras. Therefore, each dataset contribution has a varying distance to the camera, camera angles and aspect ratio, video quality, and frame rates. Additionally, the proposed dataset contains a high degree of linguistic and inter-signer variability and thus is a better training set for recognizing a real-life signed speech. FluentSigners-50 is publicly available at https://krslproject.github.io/fluentsigners-50/

Document Download

Paper PDF BibTeX File+ Abstract

BibTeX Export

@inproceedings{mukushev-etal-2022-crowdsourcing:lrec,
  author    = {Mukushev, Medet and Ubingazhibov, Aidyn and Kydyrbekova, Aigerim and Imashev, Alfarabi and Kimmelman, Vadim and Sandygulova, Anara},
  title     = {Crowdsourcing {Kazakh-Russian} {Sign} {Language}: {FluentSigners-50}},
  pages     = {2541--2547},
  editor    = {Calzolari, Nicoletta and Fr{\'e}d{\'e}ric B{\'e}chet and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph and Mazo, H{\'e}l{\`e}ne and Odijk, Jan and Piperidis, Stelios},
  booktitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {20--25},
  month     = jun,
  year      = {2022},
  isbn      = {979-10-95546-72-6},
  language  = {english},
  url       = {http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.271}
}
Something missing or wrong?