Crowdsourcing Kazakh-Russian Sign Language: FluentSigners-50
Mukushev, Medet | Ubingazhibov, Aidyn | Kydyrbekova, Aigerim | Imashev, Alfarabi | Kimmelman, Vadim | Sandygulova, Anara
- Volume: Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022)
- Venue: Marseille, France
- Date: 20 to 25 June 2022
- Pages: 2541–2547
- Publisher: European Language Resources Association (ELRA)
- License: CC BY-NC 4.0
- ACL ID: 2022.lrec-1.271
- ISBN: 979-10-95546-72-6
Content Categories
- Projects: K-SLARS
- Languages: Kazakh-Russian Sign Language
- Corpora: FluentSigners-50
Abstract
This paper presents the methodology we used to crowdsource the collection of a new large-scale, signer-independent dataset for Kazakh-Russian Sign Language (KRSL), created for Sign Language Processing. Involving the Deaf community throughout the research process, we first designed a research protocol and then ran an efficient crowdsourcing campaign that produced the new FluentSigners-50 dataset. FluentSigners-50 consists of 173 sentences performed by 50 KRSL signers, for a total of 43,250 video samples. Contributors recorded videos in real-life settings, against various backgrounds and with various devices such as smartphones and web cameras. As a result, contributions vary in distance to the camera, camera angle, aspect ratio, video quality, and frame rate. The dataset also contains a high degree of linguistic and inter-signer variability and is therefore a better training set for recognizing real-life signed speech. FluentSigners-50 is publicly available at https://krslproject.github.io/fluentsigners-50/
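The three counts quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function name `recordings_per_pair` is hypothetical, and the reading that every signer recorded every sentence the same number of times is an assumption, not something this record states.

```python
# Figures quoted in the abstract.
NUM_SENTENCES = 173
NUM_SIGNERS = 50
NUM_VIDEOS = 43_250


def recordings_per_pair(num_videos: int, num_sentences: int, num_signers: int):
    """Return (pairs, quotient, remainder) for videos per sentence-signer pair.

    Assumes (hypothetically) that recordings are spread evenly over all
    sentence-signer pairs; a non-zero remainder would contradict that.
    """
    pairs = num_sentences * num_signers
    quotient, remainder = divmod(num_videos, pairs)
    return pairs, quotient, remainder


pairs, per_pair, remainder = recordings_per_pair(
    NUM_VIDEOS, NUM_SENTENCES, NUM_SIGNERS
)
print(f"sentence-signer pairs: {pairs}")
print(f"videos per pair: {per_pair} (remainder {remainder})")
```

Under that even-spread assumption, the totals are consistent with each signer recording each sentence several times rather than once; the dataset page is the authority on the actual recording protocol.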
Cite as
Citation in ACL Citation Format
Medet Mukushev, Aidyn Ubingazhibov, Aigerim Kydyrbekova, Alfarabi Imashev, Vadim Kimmelman, Anara Sandygulova. 2022. Crowdsourcing Kazakh-Russian Sign Language: FluentSigners-50. In Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022), pages 2541–2547, Marseille, France. European Language Resources Association (ELRA).
BibTeX Export
@inproceedings{mukushev-etal-2022-crowdsourcing:lrec,
  author    = {Mukushev, Medet and Ubingazhibov, Aidyn and Kydyrbekova, Aigerim and Imashev, Alfarabi and Kimmelman, Vadim and Sandygulova, Anara},
  title     = {Crowdsourcing {Kazakh-Russian} {Sign} {Language}: {FluentSigners-50}},
  pages     = {2541--2547},
  editor    = {Calzolari, Nicoletta and B{\'e}chet, Fr{\'e}d{\'e}ric and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph and Mazo, H{\'e}l{\`e}ne and Odijk, Jan and Piperidis, Stelios},
  booktitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {20--25},
  month     = jun,
  year      = {2022},
  isbn      = {979-10-95546-72-6},
  language  = {english},
  url       = {http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.271}
}