sign-lang@LREC Anthology

SwissSLi: the Multi-parallel Sign Language Corpus for Switzerland

Jiang, Zifan | Göhring, Anne ORCID button Göhring, Anne | Moryossef, Amit | Sennrich, Rico | Ebling, Sarah ORCID button Ebling, Sarah


Volume:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Venue:
Torino, Italy
Date:
20 to 25 May 2024
Pages:
15448–15456
Publisher:
ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL)
License:
CC BY-NC 4.0
ACL ID:
2024.lrec-main.1342
ISBN:
978-2-493814-10-4

Abstract

In this work, we introduce SwissSLi, the first sign language corpus that contains parallel data of all three Swiss sign languages, namely Swiss German Sign Language (DSGS), French Sign Language of Switzerland (LSF-CH), and Italian Sign Language of Switzerland (LIS-CH). The data underlying this corpus originates from television programs in three spoken languages: German, French, and Italian. The programs have for the most part been translated into sign language by deaf translators, resulting in a unique, up to six-way multi-parallel dataset between spoken and sign languages. We describe and release the sign language videos and spoken language subtitles as well as the overall statistics and some derivatives of the raw material. These derived components include cropped videos, pose estimation, phrase/sign-segmented videos, and sentence-segmented subtitles, all of which facilitate downstream tasks such as sign language transcription (glossing) and machine translation. The corpus is publicly available on the SWISSUbase data platform for research purposes only under a CC BY-NC-SA 4.0 license.

Document Download

Paper PDF BibTeX File+ Abstract

BibTeX Export

@inproceedings{jiang-etal-2024-swisssli:lrec,
  author    = {Jiang, Zifan and G{\"o}hring, Anne and Moryossef, Amit and Sennrich, Rico and Ebling, Sarah},
  title     = {SwissSLi: the Multi-parallel Sign Language Corpus for Switzerland},
  pages     = {15448--15456},
  editor    = {Calzolari, Nicoletta and Kan, Min-Yen and Hoste, Veronique and Lenci, Alessandro and Sakti, Sakriani and Xue, Nianwen},
  booktitle = {2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation ({LREC-COLING} 2024)},
  publisher = {{ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL)}},
  address   = {Torino, Italy},
  day       = {20--25},
  month     = may,
  year      = {2024},
  isbn      = {978-2-493814-10-4},
  language  = {english},
  url       = {https://aclanthology.org/2024.lrec-main.1342}
}
Something missing or wrong?