KoSign Sign Language Translation Project: Introducing The NIASL2021 Dataset
Huerta-Enochian, Mathew
| Lee, Du Hui | Myung, Hye Jin | Byun, Kang Suk | Lee, Jun Woo
- Volume:
- Proceedings of the 7th International Workshop on Sign Language Translation and Avatar Technology: The Junction of the Visual and the Textual: Challenges and Perspectives
- Venue:
- Marseille, France
- Date:
- 24 June 2022
- Pages:
- 59–66
- Publisher:
- European Language Resources Association (ELRA)
- License:
- CC BY-NC 4.0
- ACL ID:
- 2022.sltat-1.9
- ISBN:
- 979-10-95546-82-5
Content Categories
- Projects:
- KoSign
- Languages:
- Korean Sign Language, Korean
- Corpora:
- NIASL2021
Abstract
We introduce a new sign language production (SLP) and sign language translation (SLT) dataset, NIASL2021, consisting of 201,026 Korean-KSL data pairs. KSL translations of Korean source texts are represented in three formats: video recordings, keypoint position data, and time-aligned gloss annotations for each hand (using a 7,989 sign vocabulary) and for eight different non-manual signals (NMS). We evaluated our sign language elicitation methodology and found that text-based prompting had a negative effect on translation quality in terms of naturalness and comprehension. We recommend distilling text into a visual medium before translating into sign language or adding a prompt-blind review step to text-based translation methodologies.Document Download
Paper PDF BibTeX File + Abstract
Cite as
Citation in ACL Citation Format
Mathew Huerta-Enochian, Du Hui Lee, Hye Jin Myung, Kang Suk Byun, Jun Woo Lee. 2022. KoSign Sign Language Translation Project: Introducing The NIASL2021 Dataset. In Proceedings of the 7th International Workshop on Sign Language Translation and Avatar Technology: The Junction of the Visual and the Textual: Challenges and Perspectives, pages 59–66, Marseille, France. European Language Resources Association (ELRA).BibTeX Export
@inproceedings{huertaenochian:70012:sltat:lrec,
author = {Huerta-Enochian, Mathew and Lee, Du Hui and Myung, Hye Jin and Byun, Kang Suk and Lee, Jun Woo},
title = {{KoSign} Sign Language Translation Project: Introducing The {NIASL2021} Dataset},
pages = {59--66},
editor = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and McDonald, John C. and Shterionov, Dimitar and Wolfe, Rosalee},
booktitle = {Proceedings of the 7th International Workshop on Sign Language Translation and Avatar Technology: The Junction of the Visual and the Textual: Challenges and Perspectives},
maintitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)},
publisher = {{European Language Resources Association (ELRA)}},
address = {Marseille, France},
day = {24},
month = jun,
year = {2022},
isbn = {979-10-95546-82-5},
language = {english},
url = {http://www.lrec-conf.org/proceedings/lrec2022/workshops/sltat/pdf/2022.sltat-1.9}
}