To support language documentation, linguistic research, and acquisition of Sign Language of the Netherlands (NGT), we are expanding the NGT dataset in the lexical database Global Signbank. Our most prioritized goal is to add ca. 11,000 glosses (entries). We further aim at adding ca. 3,000 example sentences and to provide linguistic information with as many glosses as possible. As for linguistic information, Signbank allows for extensive phonological descriptions of signs, and the addition of multiple senses per sign, which we would like to connect to synsets in the Multilingual Sign Language Wordnet. Additionally, we are recording extra video data: we make multiple videos of the same sign, taken from different angles, and videos with non-manual expressions. Furthermore, we are collecting motion capture data, for improved (automatic) sign language recognition and production in the future. In this paper, we describe how we proceed, the decisions that have been made so far, and future uses of the dataset.
Ulrika Klomp, Lisa Gierman, Pieter Manders, Ellen Yassine Nauta, Gomèr Otterspeer, Ray Pelupessy, Galya Stern, Dalene Venter, Casper Wubbolts, Marloes Oomen, Floris Roelofsen. 2024. An Extension of the NGT Dataset in Global Signbank. In Proceedings of the LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, pages 292–297, Torino, Italy. ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL).
BibTeX Export
@inproceedings{klomp:24036:sign-lang:lrec,
author = {Klomp, Ulrika and Gierman, Lisa and Manders, Pieter and Nauta, Ellen Yassine and Otterspeer, Gom{\`e}r and Pelupessy, Ray and Stern, Galya and Venter, Dalene and Wubbolts, Casper and Oomen, Marloes and Roelofsen, Floris},
title = {An Extension of the {NGT} Dataset in {Global} {Signbank}},
pages = {292--297},
editor = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Mesch, Johanna and Schulder, Marc},
booktitle = {Proceedings of the {LREC-COLING} 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources},
maintitle = {2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation ({LREC-COLING} 2024)},
publisher = {{ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL)}},
address = {Torino, Italy},
day = {25},
month = may,
year = {2024},
isbn = {978-2-493814-30-2},
language = {english},
url = {https://www.sign-lang.uni-hamburg.de/lrec/pub/24036.pdf}
}