This paper describes the creation of annotation standards for glossing sign language corpora as part of the Digging into Signs project (2014-2015). This project was based on the annotation of two major sign language corpora, the BSL Corpus (British Sign Language) and the Corpus NGT (Sign Language of the Netherlands). The focus of the gloss annotations in these data sets was in line with the starting point of most sign language corpora: to make general corpus annotation maximally useful regardless of the particular research focus. Therefore, the joint annotation guidelines that were the output of the project focus on basic annotation of hand activity, aiming to ensure that annotations can be made in a consistent way irrespective of the particular sign language. The annotation standard provides annotators with the means to create consistent annotations for various types of signs that in turn will facilitate cross-linguistic research. At the same time, the standard includes alternative strategies for some types of signs. In this paper we outline the key features of the joint annotation conventions arising from this project, describe the arguments around providing alternative strategies in a standard, as well as discuss reliability measures and improvement to annotation tools.
Kearsy Cormier, Onno Crasborn, Richard Bank. 2016. Digging into Signs: Emerging Annotation Standards for Sign Language Corpora. In Proceedings of the LREC2016 7th Workshop on the Representation and Processing of Sign Languages: Corpus Mining, pages 35–40, Portorož, Slovenia. European Language Resources Association (ELRA).
BibTeX Export
@inproceedings{cormier:16015:sign-lang:lrec,
author = {Cormier, Kearsy and Crasborn, Onno and Bank, Richard},
title = {Digging into Signs: Emerging Annotation Standards for Sign Language Corpora},
pages = {35--40},
editor = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna},
booktitle = {Proceedings of the {LREC2016} 7th Workshop on the Representation and Processing of Sign Languages: Corpus Mining},
maintitle = {10th International Conference on Language Resources and Evaluation ({LREC} 2016)},
publisher = {{European Language Resources Association (ELRA)}},
address = {Portoro{\v z}, Slovenia},
day = {28},
month = may,
year = {2016},
language = {english},
url = {https://www.sign-lang.uni-hamburg.de/lrec/pub/16015.pdf}
}