sign-lang@LREC Anthology

A Web Tool for Building Parallel Corpora of Spoken and Sign Languages

Becker, Alex | Kepler, Fabio | Candeias, Sara


Volume: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
Venue: Portorož, Slovenia
Date: 23 to 28 May 2016
Pages: 1438–1445
Publisher: European Language Resources Association (ELRA)
License: CC BY-NC 4.0
ACL ID: L16-1229
ISBN: 978-2-9517408-9-1

Content Categories

Editors: SignCorpus Annotator, SignMaker

Abstract

In this paper we describe our work in building an online tool for manually annotating texts in any spoken language with SignWriting in any sign language. The existence of such a tool will allow the creation of parallel corpora between spoken and sign languages that can be used to bootstrap the creation of efficient tools for the Deaf community. As an example, a parallel corpus between English and American Sign Language could be used for training machine learning models for automatic translation between the two languages. Clearly, this kind of tool must be designed in a way that eases the task of human annotators, not only by being easy to use, but also by giving smart suggestions as the annotation progresses, in order to save time and effort. By building a collaborative, online, easy-to-use annotation tool for building parallel corpora between spoken and sign languages, we aim to help the development of proper resources for sign languages that can then be used in the state-of-the-art models currently used in tools for spoken languages. There are several issues and difficulties in creating this kind of resource, and our tool already deals with some of them, such as an adequate textual representation of a sign and many-to-many alignments between words and signs.


BibTeX Export

@inproceedings{becker-etal-2016-web:lrec,
  author    = {Becker, Alex and Kepler, Fabio and Candeias, Sara},
  title     = {A Web Tool for Building Parallel Corpora of Spoken and Sign Languages},
  pages     = {1438--1445},
  editor    = {Calzolari, Nicoletta and Choukri, Khalid and Declerck, Thierry and Goggi, Sara and Grobelnik, Marko and Maegaard, Bente and Mariani, Joseph and Mazo, H{\'e}l{\`e}ne and Moreno, Asuncion and Odijk, Jan and Piperidis, Stelios},
  booktitle = {10th International Conference on Language Resources and Evaluation ({LREC} 2016)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Portoro{\v z}, Slovenia},
  day       = {23--28},
  month     = may,
  year      = {2016},
  isbn      = {978-2-9517408-9-1},
  language  = {english},
  url       = {https://aclanthology.org/L16-1229}
}