A First Corpus of AZee Discourse Expressions
Challant, Camille | Filhol, Michael
- Volume: Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022)
- Venue: Marseille, France
- Date: 20 to 25 June 2022
- Pages: 1560–1565
- Publisher: European Language Resources Association (ELRA)
- License: CC BY-NC 4.0
- ACL ID: 2022.lrec-1.167
- ISBN: 979-10-95546-72-6
Content Categories
- Languages: French Sign Language, French
- Corpora: 40 brèves
- Writing Systems: AZee
Abstract
This paper presents a corpus of AZee discourse expressions, i.e. expressions that formally describe Sign Language utterances of any length using the AZee approach and language. The construction of this corpus had two main goals: to provide a first reference corpus for AZee, and to test its coverage on a significant sample of real-life utterances. We worked on productions from an existing corpus, namely the "40 brèves", containing an hour of French Sign Language. We wrote the corresponding AZee discourse expressions for the entire video content, i.e. expressions capturing the forms produced by the signers and their associated meaning by combining known production rules, a basic building block for these expressions. These are made available as a version 2 extension of the "40 brèves". We explain how these expressions can be built, present the resulting corpus and the set of production rules used, and perform first measurements on it. We also propose an evaluation of our corpus: AZee can describe 94% of the one hour of discourse, and ongoing studies are increasing this coverage. This corpus opens many future prospects, for instance concerning synthesis with virtual signers, machine translation or formal grammars for Sign Language.
Cite as
Citation in ACL Citation Format
Camille Challant, Michael Filhol. 2022. A First Corpus of AZee Discourse Expressions. In Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022), pages 1560–1565, Marseille, France. European Language Resources Association (ELRA).
BibTeX Export
@inproceedings{challant-filhol-2022-corpus:lrec,
  author = {Challant, Camille and Filhol, Michael},
  title = {A First Corpus of {AZee} Discourse Expressions},
  pages = {1560--1565},
  editor = {Calzolari, Nicoletta and B{\'e}chet, Fr{\'e}d{\'e}ric and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph and Mazo, H{\'e}l{\`e}ne and Odijk, Jan and Piperidis, Stelios},
  booktitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)},
  publisher = {{European Language Resources Association (ELRA)}},
  address = {Marseille, France},
  day = {20--25},
  month = jun,
  year = {2022},
  isbn = {979-10-95546-72-6},
  language = {english},
  url = {http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.167}
}