sign-lang@LREC Anthology

The Construction of the CORALSE Corpus, Now and Beyond: A Tool for Documenting Spanish Sign Language

Fernández Soneira, Ana | Bao-Fente, María C. | González-Montesino, Rayco H. | Báez Montero, Inmaculada C.


Volume:
Proceedings of the LREC2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion
Venue:
Palma, Mallorca, Spain
Date:
16 May 2026
Pages:
140–147
Publisher:
European Language Resources Association (ELRA)
Licence:
CC BY-NC 4.0
sign-lang ID:
26048
ISBN:
978-2-493814-82-1

Abstract

The main objective of this paper is to present the experience of building the CORALSE corpus and to discuss the challenges that arise when attempting to provide a comprehensive description of a sign language. To this end, we address the following questions, drawing on the data obtained in the completed phases of the CORALSE project as well as on the foundational principles guiding the project's third phase. THE CORALSE CORPUS TODAY: How have we developed a linguistic corpus of sign language?, What steps have we taken in developing the CORALSE corpus?, Which informants have we recorded and what criteria have guided their selection? THE CORALSE CORPUS IN THE FUTURE: Which (native) languages do we prioritise when selecting informants?, How do the perspectives of reference signers, interpreters, educators, and psycholinguists contribute to a more complete understanding of a sign language? Corpus linguistics is understood as a set of methodologies designed to study language through collections of digitised texts. Its development over recent decades—initially driven by advances in computing and, subsequently, by the emergence of the internet—represents one of the most significant transformations in contemporary linguistic research. The projects CORALSE: Annotated Inter-university Corpus of Spanish Sign Language and Textual Typology, Registers and Styles in Spanish Sign Language: New Data for the Expansion of the CORALSE Corpus adopt a corpus linguistics approach to collect, analyse and describe a representative sample of Spanish Sign Language (LSE). We also reflect on the types of linguistic data that are truly necessary to document the actual use of Spanish Sign Language.

Document Download

Paper PDF BibTeX File+ Abstract

Cite as

Citation in ACL Citation Format

Ana Fernández Soneira, María C. Bao-Fente, Rayco H. González-Montesino, Inmaculada C. Báez Montero. 2026. The Construction of the CORALSE Corpus, Now and Beyond: A Tool for Documenting Spanish Sign Language. In Proceedings of the LREC2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion, pages 140–147, Palma, Mallorca, Spain. European Language Resources Association (ELRA).

BibTeX Export

@inproceedings{fernandezsoneira:26048:sign-lang:lrec,
  author    = {Fern{\'a}ndez Soneira, Ana and Bao-Fente, Mar{\'i}a C. and Gonz{\'a}lez-Montesino, Rayco H. and B{\'a}ez Montero, Inmaculada C.},
  title     = {The Construction of the {CORALSE} Corpus, Now and Beyond: A Tool for Documenting {Spanish} {Sign} {Language}},
  pages     = {140--147},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Mesch, Johanna and Schulder, Marc},
  booktitle = {Proceedings of the {LREC2026} 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion},
  maintitle = {15th International Conference on Language Resources and Evaluation ({LREC} 2026)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Palma, Mallorca, Spain},
  day       = {16},
  month     = may,
  year      = {2026},
  isbn      = {978-2-493814-82-1},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/26048.html}
}
Something missing or wrong?