The Sign Language Dataset Compendium


Corpus

CORLSE

The Corpus de la Lengua de Signos Española (CORLSE) is a collection of LSESpanish Sign Language video clips of 246 signers from Spain. CORLSE is based at the Centro de Normalización Lingüística de la Lengua de Signos Española (CNLSE). The creation of the corpus started 2014 and is ongoing. The first data collection (2016–2018) produced recordings in four regions. A second data collection (2022–2023) visited thirteen regions.

The recordings were made in the premises of the associations of the deaf. The participants were recorded in pairs, sitting opposite each other in front of a blue background. A deaf interviewer was leading through the tasks. Recording sessions lasted 2–3 hours.

Language Spanish Sign Language
Size 420 hours of recordings.
Public recordings for 108 participant pairs.
Participants 246 participants (216 public)
4 age groups: 18–30, 31–50, 51–65 and 65 years and older
18 locations in 11 autonomous communities
Metadata Format information not available
Translation Spanish, size unknown
Annotation Centro de Normalización Lingüística de la Lengua de Signos Española (CNLSE) (2023).
Follows the system of the majority of European corpora, together with the Australian (Johnston, 2010) and Catalan Sign Language corpus.
Data Format ELAN
Licence information not available
Access Public access via browsable homepage
Restricted access for researchers, teachers and experts to annotations and sociolinguistic data requires confirmed registration
Webpages Project page: https://corpuslse.es/
Dataset: https://corpuslse.es/corpus
Institution Centro de Normalización Lingüística de la Lengua de Signos Española (CNLSE)
Publications Esteban Saiz and Villameriel García (2023)

Cite as

information not available

Common tasks used in this corpus

Hide/Show tasks
Task Debate
# recordings – open access 107
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=96&c9=asoc_codigo (Debate Movimiento Asociativo)
Task Diachronic changes
# recordings – open access 105
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=0&c9=diacr_codigo (Debate diacronía en LSE)
Task Frog Story
# recordings – open access 102
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=0&c9=rana_codigo (Rana, ¿dónde estás?)
Task Lexical elicitation
# recordings – open access 104
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=0&c9=chyc_codigo (Cuerpo humano y colores)
Task Pear Story
# recordings – open access 81
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=0&c9=pera_codigo (Historia de la pera)
Task Present yourself
# recordings – open access 106
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=0&c9=pres_codigo (Presentación)
Task Subject areas
# recordings – open access 104 (childhood), 106 (education), 102 (work)
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=0&c9=infan_codigo (Childhood – Anéctodas de la infancia)
https://corpuslse.es/corpus#b_start=0&c9=edu_codigo (Education – Experiencia educativa)
https://corpuslse.es/corpus#b_start=0&c9=tya_codigo (Work – Trabajo y aficiones)
Task Sylvester and Tweety
# recordings – open access 98
# recordings – restricted access information not available
Data available https://corpuslse.es/corpus#b_start=40&c9=syp_codigo (Silvestre y Piolín)

References

Primary references

References to other works

This entry was last modified on 27 March 2025.