Corpora
The compendium contains 43 linguistic corpora from around the world.
To be included, a corpus must contain (semi-)spontaneous signing, provide transcriptions or translations for at least some of its content, contain at least 10 hours of sign language recordings, and fulfill the general curation criteria of the compendium. If none of a language's corpora meet the size requirement, corpora of that language with at least 5 hours of recordings may still be included. Multilingual corpora are included irrespective of their size.
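The inclusion rules above can be read as a small decision procedure. The following Python sketch is only an illustration of that reading, not the compendium's actual tooling: the `Corpus` record, its field names, and the functions `meets_base_criteria`, `is_included`, and `select` are all hypothetical. It also makes two assumptions the text does not spell out: that multilingual corpora are exempt from the size requirement only (not from the other criteria), and that a language counts as covered once any of its otherwise-qualifying corpora reaches 10 hours.

```python
from dataclasses import dataclass

FULL_SIZE_HOURS = 10.0     # regular size requirement
FALLBACK_SIZE_HOURS = 5.0  # relaxed requirement for otherwise uncovered languages


@dataclass
class Corpus:
    # Hypothetical record; the field names are illustrative only.
    name: str
    language: str
    hours: float
    spontaneous: bool            # contains (semi-)spontaneous signing
    annotated: bool              # some transcriptions or translations exist
    multilingual: bool = False
    general_criteria: bool = True  # general curation criteria of the compendium


def meets_base_criteria(c: Corpus) -> bool:
    """Criteria that apply regardless of corpus size."""
    return c.spontaneous and c.annotated and c.general_criteria


def is_included(c: Corpus, candidates: list[Corpus]) -> bool:
    """Apply the size rules under the assumptions stated in the lead-in."""
    if not meets_base_criteria(c):
        return False
    # Assumption: multilingual corpora skip only the size check.
    if c.multilingual or c.hours >= FULL_SIZE_HOURS:
        return True
    # Fallback: the language has no otherwise-qualifying corpus of full size.
    language_covered = any(
        other.language == c.language
        and meets_base_criteria(other)
        and other.hours >= FULL_SIZE_HOURS
        for other in candidates
    )
    return not language_covered and c.hours >= FALLBACK_SIZE_HOURS


def select(candidates: list[Corpus]) -> list[Corpus]:
    """Return the candidates that pass the inclusion rules, in input order."""
    return [c for c in candidates if is_included(c, candidates)]
```

For example, given two made-up candidates for the same language with 12 and 6 hours of recordings, only the 12-hour corpus passes; if the 6-hour corpus were the largest one for its language, it would pass through the 5-hour fallback.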
- A reference corpus of the Malian Sign Language/Langue des Signes Malienne (LSM)
- Adamorobe Sign Language Corpus
- Auslan Corpus
- Australian Irish Sign Language
- Black ASL Project Corpus
- British Sign Language Corpus
- Catalan Sign Language Corpus
- CORLSE
- Corpus LSFB
- Corpus NGT
- Corpus of Finnish Sign Language
- Corpus Vlaamse Gebarentaal
- CREAGEST
- Danish Sign Language Corpus
- DGS Corpus
- Dicta-Sign Corpus
- Dicta-Sign-GSL-v2
- Dicta-Sign-LSF-v2
- Documentation and description of a sign language in Côte d’Ivoire
- Documentation and description of Inuit Sign Language
- Documentation of Extreme North Cameroon Sign Language and Cameroon Sign Language
- Dogon Sign Language Corpus
- ECHO Corpus
- Giving Cognition a Hand Corpus
- Hong Kong Sign Language Corpus
- Hungarian Sign Language Corpus
- IPROSLA Corpus
- Italian Sign Language Corpus
- Japanese Sign Language Colloquial Corpus
- Kata Kolok Corpus
- Korean Sign Language Corpus
- LESCO Corpus
- MEDIAPI-SKEL
- PJM Corpus
- POLYTROPON Parallel Corpus
- Russian Sign Language Corpus
- Signing in a ‘deaf family’ – documentation of the Mardin Sign Language, Turkey
- SIGNOR Corpus
- Signs of Ireland
- Swedish Sign Language corpus
- VIDI Sign Space Corpus
- Visibase Corpus
- ZEI Corpus