SIGNOR Corpus
The SIGNOR Corpus of SZJ is a collection of SZJSlovene Sign Language video data from 80 signers of Slovenia. The Corpus Signor project was based at the University of Ljubljana, ran from 2011–2014 and was led by Špela Vintar.
The annotation is based largely on the DGS Corpus Conventions (Konrad et al., 2022). Seven layers of annotation are provided: segmentation or tokenisation, glossing or lemmatisation, mouthing, HamNoSys transcription, Meaning, compositional meaning and segmentation into utterances. For the database of meanings the Slovene WordNet SloWNet (archival copy) (Fišer and Sagot, 2015) was adapted.
The recordings took place at the premises of Deaf clubs, partially at the informants homes and at the Deaf Institute Ljubljana. A moderator lead the participants through the tasks.
Language | Slovene Sign Language |
---|---|
Size | 40 hours recorded, 30335 tokens and 1976 types annotated |
Participants | 80 participants |
Metadata Format | information not available |
Translation | information not available |
Annotation |
Based on Konrad et al. (2022)
See Jerko and Vintar (2015) for more information |
Data Format | iLex |
Licence | information not available |
Access | Public access via browsable homepage (temporarily unavailable at the time of writing) |
Webpage |
Project page: http://lojze.lugos.si/signor/en.html |
Institution | University of Ljubljana |
Publications |
http://lojze.lugos.si/signor/en.html#objave |
Cite as
information not available
Common tasks used in this corpus
Hide/Show tasks
Task | Frog Story |
---|---|
# recordings – open access | 0 |
# recordings – restricted access | information not available |
Data available | none |
Task | Present yourself |
# recordings – open access | 0 |
# recordings – restricted access | information not available |
Data available | none |
References
Primary references
- Boštjan Jerko, Špela Vintar (2015). "SIGNOR. Annotating for Slovene Sign Language Corpus".
References to other works
- Darja Fišer, Benoît Sagot (2015). "Constructing a poor man’s wordnet in a resource-rich world". In: Language Resources and Evaluation 49(3), pp. 601-635. ISSN: 1574-0218. DOI: 10.1007/s10579-015-9295-6.
- Reiner Konrad, Thomas Hanke, Gabriele Langer, Susanne König, Lutz König, Rie Nishio, Anja Regen (2022). "Public DGS Corpus: Annotation Conventions". Project Note. DOI: 10.25592/uhhfdm.822.
Further information sources
- sign-lang@LREC Anthology:
- Dataset "SIGNOR Corpus"
This entry was last modified on 11 April 2025.