Corpus: SIGNOR Corpus | SL Data Compendium

Corpus

SIGNOR Corpus

The SIGNOR Corpus of SZJ is a collection of SZJSlovene Sign Language video data from 80 signers of Slovenia. The Corpus Signor project was based at the University of Ljubljana, ran from 2011–2014 and was led by Špela Vintar.

The annotation is based largely on the DGS Corpus Conventions (Konrad et al., 2022). Seven layers of annotation are provided: segmentation or tokenisation, glossing or lemmatisation, mouthing, HamNoSys transcription, Meaning, compositional meaning and segmentation into utterances. For the database of meanings the Slovene WordNet SloWNet (archival copy) (Fišer and Sagot, 2015) was adapted.

The recordings took place at the premises of Deaf clubs, partially at the informants homes and at the Deaf Institute Ljubljana. A moderator lead the participants through the tasks.

Language	Slovene Sign Language
Size	40 hours recorded, 30335 tokens and 1976 types annotated
Participants	80 participants
Metadata Format	information not available
Translation	information not available
Annotation	Based on Konrad et al. (2022) See Jerko and Vintar (2015) for more information
Data Format	iLex
Licence	information not available
Access	Public access via browsable homepage (temporarily unavailable at the time of writing)
Webpage	Project page: http://lojze.lugos.si/signor/en.html
Institution	University of Ljubljana
Publications	http://lojze.lugos.si/signor/en.html#objave

Cite as

information not available

Common tasks used in this corpus

Hide/Show tasks

Task	Frog Story
# recordings – open access	0
# recordings – restricted access	information not available
Data available	none
Task	Present yourself
# recordings – open access	0
# recordings – restricted access	information not available
Data available	none

References

Primary references

Boštjan Jerko, Špela Vintar (2015). "SIGNOR. Annotating for Slovene Sign Language Corpus".

References to other works

Darja Fišer, Benoît Sagot (2015). "Constructing a poor man’s wordnet in a resource-rich world". In: Language Resources and Evaluation 49(3), pp. 601-635. ISSN: 1574-0218. DOI: 10.1007/s10579-015-9295-6.
Reiner Konrad, Thomas Hanke, Gabriele Langer, Susanne König, Lutz König, Rie Nishio, Anja Regen (2022). "Public DGS Corpus: Annotation Conventions". Project Note. DOI: 10.25592/uhhfdm.822.

Further information sources

sign-lang@LREC Anthology:: Dataset "SIGNOR Corpus"

This entry was last modified on 11 April 2025.