Swedish Sign Language corpus
The Swedish Sign Language Corpus is a collection of 42 signers from Sweden using STSSwedish Sign Language. The STS Corpus project is based at Stockholm University, lasted from 2009–2011 and was led by Johanna Mesch.
The STS Corpus is one of a few corpora that provide part-of-speech tagging.
The signers were recorded in pairs, sitting opposite each other in a studio with a dark background. Five cameras were used for recording.
Language | Swedish Sign Language |
---|---|
Size | 24 hours recorded, 190000 tokens and 18800 types annotated |
Participants |
42 participants
20–82 years old From three regions |
Metadata Format | information not available |
Translation | Swedish, 14 hours translated (60%) |
Annotation |
See Wallin and Mesch (2018), inspired by Nonhebel et al. (2004) and Johnston (2010)
24 hours transcribed (100%) |
Data Format | ELAN |
Licence | CC BY-NC-SA (version unspecified) |
Access |
Public access via browsable homepage
Open access to video and parts of annotation data |
Webpages |
Project page: https://www.su.se/english/research/research-projects/swedish-sign-language-corpus Public access: https://teckensprakskorpus.su.se Dataset: https://ling33.ling.su.se/sslc/video/ (archival copy) |
Institution | Stockholm University |
Publications |
https://su.diva-portal.org/smash/resultList.jsf?query=In2008-0276-IK |
Cite as
See: https://teckensprakskorpus.su.se/information/kallhanvisning
Common tasks used in this corpus
Hide/Show tasks
Task | Deaf life experiences |
---|---|
# recordings – open access | 12 |
# recordings – restricted access | information not available |
Data available | https://ling33.ling.su.se/sslc/video/ (archival copy) (Döv ...) |
Task | Free conversation |
# recordings – open access | 0 |
# recordings – restricted access | information not available |
Data available | none |
Task | Frog Story |
# recordings – open access | 23 |
# recordings – restricted access | information not available |
Data available | https://ling33.ling.su.se/sslc/video/ (archival copy) (Var är du, grodan?) |
Task | Mr. Bean |
# recordings – open access | 21 |
# recordings – restricted access | information not available |
Data available |
https://ling33.ling.su.se/sslc/video/ (archival copy) (Film Mr. Bean) |
Task | Present yourself |
# recordings – open access | 22 |
# recordings – restricted access | information not available |
Data available | https://ling33.ling.su.se/sslc/video/ (archival copy) (Presentation) |
Task | Sign Name |
# recordings – open access | 0 |
# recordings – restricted access | information not available |
Data available | none |
Task | Snowman Story |
# recordings – open access | 21 |
# recordings – restricted access | information not available |
Data available | https://ling33.ling.su.se/sslc/video/ (archival copy) (Snögubben) |
References
Primary references
- Lars Wallin, Johanna Mesch (2018). "Annoteringskonventioner för teckenspråkstexter". Annotation Convention. 49 pp.
References to other works
- Annika Nonhebel, Onno A. Crasborn, Els van der Kooij (2004). "Sign language transcription conventions for the ECHO Project". Annotation Convention. 11 pp.
- Trevor Johnston (2010). "From archive to corpus: Transcription and annotation in the creation of signed language corpora". In: International Journal of Corpus Linguistics 15(1). John Benjamins, pp. 106-131. ISSN: 1569-9811. DOI: 10.1075/ijcl.15.1.05joh.
Further information sources
- sign-lang@LREC Anthology:
- Dataset "STS-korpus"
This entry was last modified on 11 April 2025.