Swedish Sign Language corpus
The Swedish Sign Language Corpus (STS Corpus) contains recordings of 42 signers from Sweden using Swedish Sign Language. The STS Corpus project, based at Stockholm University, ran from 2009 to 2011 and was led by Johanna Mesch.
The STS Corpus is one of a few corpora that provide part-of-speech tagging.
The signers were recorded in pairs, sitting opposite each other in a studio with a dark background. Five cameras were used for recording.
Language | Swedish Sign Language |
---|---|
Size | 24 hours recorded, 190000 tokens and 18800 types annotated |
Participants | 42 participants; 20–82 years old; from three regions |
Metadata Format | not available |
Translation | Swedish, 14 hours translated (60%) |
Annotation | See Wallin and Mesch (2018), inspired by Nonhebel et al. (2004) and Johnston (2010); 24 hours transcribed (100%) |
Data Format | ELAN |
Licence | CC BY-NC-SA (version unspecified) |
Access | Public access via browsable homepage; open access to video and parts of annotation data |
Webpages | Project page: https://www.ling.su.se/teckenspraksresurser/teckensprakskorpusar/svensk-teckensprakskorpus; Dataset: https://ling33.ling.su.se/sslc/video/; Public access: https://teckensprakskorpus.su.se |
Institution | Stockholm University |
Publications | https://www.ling.su.se/teckenspraksresurser/teckensprakskorpusar/svensk-teckensprakskorpus/publikationer |
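Because the annotation data is distributed in ELAN format, the files are XML documents (`.eaf`) that can be read with standard tools. The following is a minimal sketch, not the project's own tooling: it reads the time-aligned annotations of one tier using only the Python standard library. The embedded sample document, the tier name `example-gloss`, and the glosses are invented for illustration; the corpus's actual tier names are described in the annotation conventions and may differ.

```python
import xml.etree.ElementTree as ET

# Hand-written EAF fragment for illustration only.
# The tier name and glosses are hypothetical, not taken from the corpus.
SAMPLE_EAF = """
<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="480"/>
    <TIME_SLOT TIME_SLOT_ID="ts3" TIME_VALUE="900"/>
  </TIME_ORDER>
  <TIER TIER_ID="example-gloss">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
          TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>HELLO</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a2"
          TIME_SLOT_REF1="ts2" TIME_SLOT_REF2="ts3">
        <ANNOTATION_VALUE>NAME</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>
"""


def read_tier(root, tier_id):
    """Return (start_ms, end_ms, value) tuples for one time-aligned tier."""
    # Map time slot IDs to millisecond offsets; slots without a value are skipped.
    slots = {
        ts.get("TIME_SLOT_ID"): int(ts.get("TIME_VALUE"))
        for ts in root.iter("TIME_SLOT")
        if ts.get("TIME_VALUE") is not None
    }
    annotations = []
    for tier in root.iter("TIER"):
        if tier.get("TIER_ID") != tier_id:
            continue
        for ann in tier.iter("ALIGNABLE_ANNOTATION"):
            start = slots.get(ann.get("TIME_SLOT_REF1"))
            end = slots.get(ann.get("TIME_SLOT_REF2"))
            value = ann.findtext("ANNOTATION_VALUE", default="")
            annotations.append((start, end, value))
    return annotations


root = ET.fromstring(SAMPLE_EAF)
for start, end, value in read_tier(root, "example-gloss"):
    print(start, end, value)
```

For a downloaded file, `ET.parse(path).getroot()` would replace `ET.fromstring(SAMPLE_EAF)`. Tiers that depend on another tier use reference annotations rather than alignable ones, and this sketch does not handle them.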
Cite as
See the citation notes (Källhänvisning) on the annotation files page, https://www.ling.su.se/teckenspråksresurser/teckenspråkskorpusar/svensk-teckenspråkskorpus/annoteringsfiler, and on the video files page, https://www.ling.su.se/teckenspråksresurser/teckenspråkskorpusar/svensk-teckenspråkskorpus/videofiler
Common tasks used in this corpus
Task | # recordings – open access | # recordings – restricted access | Data available |
---|---|---|---|
Deaf life experiences | 12 | not available | https://ling33.ling.su.se/sslc/video/ (Döv ...) |
Free conversation | 0 | not available | none |
Frog Story | 23 | not available | https://ling33.ling.su.se/sslc/video/ (Var är du, grodan?) |
Mr. Bean | 21 | not available | https://ling33.ling.su.se/sslc/video/ (Film Mr. Bean) |
Present yourself | 22 | not available | https://ling33.ling.su.se/sslc/video/ (Presentation) |
Sign Name | 0 | not available | none |
Snowman Story | 21 | not available | https://ling33.ling.su.se/sslc/video/ (Snögubben) |
Articles mentioned above
- Annika Nonhebel, Onno A. Crasborn, Els van der Kooij (2004). "Sign language transcription conventions for the ECHO Project". Annotation Convention. 11 pp.
- Lars Wallin, Johanna Mesch (2018). "Annoteringskonventioner för teckenspråkstexter". Annotation Convention. 49 pp.
- Trevor Johnston (2010). "From archive to corpus: Transcription and annotation in the creation of signed language corpora". In: International Journal of Corpus Linguistics 15(1). John Benjamins, pp. 106-131. ISSN: 1569-9811. DOI: 10.1075/ijcl.15.1.05joh.
This entry was last modified on 6 January 2023.