Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition
Campr, Pavel | Hrúz, Marek | Trojanová, Jana
- Volume:
- Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008)
- Venue:
- Marrakech, Morocco
- Date:
- 26 May to 1 June 2008
- Pages:
- 3175–3178
- Publisher:
- European Language Resources Association (ELRA)
- License:
- CC BY-NC
- ACL ID:
- L08-1471
- ISBN:
- 978-2-9517408-4-6
Content Categories
- Languages:
- Czech Sign Language
- Corpora:
- UWB-07-SLR-P
Abstract
This paper discusses the design, recording and preprocessing of a Czech sign language corpus. The corpus is intended for training and testing of sign language recognition (SLR) systems. The UWB-07-SLR-P corpus contains video data of 4 signers recorded from 3 different perspectives. Two of the perspectives contain whole body and provide 3D motion data, the third one is focused on signers face and provide data for face expression and lip feature extraction. Each signer performed 378 signs with 5 repetitions. The corpus consists of several types of signs: numbers (35 signs), one and two-handed finger alphabet (64), town names (35) and other signs (244). Each sign is stored in a separate AVI file. In total the corpus consists of 21853 video files in total length of 11.1 hours. Additionally each sign is preprocessed and basic features such as 3D hand and head trajectories are available. The corpus is mainly focused on feature extraction and isolated SLR rather than continuous SLR experiments.Document Download
Paper PDF BibTeX File + Abstract
Cite as
Citation in ACL Citation Format
Pavel Campr, Marek Hrúz, Jana Trojanová. 2008. Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pages 3175–3178, Marrakech, Morocco. European Language Resources Association (ELRA).BibTeX Export
@inproceedings{campr-etal-2008-collection:lrec, author = {Campr, Pavel and Hr{\'u}z, Marek and Trojanov{\'a}, Jana}, title = {Collection and Preprocessing of {C}zech {S}ign {L}anguage Corpus for Sign Language Recognition}, pages = {3175--3178}, editor = {Calzolari, Nicoletta and Choukri, Khalid and Maegaard, Bente and Mariani, Joseph and Odijk, Jan and Piperidis, Stelios and Tapias, Daniel}, booktitle = {6th International Conference on Language Resources and Evaluation ({LREC} 2008)}, publisher = {{European Language Resources Association (ELRA)}}, address = {Marrakech, Morocco}, day = {26}, month = may, year = {2008}, isbn = {978-2-9517408-4-6}, language = {english}, url = {http://www.lrec-conf.org/proceedings/lrec2008/pdf/804_paper.pdf} }