sign-lang@LREC Anthology

Extracting Signs from Weakly Aligned Sign Language Corpora: A Study on LSF and LSM

de la Garza, Lorena | Halbout, Julie | Lascar, Julie | Martinez, Niels | Curiel, Arturo | Gouiffès, Michèle | Braffort, Annelies


Volume:
Proceedings of the LREC2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion
Venue:
Palma, Mallorca, Spain
Date:
16 May 2026
Pages:
174–183
Publisher:
European Language Resources Association (ELRA)
Licence:
CC BY-NC 4.0
sign-lang ID:
26039
ISBN:
978-2-493814-82-1

Abstract

This paper presents a framework for the automatic annotation of sign language data across different recording conditions, including original and interpreted content. The proposed approach integrates weak alignment, sign segmentation, and multiple instance learning with a contrastive loss. The resulting annotations are subsequently refined and filtered to enhance their reliability. Our method was applied to two historically related sign languages, French Sign Language (LSF) and Mexican Sign Language (LSM). This led to the creation of two signaries, comprising approximately 2k categories in LSF (25k occurrences) and 41 categories in LSM (1k occurrences). Both resources provide valuable support for future research in artificial intelligence and linguistics, particularly for comparative analyses between the two languages. A preliminary analysis is presented as part of this paper.

Cite as

Citation in ACL Citation Format

Lorena de la Garza, Julie Halbout, Julie Lascar, Niels Martinez, Arturo Curiel, Michèle Gouiffès, Annelies Braffort. 2026. Extracting Signs from Weakly Aligned Sign Language Corpora: A Study on LSF and LSM. In Proceedings of the LREC2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion, pages 174–183, Palma, Mallorca, Spain. European Language Resources Association (ELRA).

BibTeX Export

@inproceedings{delagarza:26039:sign-lang:lrec,
  author    = {de la Garza, Lorena and Halbout, Julie and Lascar, Julie and Martinez, Niels and Curiel, Arturo and Gouiff{\`e}s, Mich{\`e}le and Braffort, Annelies},
  title     = {Extracting Signs from Weakly Aligned Sign Language Corpora: A Study on {LSF} and {LSM}},
  pages     = {174--183},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Mesch, Johanna and Schulder, Marc},
  booktitle = {Proceedings of the {LREC2026} 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion},
  maintitle = {15th International Conference on Language Resources and Evaluation ({LREC} 2026)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Palma, Mallorca, Spain},
  day       = {16},
  month     = may,
  year      = {2026},
  isbn      = {978-2-493814-82-1},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/26039.html}
}