sign-lang@LREC Anthology

A Machine Learning-based Segmentation Approach for Measuring Similarity between Sign Languages

Jui, Tonni Das ORCID button Jui, Tonni Das | Bejarano, Gissella Maria | Rivas, Pablo


Volume:
Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources
Venue:
Marseille, France
Date:
25 June 2022
Pages:
94–101
Publisher:
European Language Resources Association (ELRA)
License:
CC BY-NC 4.0
sign-lang ID:
22018
ACL ID:
2022.signlang-1.15
ISBN:
979-10-95546-86-3

Content Categories

Languages:
American Sign Language, Auslan, British Sign Language, Indian Sign Language
Corpora:
Auslan Corpus, How2Sign

Abstract

Due to the lack of more variate, native and continuous datasets, sign languages are low-resources languages that can benefit from multilingualism in machine translation. In order to analyze the benefits of approaches like multilingualism, finding the similarity between sign languages can guide better matches and contributions between languages. However, calculating the similarity between sign languages again implies a laborious work to measure how close or distant signs are and their respective contexts. For that reason, we propose to support the similarity measurement between sign languages through a video-segmentation-based machine learning model that will quantify this match among signs of different countries' sign languages. Using a machine learning approach the similarity measurement process can run more smoothly, compared to a more manual approach. We use a pre-trained temporal segmentation model for British Sign Language (BSL). We test it on three datasets, an American Sign Language (ASL) dataset, an Indian Sign Language (ISL), and an Australian Sign Language (AUSLAN) dataset. We hypothesize that the percentage of segmented and recognized signs by this machine learning model can represent the percentage of overlap or similarity between British and the other three sign languages. In our ongoing work, we evaluate three metrics considering Swadesh's and Woodward's list and their synonyms. We found that our intermediate-strict metric coincides with a more classical analysis of the similarity between British and American Sign Language, as well as with the classical low measurement between Indian and British sign languages. On the other hand, our similarity measurement between British and Australian Sign language just holds for part of the Australian Sign Language and not the whole data sample.

Video Presentation

Languages:
English, International Sign
Subtitle:
English

Document Download

Paper PDF Poster BibTeX File+ Abstract

BibTeX Export

@inproceedings{jui:22018:sign-lang:lrec,
  author    = {Jui, Tonni Das and Bejarano, Gissella Maria and Rivas, Pablo},
  title     = {A Machine Learning-based Segmentation Approach for Measuring Similarity between Sign Languages},
  pages     = {94--101},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna and Schulder, Marc},
  booktitle = {Proceedings of the {LREC2022} 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources},
  maintitle = {13th International Conference on Language Resources and Evaluation ({LREC} 2022)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {25},
  month     = jun,
  year      = {2022},
  isbn      = {979-10-95546-86-3},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/22018.pdf}
}
Something missing or wrong?