sign-lang@LREC Anthology

Measuring Lexical Similarity across Sign Languages in Global Signbank

Börstell, Carl ORCID button Börstell, Carl | Crasborn, Onno ORCID button Crasborn, Onno | Whynot, Lori


Volume:
Proceedings of the LREC2020 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives
Venue:
Marseille, France
Date:
16 May 2020
Pages:
21–26
Publisher:
European Language Resources Association (ELRA)
License:
CC BY-NC 4.0
sign-lang ID:
20011
ACL ID:
2020.signlang-1.4
ISBN:
979-10-95546-54-2

Content Categories

Languages:
Chinese Sign Language, International Sign, Sign Language of the Netherlands
Lexical Databases:
Global Signbank - CSL, Global Signbank - IS, Global Signbank - NGT

Abstract

Lexicostatistics is the main method used in previous work measuring linguistic distances between sign languages. As a method, it disregards any possible structural/grammatical similarity, instead focusing exclusively on lexical items, but it is time consuming as it requires some comparable phonological coding (i.e. form description) as well as concept matching (i.e. meaning description) of signs across the sign languages to be compared. In this paper, we present a novel approach for measuring lexical similarity across any two sign languages using the Global Signbank platform, a lexical database of uniformly coded signs. The method involves a feature-by-feature comparison of all matched phonological features. This method can be used in two distinct ways: 1) automatically comparing the amount of lexical overlap between two sign languages (with a more detailed feature-description than previous lexicostatistical methods); 2) finding exact form-matches across languages that are either matched or mismatched in meaning (i.e. true or false friends). We show the feasability of this method by comparing three languages (datasets) in Global Signbank, and are currently expanding both the size of these three as well as the total number of datasets.

Keywords

Document Download

Paper PDF BibTeX File+ Abstract

BibTeX Export

@inproceedings{borstell:20011:sign-lang:lrec,
  author    = {B{\"o}rstell, Carl and Crasborn, Onno and Whynot, Lori},
  title     = {Measuring Lexical Similarity across Sign Languages in {Global} {Signbank}},
  pages     = {21--26},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna},
  booktitle = {Proceedings of the {LREC2020} 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives},
  maintitle = {12th International Conference on Language Resources and Evaluation ({LREC} 2020)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {16},
  month     = may,
  year      = {2020},
  isbn      = {979-10-95546-54-2},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/20011.pdf}
}
Something missing or wrong?