sign-lang@LREC Anthology


Collocations in Sign Language Lexicography: Towards Semantic Abstractions for Word Sense Discrimination

Langer, Gabriele | Schulder, Marc


Volume:
Proceedings of the LREC2020 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives
Venue:
Marseille, France
Date:
16 May 2020
Pages:
127–134
Publisher:
European Language Resources Association (ELRA)
License:
CC BY-NC 4.0
sign-lang ID:
20017
ACL ID:
2020.signlang-1.21
ISBN:
979-10-95546-54-2

Content Categories

Projects:
DGS Corpus project
Languages:
German Sign Language
Corpora:
DGS Corpus
Dictionaries:
DW-DGS
Lexical Databases:
DGS Corpus types list

Abstract

In general monolingual lexicography a corpus-based approach to word sense discrimination (WSD) is the current standard. Automatically generated lexical profiles such as Word Sketches provide an overview on typical uses in the form of collocate lists grouped by their part of speech categories and their syntactic dependency relations to the base item. Collocates are sorted by their typicality according to frequency-based rankings. With the advancement of sign language (SL) corpora, SL lexicography can finally be based on actual language use as reflected in corpus data. In order to use such data effectively and gain new insights on sign usage, automatically generated collocation profiles need to be developed under the special conditions and circumstances of the SL data available. One of these conditions is that many of the prerequesites for the automatic syntactic parsing of corpora are not yet available for SL. In this article we describe a collocation summary generated from DGS Corpus data which is used for WSD as well as in entry-writing. The summary works based on the glosses used for lemmatisation. In addition, we explore how other resources can be utilised to add an additional layer of semantic grouping to the collocation analysis. For this experimental approach we use glosses, concepts, and wordnet supersenses.

Keywords

Document Download

Paper PDF BibTeX File BibTeX File w/Abstract

BibTeX Export

@inproceedings{langer:20017:sign-lang:lrec,
  author    = {Langer, Gabriele and Schulder, Marc},
  title     = {Collocations in Sign Language Lexicography: Towards Semantic Abstractions for Word Sense Discrimination},
  pages     = {127--134},
  editor    = {Efthimiou, Eleni and Fotinea, Stavroula-Evita and Hanke, Thomas and Hochgesang, Julie A. and Kristoffersen, Jette and Mesch, Johanna},
  booktitle = {Proceedings of the {LREC2020} 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives},
  maintitle = {12th International Conference on Language Resources and Evaluation ({LREC} 2020)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Marseille, France},
  day       = {16},
  month     = may,
  year      = {2020},
  isbn      = {979-10-95546-54-2},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/20017.pdf}
}
Something missing or wrong?