Automated Extraction of Prosodic Structure from Unannotated Sign Language Video

Sevilla, Antonio F. G.; Lahoz-Bengoechea, José María; Díaz Esteban, Alberto

Volume:: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Venue:: Torino, Italy
Date:: 20 to 25 May 2024
Pages:: 1808–1816
Publisher:: ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL)
Licence:: CC BY-NC 4.0
ACL ID:: 2024.lrec-main.161
ISBN:: 978-2-493814-10-4

Content Categories

Projects:: CANTOR, Signario LSE, VISSE
Corpora:: Spreadthesign
Lexical Databases:: ASL Signbank, BSL SignBank, Signario LSE

Abstract

As in oral phonology, prosody is an important carrier of linguistic information in sign languages. One of the most prominent ways it manifests is in the temporal structure of signs: their rhythm and intensity of articulation. To observe these effects empirically, the velocity of the hands can be computed throughout the execution of a sign. In this article, we propose a method for extracting this information from unlabeled videos of sign language, exploiting CoTracker, a recent advance in computer vision that can track every point in a video without the need for any calibration or fine-tuning. The dominant hand is identified by clustering the computed point velocities, and its dynamic profile is plotted to make the prosodic structure of signing apparent. We apply our method to different datasets and sign languages and perform a preliminary visual exploration of the results. This exploration supports the usefulness of our methodology for linguistic analysis, though open issues remain, such as the handling of bi-manual signs and a formal, numerical evaluation of accuracy. Nonetheless, the absence of any preprocessing requirements may make it useful for other researchers and datasets.
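
The pipeline outlined in the abstract (per-point velocities, clustering to isolate the moving hand, then a speed profile over time) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that point trajectories have already been obtained from a tracker such as CoTracker and are available as plain per-frame lists of (x, y) coordinates. The function names, the synthetic data, and the use of a simple one-dimensional two-means split on mean point speed are all assumptions made for illustration; the paper's actual clustering procedure may differ.

```python
import math

def point_speeds(tracks, fps):
    """tracks[t][n] is the (x, y) position of point n at frame t.
    Returns speeds[t][n] in pixels per second for each pair of consecutive frames."""
    return [
        [math.dist(p, q) * fps for p, q in zip(prev, curr)]
        for prev, curr in zip(tracks, tracks[1:])
    ]

def two_means_1d(values, iters=50):
    """Split scalar values into a low and a high cluster (hypothetical helper).
    Returns a list of booleans, True where the value falls in the high cluster."""
    lo, hi = min(values), max(values)
    labels = [False] * len(values)
    for _ in range(iters):
        labels = [abs(v - hi) < abs(v - lo) for v in values]
        high = [v for v, is_high in zip(values, labels) if is_high]
        low = [v for v, is_high in zip(values, labels) if not is_high]
        new_lo = sum(low) / len(low) if low else lo
        new_hi = sum(high) / len(high) if high else hi
        if (new_lo, new_hi) == (lo, hi):
            break
        lo, hi = new_lo, new_hi
    return labels

def dominant_speed_profile(tracks, fps):
    """Pick the fast-moving points and average their speed per frame.
    Assumption: the fastest cluster corresponds to the dominant hand."""
    speeds = point_speeds(tracks, fps)
    n_points = len(tracks[0])
    mean_speed = [
        sum(frame[n] for frame in speeds) / len(speeds) for n in range(n_points)
    ]
    is_fast = two_means_1d(mean_speed)
    # Fall back to all points if nothing stands out as moving.
    moving = [n for n, fast in enumerate(is_fast) if fast] or list(range(n_points))
    profile = [sum(frame[n] for n in moving) / len(moving) for frame in speeds]
    return moving, profile

# Synthetic example: three static points and two points that accelerate,
# then decelerate, along the x axis (a single bell-shaped stroke).
offsets = [0, 1, 3, 6, 10, 15, 20, 24, 27, 29, 30]
tracks = [
    [(0, 0), (10, 0), (20, 0), (100 + dx, 50), (100 + dx, 60)]
    for dx in offsets
]
moving, profile = dominant_speed_profile(tracks, fps=10)
print(moving)
print(profile)
```

On the synthetic data, the two moving points are selected and the profile rises and falls once, which is the kind of single-stroke shape a plot of hand speed would make visible. Real tracker output would be noisier and would likely need smoothing before the profile is interpreted.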

Cite as

Citation in ACL Citation Format

Antonio F. G. Sevilla, José María Lahoz-Bengoechea, Alberto Díaz Esteban. 2024. Automated Extraction of Prosodic Structure from Unannotated Sign Language Video. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 1808–1816, Torino, Italy. ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL).

BibTeX Export

@inproceedings{sevilla-etal-2024-prosodic:lrec,
  author    = {Sevilla, Antonio F. G. and Lahoz-Bengoechea, Jos{\'e} Mar{\'i}a and D{\'i}az Esteban, Alberto},
  title     = {Automated Extraction of Prosodic Structure from Unannotated Sign Language Video},
  pages     = {1808--1816},
  editor    = {Calzolari, Nicoletta and Kan, Min-Yen and Hoste, Veronique and Lenci, Alessandro and Sakti, Sakriani and Xue, Nianwen},
  booktitle = {2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation ({LREC-COLING} 2024)},
  publisher = {{ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL)}},
  address   = {Torino, Italy},
  day       = {20--25},
  month     = may,
  year      = {2024},
  isbn      = {978-2-493814-10-4},
  language  = {english},
  url       = {https://aclanthology.org/2024.lrec-main.161}
}