sign-lang@LREC Anthology

MobileASL: Overcoming the technical challenges of mobile video conversation in sign language

Cavender, Anna | Cherniavsky, Neva | Chon, Jaehong | Ladner, Richard | Riskin, Eve | Vanam, Rahul | Wobbrock, Jacob


Volume:
Proceedings of the LREC2010 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language Technologies
Venue:
Valletta, Malta
Date:
22 and 23 May 2010
Pages:
45–48
Publisher:
European Language Resources Association (ELRA)
License:
CC BY-NC
sign-lang ID:
10017

Content Categories

Projects:
MobileASL
Languages:
American Sign Language

Abstract

As part of the ongoing MobileASL project, we have built a system to compress, transmit, and decode sign language video in real-time on an off-the-shelf mobile phone. In this work, we review the challenges that arose in developing our system and the algorithms we implemented to address them. Separate parts of this research have been previously published.
Compression and transmission of sign language video presents unique difficulties. We must overcome weak processing power, limited bandwidth capacity, and low battery life. We also must ensure that the system is usable; that is, that the video is intelligible and the algorithms that we employ to save system resources do not irritate users.
We describe the evolution of the MobileASL system and the algorithms we utilize to achieve real-time video communication on mobile phones. We first review our initial user studies to test feasibility and interest in video sign language on mobile phones. We then detail our three main challenges and solutions. To address weak processing power, we optimize the encoder to work on mobile phones, adapting a fast algorithm for distortion-complexity optimization to choose the best parameters. To overcome limited bandwidth capacity, we utilize a dynamic skin-based region of interest, which encodes the face and hands at a higher bit rate at the expense of the rest of the image. To save battery life, we automatically detect periods of signing and lower the frame rate when the user is not signing.
We implement our system on off-the-shelf mobile phones and validate it through a user study. Fluent ASL signers participate in unconstrained conversations over the phones in a laboratory setting. They find the conversations with the dynamic skin-based region of interest more intelligible. The variable frame rate affects conversations negatively, but does not affect the users' perceived desire for the technology.
Ongoing work includes varying the spatial resolution instead of the temporal resolution, further optimization of rate-distortion-complexity, and a field study to determine usability over a long period of time in a realistic setting.

Document Download

Paper PDF BibTeX File+ Abstract

BibTeX Export

@inproceedings{cavender:10017:sign-lang:lrec,
  author    = {Cavender, Anna and Cherniavsky, Neva and Chon, Jaehong and Ladner, Richard and Riskin, Eve and Vanam, Rahul and Wobbrock, Jacob},
  title     = {{MobileASL}: Overcoming the technical challenges of mobile video conversation in sign language},
  pages     = {45--48},
  editor    = {Dreuw, Philippe and Efthimiou, Eleni and Hanke, Thomas and Johnston, Trevor and Mart{\'i}nez Ruiz, Gregorio and Schembri, Adam},
  booktitle = {Proceedings of the {LREC2010} 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language Technologies},
  maintitle = {7th International Conference on Language Resources and Evaluation ({LREC} 2010)},
  publisher = {{European Language Resources Association (ELRA)}},
  address   = {Valletta, Malta},
  day       = {22--23},
  month     = may,
  year      = {2010},
  language  = {english},
  url       = {https://www.sign-lang.uni-hamburg.de/lrec/pub/10017.pdf}
}
Something missing or wrong?