Real-time American Sign Language recognition via temporal contrast emulation.
Millions of people rely on sign language to communicate, yet real-time translation tools typically demand expensive hardware or high-end GPUs. TCASL addresses this by mimicking Dynamic Vision Sensors (DVS) in software, using temporal contrast emulation to isolate hand motion from standard webcam video and discard static background noise. The resulting sparse event frames are classified by a Spiking Deep Neural Network (SDNN), enabling real-time ASL finger-spelling recognition on consumer hardware.
A gamified finger-spelling application (Spelling Bee format) that uses your webcam to validate ASL gestures in real time across three difficulty levels.
For setup and usage, see the TCASLApp README.
A lightweight Python library for integrating temporal contrast emulation and SDNN inference into your own projects.
For setup and usage, see the TCASLCore README.
To access our custom dataset, see our HuggingFace page for instructions!
- Lighting: Temporal contrast emulation detects pixel-level intensity changes, so flickering lights or moving shadows can generate false events and degrade accuracy.
- Hand speed: Signing too slowly produces an event frame too sparse to classify reliably; signing too quickly causes motion blur.
- Dynamic gestures (J and Z): The current pipeline classifies static frames, so only the final hand position of motion-based letters is captured.
