Isharah Dataset

🧾 About

Isharah is a large-scale dataset for Continuous Saudi Sign Language (SSL) recognition and translation. It features over 30,000 video samples signed by deaf and hearing-impaired individuals using smartphones in varied settings.

The dataset supports both Continuous Sign Language Recognition (CSLR) and Sign Language Translation (SLT), and includes Sentence-level gloss annotations and Corresponding Arabic translations. Three benchmark subsets are included: Isharah-500, Isharah-1000, and Isharah-2000.

📄 Citation

If you use Isharah in your work, please cite:

@ARTICLE{11397217,
  author={Alyami, Sarah and Luqman, Hamzah and Al-Azani, Sadam and Alowaifeer, Maad and Alharbi, Yazeed and Alonaizan, Yaser},
  journal={IEEE Transactions on Multimedia}, 
  title={Isharah: A Large-Scale Multi-Scene Dataset for Continuous Sign Language Recognition}, 
  year={2026},
  volume={},
  number={},
  pages={1-9},
  keywords={Sign Language Recognition;Continuous Sign Language Recognition;Sign Language Translation;Arabic Sign Language;Sign Language Dataset},
  doi={10.1109/TMM.2026.3664959}}

Isharah Continuous Sign Language Recognition and Translation Dataset

🧾 About

🔗 Download

📄 Citation