ASR System for Patient Symptoms

Tags: Healthcare AI · Speech AI

ASR system for understanding medical symptoms spoken by patients in Bengali. Trained DeepSpeech from scratch on audio collected via a consented data collection portal, then finetuned for noisy environments using 13 domain augmentations. Switched to a Whisper (tiny) model finetuned on the BanglaASR corpus (Bangla Mozilla Common Voice), achieving a WER of only 8%—enabled by the limited vocabulary of symptom terms.

Slides · Live demo on Hugging Face