START Conference Manager    

VocaliD: Personalizing Text-to-Speech Synthesis for Individuals with Severe Speech Impairment

Camil Jreige, Rupal Patel and Timothy Bunnell

Eleventh International ACM SIGACCESS Conference on Computers and Accessibility - Posters and Demos (ASSETS 2009)
Pittsburgh, PA, USA, October 26-28, 2009


Summary

Speech synthesis options on assistive communication devices are very limited and do not reflect the user’s vocal quality or personality. Previous work suggests that speakers with severe speech impairment can control prosodic aspects of their voice, and often retain the ability to produce sustained vowel-like utterances. This project leverages these residual phonatory abilities in order to build an adaptive text-to-speech synthesizer that is intelligible, yet conveys the user’s vocal identity. Our VocaliD system combines the source characteristics of the disordered speaker with the filter characteristics of an age-matched healthy speaker using voice transformation techniques, in order to produce a personalized voice. Usability testing indicated that the transformed voices were more than 94% intelligible, and that speaker identification accuracy was 79.5%.