Speaker: Dr. Carol Espy-Wilson
Title:Knowledge-based Automatic Speech Recognition
Date:February 14, 2003
Time:3:00 pm
Location:GCATT Room 325
Abstract:

The development of an automatic event-based speech recognition system (EBS) that relies heavily on acoustic phonetics (to guide the recognition process and to extract relevant information) and combines a phonetic-feature hierarchy with a uniform statistical framework (at present, Support Vector Machines) to provide adaptability and flexibility is currently underway. This recognition framework allows for easy assessment and distinction of the performances of the acoustic parameters versus that of the pattern recognizer. The overall structure of EBS involves (1) landmark detection based on acoustic parameters that are related to the source and manner-of-articulation phonetic features and (2) use of the manner landmarks in the extraction of other acoustic parameters related to the place-of-articulation phonetic features. This talk will focus on the development of the acoustic parameters and the need for relative parameters for speaker independence, multi-time-scale processing to capture the dynamics of phonetic segments and extensive evaluation of the parameters to hone in on direct measures of the relevant acoustic properties.

Biography:

Dr. Carol Espy-Wilson's research interests include the integration of engineering, linguistics and speech science to study speech communication. She is developing an approach to speech recognition based on phonetic features to address the limitations of present recognizers (e.g., speaker dependence). She also conducts research in the areas of speech production and speech enhancement. A major focus of her research is to gain a better understanding of the relationship between articulation, acoustics and perception. This knowledge is important for speech recognition, speech synthesis and the study of speech impairments. Professor Espy-Wilson heads the Speech Communication Lab where postdoctoral, graduate and undergraduate students perform research. Funding for this research is provided by NIH and NSF.

Slides: