Auditory-Visual Speech Recognition Laboratory
Bethesda, MD 20889-5600
(301) 919-2957
Recent Publications and Submissions (1990 - )
Deng, L., Lennig, M., Seitz, P.F., and Mermelstein, P. (1990). "Large vocabulary word recognition using context-dependent allophonic hidden Markov models," Computer Speech and Language 4, 345-358.

Seitz, P.F., Gupta, V.N., Lennig, M., Kenny, P., Deng, L., O'Shaughnessy, D., and Mermelstein, P. (1990). "A dictionary for a very large vocabulary word recognition system," Computer Speech and Language 4, 193-202.

Seitz, P.F., McCormick, M., Watson, I., and Bladon, R.A. (1990). "Relational spectral features for place of articulation in nasal consonants," J. Acoust. Soc. Am. 87, 351-358.

Deng, L., Kenny, P., Lennig, M., Gupta, V., Seitz, P., and Mermelstein, P. (1991). "Phonemic hidden Markov models with continuous mixture output densities for large vocabulary speech recognition," IEEE Transactions on Signal Processing 39, 1677-1681.

Grant, K.W., and Braida, L.D. (1991). "Evaluating the Articulation Index for audiovisual input," J. Acoust. Soc. Am. 89, 2952-2960.

Grant, K.W., Braida, L.D., and Renn, R.J. (1991). "Single-band amplitude envelope cues as an aid to speechreading," Quarterly J. Exp. Psych. 43, 621-645.

Walden, B.E., and Grant, K.W. (1993). "Research needs in rehabilitative audiology," in J.G. Alpiner and P.A. McCarthy (Eds.), Rehabilitative Audiology: Children and Adults, Williams and Wilkins, Baltimore, MD (pp. 500-528).

Braida, L.D., Zurek, P.M., Grant, K.W., Greenberg, J.E., and Rankovic, C.M. (1993). "Current research in hearing aids at M.I.T.," presented at the International Symposium on Hearing Aids and Speech Training for the Hearing Impaired, Osaka, Japan, July 16-17, 1991.

Grant, K.W., Braida, L.D., and Renn, R.J. (1994). "Auditory supplements to speechreading: Combining amplitude envelope cues from different spectral regions of speech," J. Acoust. Soc. Am. 95, 1065-1073.

Grant, K.W., and Walden, B.E. (1995). "Predicting auditory-visual speech recognition in hearing-impaired listeners," presented at the XIIIth International Congress of Phonetic Sciences, Stockholm, Sweden, August 13-19, 1995, Vol. 3, 122-129.

Grant, K.W., and Walden, B.E. (1996). "The spectral distribution of prosodic information," J. Speech Hear. Res. 39, 228-238.

Grant, K.W., and Walden, B.E. (1996). "Evaluating the articulation index for auditory-visual consonant recognition," J. Acoust. Soc. Am. 100, 2415-2424.

Rakerd, B., Seitz, P.F., and Whearty, M. (1996). "A dual-task approach to assessing cognitive demands of speech listening for people with hearing losses," Ear and Hearing 17, 97-108.

Seitz, P.F., and Lennig, M. (1996). "Phonological rule set complexity in a very large vocabulary word recognition system," In G. Guy, C. Feagin, D. Schiffrin, and J. Baugh (Eds.), Towards a social science of language (pp. 289-307). The Hague: John Benjamins.

Seitz, P.F., and Rakerd, B. (1997). "Auditory stimulus intensity and reaction time in listeners with longstanding sensorineural hearing loss," Ear and Hearing 18, 502-512.

Grant, K.W., Walden, B.E., and Seitz, P.F. (1998). "Auditory-visual speech recognition by hearing-impaired subjects: Consonant recognition, sentence recognition, and auditory-visual integration," J. Acoust. Soc. Am. 103, 2677-2690.

Grant, K.W., Summers, V., and Leek, M.R. (1998). "Modulation rate detection and discrimination by normal-hearing and hearing-impaired listeners," J. Acoust. Soc. Am. 104, 1051-1060.

Grant, K.W., and Seitz, P.F. (1998). "Measures of auditory-visual integration in nonsense syllables and sentences," J. Acoust. Soc. Am. 104, 2438-2450.

Grant, K.W., and Seitz, P.F. (2000a). "The recognition of isolated words and words in sentences: Individual variability in the use of sentence context," J. Acoust. Soc. Am. 107, 1000-1011.

Grant, K.W., and Seitz, P.F. (2000b). "The use of visible speech cues for improving auditory detection of spoken sentences," J. Acoust. Soc. Am. 108, 1197-1208.

Beamer, S.L., Grant, K.W., and Walden, B.E. (2000). "Hearing aid benefit for patients with high frequency hearing impairment," J. Am. Acad. Audiol. 11, 429-437.

Grant, K.W. (2001). "The effect of speechreading on masked detection thresholds for filtered speech," J. Acoust. Soc. Am. 109, 2272-2275.

Walden, B.E., Grant, K.W., and Cord, M.T. (2001). "Effects of amplification and speechreading on consonant recognition in persons with impaired hearing," Ear and Hearing 22, 333-341.

Grant, K.W., and Greenberg, S. (2001). "Speech Intelligibility Derived From Asynchronous Processing of Auditory-Visual Information," Proc. AVSP 2001 International Conference on Auditory-Visual Speech Processing, Scheelsminde, Denmark, 132-137.

Grant, K.W. (2002). "Measures of auditory-visual integration for speech understanding: A theoretical perspective," J. Acoust. Soc. Am. 112, 30-33.

Grant, K.W. (2003). "Auditory supplements to speechreading," Institute of Electronics, Information and Communication Engineers (IEICE) - Technical Report, SP2003-46, 15-19.

Grant, K.W., Greenberg, S., Poeppel, D., and van Wassenhove, V. (2004). "Effects of spectro-temporal asynchrony in auditory and auditory-visual speech processing," Seminars in Hearing 25, 241-255.

Grant, K.W., van Wassenhove, V., and Poeppel, D. (2004). "Detection of auditory (cross-spectral) and auditory-visual (cross-modal) synchrony," Speech Communication. Special Issue on Audio Visual Speech Processing, J.-L. Schwartz, F. Berthommier, M.-A. Cathiard and R. de Mori (Eds.), 44(1-4), 43-53.

Walden, B.E., Surr, R.K., Cord, M.T., Grant, K.W., and Dyrlund, O. (2004). "Effect of signal-to-noise ratio on microphone preferences and benefit," Proceedings of the 49th International Congress of Hearing Aid Acousticians (CD) - Frankfurt am Main, 60-63.

Van Wassenhove, V., Grant, K.W., and Poeppel, D. (2005). "Visual speech speeds up the neural processing of auditory speech," Proceedings of the National Academy of Sciences (PNAS) 102, 1181-1186.

Walden, B.E., Surr, R.K., Grant, K.W., Summers, V., Cord, M.T., and Dyrlund, O. (2005). "Effect of Signal-to-Noise Ratio on Directional Microphone Benefit and Preference," J. Am. Acad. Audiol. 16, 662-676.

Van Wassenhove, V., Grant, K.W., and Poeppel, D. (2007). "Temporal window of integration in auditory-visual speech perception," Neuropsychologia 45, 598-607.

Grant, K.W., Tufts, J.B., and Greenberg, S. (2007). "Integration efficiency for speech perception within and across sensory modalities," J. Acoust. Soc. Am. 121, 1164-1176.

Grant, K.W., Elhilali, M., Shamma, S.A., Walden, B.E., Surr, R.K., Cord, M.T., and Summers, V. (2008). "An Objective Measure for Selecting Microphone Modes in OMNI/DIR Hearing-Aid Circuits," Ear and Hearing 29, 199-213.

Summers, V., Grant, K.W., Walden, B.E., Cord, M.T., Surr, R.K., and Elhilali, M. (in press). "Evaluation of a 'Direct-Comparison' Approach to Automatic Switching in Omnidirectional/Directional Hearing Aids," J. Am. Acad. Audiol.

Grant, K.W., and Seitz, P.F. (in revision). "Time course of phonetic information for auditory, visual, and auditory-visual spoken words by hearing-impaired listeners," J. Acoust. Soc. Am.

Grant, K.W., and Bernstein, J.G.W. (submitted). "Auditory and auditory-visual frequency-band importance functions for consonant recognition," J. Acoust. Soc. Am.