Published in: Journal of the Association for Research in Otolaryngology 2/2007

01-06-2007

Visually-guided Attention Enhances Target Identification in a Complex Auditory Scene

Authors: Virginia Best, Erol J. Ozmeral, Barbara G. Shinn-Cunningham

Abstract

In auditory scenes containing many similar sound sources, sorting of acoustic information into streams becomes difficult, which can lead to disruptions in the identification of behaviorally relevant targets. This study investigated the benefit of providing simple visual cues for when and/or where a target would occur in a complex acoustic mixture. Importantly, the visual cues provided no information about the target content. In separate experiments, human subjects either identified learned birdsongs in the presence of a chorus of unlearned songs or recalled strings of spoken digits in the presence of speech maskers. A visual cue indicating which loudspeaker (from an array of five) would contain the target improved accuracy for both kinds of stimuli. A cue indicating which time segment (out of a possible five) would contain the target also improved accuracy, but much more for birdsong than for speech. These results suggest that in real-world situations, information about where a target of interest is located can enhance its identification, while information about when to listen can also be helpful when targets are unfamiliar or extremely similar to their competitors.
Metadata
Publisher: Springer-Verlag
Print ISSN: 1525-3961
Electronic ISSN: 1438-7573
DOI: https://doi.org/10.1007/s10162-007-0073-z
