A Fast Note about Slowed Temporal Processing in Older Adults

This page as PDF

Kathy Pichora-Fuller's column “What’s new about getting older?” delves into all aspects of ageing and hearing ranging from health policy developments to neurophysical research on the aging auditory brain.

Some comments of older listeners about their experiences of hearing problems that we need to think about more:

 “Most TV or radio speakers are too fast and while I am trying to make sense of the first statement they are away onto the third or fourth sentence so I soon have to drop out and so lose interest.”

“For me, distinct enunciation helps greatly, in completely sounding each word, the speaker goes more slowly and therefore gives the recipient time to assimilate and adapt the sounds to meaning.”

“When asking for repetition of statements it seems to be a way of giving the brain cells time to put sounds into meaning.”

“Using a photographic term, I have a very shallow depth of field and sounds are soon out of focus.

Why do older adults with (near-)normal audiograms describe speech as unclear and too fast?

Audibility:

It is well known that the prevalence of audiometric hearing loss increases markedly as adults age. The hallmark of age-related hearing loss (ARHL) is elevated audiometric thresholds in the high-frequencies. Reduced audibility of high-frequency speech cues can result in difficulties understanding speech because of misperceptions, including errors resulting from confusions among low-energy voiceless consonants (e.g., p-t-k; f-th). Amplification provides a good solution to problems in speech understanding that are attributable to reduced audibility. Nevertheless, temporal processing problems may continue to contribute to difficulties understanding speech in noise even when amplification has been provided (Humes, 2007).

For those younger than 65 years of age, the number who self-report hearing loss is greater than the number who have impaired hearing based on their audiometric thresholds (Bainbridge & Wallhagen, 2014). Despite having (near-)normal audiometric thresholds and little difficulty in quiet listening conditions, these older adults frequently complain about difficulties understanding speech in noisy everyday sitatuions. Many who are seeking hearing help from audiologists for the first time are told that they are not yet candidates for hearing aids because their audiograms are “too good”. More needs to be done to help them address listening difficulties that likely involve declines in supra-threshold auditory temporal processing.

Supra-threshold Auditory Temporal Processing:

Many auditory spectral cues are important for speech perception. Spectral patterns, as displayed in spectrograms, correspond to speech cues such as vowel formants, consonant transitions, or consonant frication noise. In addition to spectral cues, many different auditory temporal cues serve speech perception (Phillips, 1995; Pichora-Fuller & Souza, 2003). Different temporal cues are involved in the perception of speech cues at various linguistic levels. Periodicity coding of voice fundamental frequency and harmonic cues enable voice identification (e.g., cues used to differentiate female from male voices). The period (duration of one cycle of the fundamental frequency) of adult voices is about 5 ms for males and about 10 ms for females. Gaps in speech provide a phonemic cue that signals the presence of stop consonants that are necessary to differentiate words (e.g., split vs. slit; catch vs. cash). Word-level phonemic contrasts based on gaps are in the range from about 20 ms in fast speech to about 40 ms in slow speech. Sentence-level speech envelope patterns, such as changes in pitch and rate, provide prosodic cues to syntax (e.g., You like that? vs. You like that!), with typical syllable durations up to around 250 ms. In contrast to spectral cues, temporal cues are more robust over a wider range of signal levels from just above threshold up to uncomfortable levels of speech. Speech intelligibility is driven mostly by spectral cues in the higher frequencies (above 1500 Hz) whereas it is driven mostly by temporal cues in lower frequencies (below 1500 Hz) inputs. Spectral cues become less helpful but temporal cues become especially helpful when listeners must segregate multiple simultaneous voices during listening in noise.

Importantly, there are different types of ARHL. The sensory type (resulting from outer hair cell damage in the cochlea) and the metabolic type (resulting from changes in the stria vascularis of the cochlea that affect endo-cochlear potentials) or a combination of sensory and metabolic ARHL are characterized by elevated audiometric thresholds at high-frequencies (Dubno et al., 2013). In contrast, neural ARHL may result in declines in temporal processing in the absence of elevations in audiometric thresholds (Liberman & Kujawa, 2017). About half of 60-year old men and half of 70-year-old women have (near-)normal audiometric thresholds that are no more than 25 dB HL at frequencies up to and including 3 kHz (ISO 7029:2017). Compared to younger adults, most of these older adults with (near-)normal audiograms perform more poorly on tasks using non-speech or speech stimuli to measure temporal processing.  

Compared to younger adults, older adults have poorer pure-tone frequency difference limens and they benefit less from differences in voice fundamental frequency when identifying vowels spoken concurrently by two talkers (e.g., Vongpaisal et al., 2007) or when identifying words in sentences spoken by a talker when the voice of a competing talker is the same as or different from the target talker (Besser et al., 2015). To detect the presence of a gap, older adults need longer gaps between two tones of the same frequency and they also need longer gaps between a noise and a tone (Pichora-Fuller et al., 2005). Similarly, for speech signals, older adults need longer gap durations than younger adults to detect phonemically significant gaps between two identical vowels (u_u heard as upu) or between a fricative and a vowel (s_u heard as spoo). The minimum gap needed to differentiate words, with or without a stop consonant (e.g., spoon vs. soon), is larger for older than for younger listeners, with both age groups needing a smaller gap in fast speech than in slow speech (Haubert & Pichora-Fuller, 1999). An experiment measuring the signal-to-noise ratio (SNR) threshold for word recognition in three-word nonsense sentences illustrates the abilities of younger and older listeners to segregate target speech from various maskers depending on the presence of competing voice (Ezzatian et al., 2015). Not surprisingly, across all masking conditions, younger listeners recognized words at more challenging SNRs than older listeners. Both younger and older listeners had the least difficulty when the masker was steady-state noise (i.e., when the masker provided no competing temporal speech cues). Both age groups had the most difficulty recognizing key words in the three-word nonsense sentences when the masker was intact two-talker speech. Notably, when the masker was intact two-talker speech, the performance of both groups was worse on the first than the third word of the target non-sense sentence such that, as the sentence unfolded in time, both age group improved in segregating the speech of the target talker from the competing two-talker masker (masker had both competing voice and envelope cues). For older adults, but not younger adults, stream segregation was also improved from the first to the third word in the nonsense sentences when 16-band vocoding was used to distort the masker (masker voice cues were impoverished but competing envelope cues remained), suggesting that younger listeners could rapidly segregate the target from the masker when there were competing envelope cues, but the presence of competing envelope cues slowed older adults in segregating the target from the masker.

Conclusion

In summary, compared to younger adults, most older adults with (near-)normal audiograms have multiple declines in auditory temporal processing that may undermine speech understanding in noise. Older adults are less able to use periodicity cues to differentiate voices; they need longer gaps to detect phonemic contrasts; they are slower in segregating speech and maskers with competing voice and/or speech envelope cues. Such age-related declines in temporal processing likely explain their common complaints of difficulty understanding speech and self-reported hearing problems that are not predictable from audiometric thresholds. Age-related declines in temporal processing can also undermine music perception (Russo et al., 2012; Pichora-Fuller, 2020) and the identification of vocal emotion (Dupus & Pichora-Fuller, 2015; Goy et al., 2018). Furthermore, reduced supra-threshold auditory temporal processing may diminish cues that facilitate cognitive processes involved in comprehension, memory and attention (Pichora-Fuller et al., 2017). Audiologists could use existing assessment tools such as the LiSN-S test (Cameron et al., 2011) to gain insight into whether or not listeners benefit from voice differences between target and competing voices (e.g., Besser et al. 2015). They could also use questionnaires such as the SSQ (Gatehouse & Noble, 2004; Banh et al., 2012). Most importantly, audiologists should discuss the everyday listening needs of older adults that could be addressed by using assistive technologies or environmental modifications or communication strategies to improve communication accessibility in challenging listening situations, even if their audiograms seem to be “too good” for them to be candidates for hearing aids (Humes et al., 2020).


References

  1. Banh, J., Singh, G., & Pichora-Fuller, M.K. (2012). Age affects responses on the Speech, Spatial, and Qualities of Hearing Scale (SSQ) for adults with minimal audiometric loss. Journal of the American Academy of Audiology, 23, 81-91. https://doi.org/10.3766/jaaa.23.2.2
  2. Bainbridge, K. E., & Wallhagen, M. I. (2014). Hearing loss in an aging American population: Extent, Impact, and Management. The Annual Review of Public Health, 35, 139-52. https://doi.org/10.1146/annurev-publhealth-032013-182510
  3. Besser, J., Festen, J.M., Goverts, S.T., Kramer, S.E., & Pichora-Fuller, M.K. (2015). Speech-in-speech listening on the LiSN-S test by older adults with good audiograms depends on cognition and hearing acuity at high frequencies. Ear and Hearing, 36, 24-41. DOI; 10.1097/AUD.0000000000000096
  4. Cameron, S., Glyde, H., & Dillon, H. (2011). Listening in Spatialized Noise-Sentences Test (LiSN-S): Normative and retest reliability data for adolescents and adults up to 60 years of age. J Am Acad Audiol, 22, 697–709. https://doi.org/10.3766/jaaa.22.10.7
  5. Dubno, J. R., Eckert, M. A., Lee, F. S., Matthews, L. J., & Schmiedt, R. A. (2013). Classifying human audiometric phenotypes of age-related hearing loss from animal models. Journal of the Association for Research in Otolaryngology : JARO, 14(5), 687–701. https://doi.org/10.1007/s10162-013-0396-x
  6. Dupuis, K., & Pichora-Fuller, M. K. (2015). Aging affects identification of vocal emotions in semantically neutral sentences. Journal of Speech, Language and Hearing Research, 58(3), 1061-1076. DOI: 10.1044/2015_JSLHR-H-14-0256
  7. Ezzatian, P., Li, L., Pichora-Fuller, M.K., & Schneider, B. A. (2015). Delayed stream segregation in older adults: More than just informational masking. Ear and Hearing, 36(4), 482-484. doi: 10.1097/AUD.0000000000000139
  8. Gatehouse, S., & Noble, W. (2004). The Speech, Spatial and Qualities of Hearing Scale (SSQ). International journal of audiology, 43(2), 85–99. https://doi.org/10.1080/14992020400050014
  9. Goy, H., Pichora-Fuller, M. K., Singh, G., & Russo, F. A. (2018). Hearing aids benefit recognition of words in emotional speech but not emotion identification. Trends in Hearing, 22. journals.sagepub.com/doi/full/10.1177/2331216518801736.
  10. Haubert, N., & Pichora-Fuller, M.K. (1999). The perception of spoken language by elderly listeners: Contributions of auditory temporal processes. Canadian Acoustics, 27(3), 96-97. https://jcaa.caa-aca.ca/index.php/jcaa/article/view/1235/972
  11. Humes L. E. (2007). The contributions of audibility and cognitive factors to the benefit provided by amplified speech to older adults. Journal of the American Academy of Audiology, 18(7), 590–603. https://doi.org/10.3766/jaaa.18.7.6
  12. Humes, L. E., Pichora-Fuller, M. K., & Hickson, L. (2020). Functional consequences of impaired hearing in older adults and implications for intervention (pp. 257-280). In K. Helfer, E. Bartlett, A. Popper, & R. R. Fay (Eds), Aging and hearing: Causes and consequences, Springer Handbook of Auditory Research 72. Springer Nature Switzerland AG. https://doi.org/10.1007/978-3-030-49367-7_11
  13. ISO 7029:2017. Acoustics — Statistical distribution of hearing thresholds related to age and gender (Edition 3, 2017). https://www.iso.org/standard/42916.html
  14. Liberman, M. C., & Kujawa, S. G. (2017). Cochlear synaptopathy in acquired sensorineural hearing loss: Manifestations and mechanisms. Hearing Research, 349, 138–147. https://doi.org/10.1016/j.heares.2017.01.003
  15. Phillips, Dennis. (1995). Central auditory processing: A view from auditory neuroscience. American Journal of Otology, 16(3), 338-352. Retrieved from http://ovidsp.ovid.com/ovidweb.cgi?T=JS&PAGE=reference&D=ovftb&NEWS=N&AN=00000455-199505000-00013.
  16. Pichora-Fuller, M.K., Alain, C., & Schneider, B. (2017). Older adults at the cocktail party (pp. 227-259). In J. Middlebrooks, J. Simon, A. Popper, & R. R. Fay (Eds), The auditory system at the cocktail party, Springer Handbook of Auditory Research. Springer: Berlin. https://doi.org/10.1007/978-3-319-51662-2_9
  17. Pichora-Fuller, M. K. (2020). Hearing and aging (pp 69-103). In L. Cuddy, S. Belleville, & A. Moussard (Eds.), Aging and Music. Elsevier Academic Press: London, UK. https://doi.org/10.1016/B978-0-12-817422-7.00003-1
  18. Pichora-Fuller, M.K., & Souza, P. (2003).  Effects of aging on auditory processing of speech. International Journal of Audiology, 42 (Supp 2), S11-S16. https://doi.org/10.3109/14992020309074638
  19. Pichora-Fuller, M.K., Schneider, B., Benson, N., Hamstra, S., & Storzer, E. (2005). Age differences in detecting gaps in speech and non-speech. Journal of the Acoustical Society of America, 117, 2572. https://doi.org/10.1121/1.4788573
  20. Russo, F., Ives, T., Goy, H., Pichora-Fuller, M.K., & Patterson, R. (2012). Age-related differences in melodic perception are probably mediated by temporal processing: Empirical and computational evidence. Ear and Hearing, 33, 177-186. doi: 10.1097/AUD.0b013e318233acee
  21. Vongpaisal, T., & Pichora-Fuller, M.K. (2007). Effect of age on F0 difference limen and concurrent vowel identification. Journal of Speech, Language and Hearing Research, 50, 1139-1156. https://doi.org/10.1044/1092-4388(2007/079)
About the author

Kathy Pichora-Fuller, PhD, Aud(C), RAUD, FCAHS, Professor Emerita of Psychology at U of T

Kathy Pichora-Fuller is Professor Emerita in Psychology at the University of Toronto and Adjunct Professor in Gerontology at Simon Fraser University. She is the audiology expert for the Canadian Longitudinal Study of Aging and a member of the sensory-cognitive-communication team of the Canadian Consortium on Neuro-degeneration in Aging. She translates her lab-based research on auditory and cognitive aging to address the needs of older adults who have sensory and cognitive declines. Currently, she serves on the editorial board of Ear and Hearing and is the Past President of the International Collegium of Rehabilitative Audiology.