Speech Recognition and Working Memory Capacity in Young-Elderly ...

2 downloads 41 Views 248KB Size Report
Pichora-Fuller, 2000; Wingfield & Stine-Morrow, 2000). Amongst the cognitive abilities related to the processing of speech, the working memory capacity has ...
Canadian Journal of Experimental Psychology 2009, Vol. 63, No. 3, 216 –226

© 2009 Canadian Psychological Association 1196-1961/09/$12.00 DOI: 10.1037/a0014321

Speech Recognition and Working Memory Capacity in Young-Elderly Listeners: Effects of Hearing Sensitivity Teresa C. Cervera, Maria J. Soler, Carmen Dasi, and Juan C. Ruiz University of Valencia Young normal-hearing listeners and young-elderly listeners between 55 and 65 years of age, ranging from near-normal hearing to moderate hearing loss, were compared using different speech recognition tasks (consonant recognition in quiet and in noise, and time-compressed sentences) and working memory tasks (serial word recall and digit ordering). The results showed that the group of young-elderly listeners performed worse on both the speech recognition and working memory tasks than the young listeners. However, when pure-tone audiometric thresholds were used as a covariate variable, the significant differences between groups disappeared. These results support the hypothesis that sensory decline in young-elderly listeners seems to be an important factor in explaining the decrease in speech processing and working memory capacity observed at these ages. Keywords: consonant recognition, compressed speech, working memory, elderly

condition. Moreover, presbycusic listeners are more vulnerable to noise. Loss of hearing acuity produces not only an attenuation of or a reduction in the overall level of the speech and the background noise, but also an increase in the signal-to-noise ratio necessary to accomplish the speech recognition (Plomp, 1986). The effects of hearing sensitivity in elderly listeners have also been observed in the studies of rapid speech recognition (Gordon-Salant & Fitzgibbons, 1993, 1999, 2001; Gordon-Salant, Fitzgibbons, & Friedman, 2007). These studies used time-compression sentences to simulate rapid speech (or a fast speaking rate). In addition to sensory decline, age-related cognitive changes are considered another factor that contributes to the speech processing difficulties in elderly listeners (CHABA, 1988; Schneider & Pichora-Fuller, 2000; Wingfield & Stine-Morrow, 2000). Amongst the cognitive abilities related to the processing of speech, the working memory capacity has received considerable attention. The decline in working memory capacity with age is well documented (Salthouse, 1991; Wingfield, Stine-Morrow, Lahar, & Aberdeen, 1988). The working memory system is particularly relevant in cognitive speech processing. It is conceived as both a storage system and a processing system (Baddeley, 1986; Baddeley & Hitch, 1974), with a “central executive,” an attentional mechanism that controls processing; and two subsidiary components, the “phonological loop” specialized in temporal retention of verbal information and a visuospatial sketchpad specialized in the retention of visual information. Working memory is thought to be crucial in language processing, but especially in auditory speech processing, because the intermediate products of comprehension have to be kept active until the listener is able to understand the message. Working memory deficits in elderly listeners can be the cause of some speech understanding difficulties. However, some authors have provided evidence showing that working memory deficits in elderly listeners with some degree of hearing loss can also be the result of an impoverished linguistic input (“effortful hypothesis,” Rabbit, 1968, 1991). According to this hypothesis, during speech

Speech processing difficulties often accompany elderly listeners in daily communication situations (Committee on Hearing, Bioacoustics, and Biomechanics [CHABA], 1988). These difficulties could be the result of age-related changes in the peripheral auditory system, such as an elevation of the pure-tone audiometric threshold, especially at high frequencies (Davis, 1994; Willot, 1991); changes in the central auditory system (Frisina & Frisina, 1997; Kim, Frisina, Mapes, Hickman, & Frisina, 2006); and some cognitive changes (CHABA, 1988; Pichora-Fuller, Schneider, & Daneman, 1995; Wingfield, Tun, & McCoy, 2005). Numerous studies have shown the primary role of the elevation of hearing thresholds in explaining the difficulties in speech recognition in elderly listeners compared to young listeners. The relevance of hearing sensitivity has been shown with different speech materials and listening conditions (Divenyi & Haupt, 1997; Humes, 1996; Humes et al., 1994; Jerger, Jerger, Oliver, & Pirozzolo, 1989; Jerger, Jerger, & Pirozzolo, 1991; van Rooij & Plomp, 1990, 1992; van Rooij, Plomp, & Oberleke, 1989). The studies by Helfer and Wilber (1990) and Helfer and Huntley (1991) used nonsense syllables in quiet and in background noise with the purpose of studying consonant confusions and types of perceptual errors in elderly listeners. Philips, Gordon-Salant, Fitzgibbons, and Yeni-Komshian (2000) also used nonsense syllables in quiet and in noise to study temporal resolution and frequency resolution deficits in these listeners. The use of nonsense speech stimuli is relevant in assessing speech recognition ability when semantic and contextual top-down cues are not available to the listener. The use of masking noise in this assessment is essential, because it reproduces a more realistic everyday listening

Teresa C. Cervera and Maria J. Soler, Department of Psychology, University of Valencia; Carmen Dasi and Juan C. Ruiz, Department of Methodology of Behavioural Sciences, University of Valencia. Correspondence concerning this article should be addressed to Teresa Cervera, Departamento de Psicologı´a Ba´sica. Facultad de Psicologı´a., Blasco Ibanez, 21, 46010 Valencia SPAIN. E-mail: [email protected] 216

SPEECH RECOGNITION AND WORKING MEMORY

understanding in effortful or adverse listening conditions, all the processing resources are directed toward the perceptual processing, which has priority. As a result, there will be fewer resources for storage (McCoy et al., 2005; Murphy, Craik, Li, & Schneider, 2000; Schneider, Daneman, & Pichora-Fuller, 2002; Wingfield et al., 2005). Pichora-Fuller et al. (1995) studied the link between working memory capacity, auditory function, and speech understanding in elderly listeners. In this study, the listeners had to identify the last word in a sentence and retain it through n successive sentences. Different levels of signal-to-noise ratio were used. Old listeners recalled fewer words in noise conditions than young listeners. Their results support the hypothesis that in adverse listening conditions (age-related hearing deterioration and noise), hearing difficulty consumes cognitive resources that would otherwise be allocated to concurrent and subsequent processes of remembering and comprehending. However, controversy still remains about the exact contribution of hearing and working memory on speech processing. A factor that might be relevant is the exact age at which the older listeners are tested. Some studies have found that prediction of speech recognition based on hearing thresholds decreases in accuracy when the listener’s age increases beyond 70 years (Shebercoe & Studebaker, 2003) or 80 years (Magnusson, 1996). Divenyi, Stark, and Haupt (2005) found that the decline in speech understanding is faster than the decline in hearing thresholds from the age of 70 on. Souza, Boike, Witherell, and Tremblay (2007) found that audibility was a better predictor of speech recognition for listeners between 50 and 65 years of age than for the other older groups of listeners used in the study (67 to 76 and 77 to 82 years of age), who fell below the predicted scores. These studies suggest that hearing acuity could explain most of the differences in the speech recognition performance in young-elderly listeners (under 65 years of age). Beyond this age, there is probably a greater involvement of cognitive factors. A review of the research literature on speech understanding in elderly listeners during the last two decades (Tables 1a and b) showed that most of the studies have used a wide range of listener ages, from 60 to 80 years, with a mean age of around 70 years. Table 1a summarizes the studies that have evaluated speech recognition and working memory capacity in elderly listeners (although some of them assessed working memory capacity with the sole purpose of matching the younger and older participants on this capacity (Jenstad & Souza, 2007; Lee & Humes, 1992; Tun, 1998; Wingfield, Tun, Koh, & Rosen, 1999). Table 1b summarizes the studies of speech recognition that did not take cognitive factors into account. These tables show that few studies have made the distinction between young-old and old-old listeners (Table 1a: Jenstad & Souza, 2007; Lee & Humes, 1992; Table 1b: Humes & Christopherson, 1991). Other studies have used a middle-aged group (Table 1b: Ohde & Abou-Khalil, 2001; Vaugham & Letowski, 1997). Only a few studies included a group of elderly listeners less than 65 years of age (Table 1b: Gelfand, Pipper, & Silman, 1985, 1986; Snell, Mapes, Hickman, & Frisina, 2002; Versfeld & Dreschler, 2001). Thus, it appears to be an age range, between 55 to 65, that has received little attention, despite the fact that signs of age-related hearing loss (presbycusis) are expected at these ages (Davies, Ostri, & Parving, 1991; Divenyi et al., 2005; Gates, Cooper, Kannel, & Miller, 1990; Morrell, Gordon-Salant, Pearson, Brand,

217

& Fozard, 1996). In order to complete the knowledge about the speech processing difficulties across the adult life span, studying this range of ages can be quite useful especially with regard to whether there are speech processing difficulties, a decrease in working memory capacity, and the role played by hearing acuity. The present study was designed to answer to these questions using listeners between 55 and 65 years of age. The hypothesis underlying the present study is that the degradation of sensory input due to some degree of hearing loss that typically accompanies aging, even in young-elderly listeners, could explain the decrease in both speech recognition and working memory capacity. In order to accomplish this objective, different measures of speech recognition and working memory performance were obtained in a group of young-elderly listeners and compared to those obtained in young listeners. The contribution of hearing sensitivity to the performance on these tasks was assessed by including each individual’s auditory pure-tone threshold (PTA) as a covariate in the analysis of the differences between the young and elderly groups. If age-related decline on speech recognition and working memory tasks is caused by the increase in the hearing thresholds of the listener, the differences in the scores obtained by the two groups will disappear when the PTA is used as a covariate. The speech recognition tasks included consonant recognition (in vowel-consonant-vowel sequences; VCV) in quiet and in background noise and sentences under normal and fast speaking rate (or time-compression) conditions. Because the effects of hearing might be different for each of them, the speech recognition was studied at two different processing levels: phoneme and sentence. Moreover, the relationship between working memory capacity and each of these speech processing levels might be different as well. Consonants were presented in noise conditions, because consonant recognition in quiet and in noise is usually part of the audiological examination of individuals with presbycusis. On the other hand, the sentences were presented in conditions of fast speaking rate, because a decrease in temporal processing abilities is part of the aging process. In addition, using sentences makes it possible to examine the use of contextual information. There is evidence that older listeners can successfully use the contextual information provided by the sentence to compensate for their sensory loss (Dubno, Ahlstrom, & Honwith, 2000; Pichora-Fuller et al., 1995). Thus, it is expected that sentence materials will be correctly perceived in nondegraded listening conditions by the youngelderly listeners, although the performance will decrease in the time-compressed condition. On the other hand, it is expected that the elderly listeners will have difficulties in the recognition of consonants both in quiet and, especially, in noise conditions. In the absence of contextual or top-down information, the listeners must rely on the acoustical information that is degraded due to some degree of hearing loss. Working memory capacity was assessed using a serial recall task or word span, as a measure of storage capacity of the phonological loop (Baddeley, 1986) and a digit ordering task as a measure of the storage and processing capabilities of the central executive (Cooper, Sagar, Jordan, Harvey, & Sullivan, 1991). The digit ordering was included as an alternative to the listening span task (Pichora-Fuller et al., 1995), which is the auditory equivalent of the Daneman and Carpenter span task (Daneman & Carpenter, 1980). The digit ordering task was used in the present study,

CERVERA, SOLER, DASI, AND RUIZ

218

Table 1 Summary of Studies of Speech Recognition in Elderly Listeners Including (A) and Not Including (B) Working Memory Measures. YNH (Young Normal-Hearing), YHL (Young Hearing-Loss), ENH (Elderly Normal-Hearing), EHL (Elderly Hearing-Loss/Presbycusis), Y-O (Young-Old), O-O (Old-Old), MNH (Middle Age Normal-Hearing) a) Study

Samples

N (elderly groups)

Age range (elderly groups)

Jerger et al., 1989 van Roij et al., 1989 van Roij & Plomp, 1990 Jerger et al., 1991 Lee & Humes, 1992 van Roij & Plomp, 1992 Humes et al., 1994 Pichora-Fuller et al., 1995 Humes, 1996 Gordon-Salant & Fitzgibbons, 1997 Tun, 1998 Wingfield et al., 1999 Souza & Kitch, 2001 Humes, 2002 Vaugham et al., 2002 Wingfield et al., 2003 Humes, 2005 Humes & Floyd, 2005 Humes et al., 2007 Jenstad & Souza, 2007 Souza et al., 2007

EHL YNH, ENH, EHL YNH, EHI EHL Y-O, O-O (HL) YNH, EHI EHL YNH, YH, ENH, EHL EHL YNH, YHL, ENH, EHL YNH, ENH YNH, ENH YNH, YHL, ENH, EHL EHL YNH, ENH, EHL EHL EHL YNH, EHL YNH, ENH, EHL Y-O, O-O ( HL) ENH, EHL, EHL

130 24, 24 72 200

50–91

Samples

N (elderly groups)

Age range (elderly groups)

16/decade 16/decade

20–60 20–60 ⬎ 60 62–79, 65–76 63.74, 75–83 61–82 60–81 45–55, 65–75 65–7, 65–76 67–81, 68–81 63–80 65–72 52–5, 70–76 58–70, 55–73 60–74 55–88 65–76, 65–76 60–84 65–80 66–81 ⬎ 65 63–84 65–85 65–76, 65–76

72 50 8, 8 50 10, 10 18 15 12, 9 171 18, 18 30 213 24 10, 16 11, 14 10, 12, 13

60–93 50–91 63–74, 75–83 60–93 63–84 67–91, 65–77 63–84 65–76, 65–76 64–78 61–80 67–81, 71–95 60–89 61–80 60–89 62–86 61–80, 60–87 62–74, 75–88 50–65, 67–75, 77–82

Mean age (elderly groups) 69.9 72.8 73.9 69.7 73.9 72.3 75.8, 70.4

71.9 74.0, 78.0 73.0 67.3, 70.3 72.2 73.2 75.6 70.0, 75.0 58.2, 71.6, 80.0

b) Study Gelfand et al., 1985 Gelfand et al., 1986 Helfer & Wilber, 1990 Helfer & Huntley, 1991 Humes & Christopherson, 1991 Wingfield et al., 1991 Divenyi & Haupt, 1997 Vaugham & Letowski, 1997 Gordon-Salant & Fitzgibbons, 1999 Halling & Humes, 2000 Wingfield et al., 2000 Gordon-Salant & Fitzgibbons, 2001 Ohde & Abou-Khalil, 2001 Versfeld & Dreschler, 2001 Dubno et al., 2002 Snell et al., 2002 Gordon-Salant & Fitzgibbons, 2004 Divenyi et al., 2005 Ha¨llgren et al., 2005 McCoy et al., 2005 Schneider et al., 2005 Amos & Humes, 2007 Golomet al., 2007 Gordon-Salant et al., 2007

5 decades (NH) 5 decades (NH) YNH, EHL YNH, ENH, EHL Y-O, O-O (HL) YNH, ENH EHL YNH, MNH, EHL YNH, YHL, ENH, YNH, ENH, EHL YNH, ENH YNH, YHL, ENH, YNH, MNH, ENH YNH, YHL, ENH, YNH, ENH, EHL YNH, EHL YNH, YHL, ENH, EHL YHL, EHL ENH, EHL YNH, EHL YNH, EHL YHN, ENH YNH, YHL, ENH,

EHL EHL EHL EHL

EHL

because it measures working memory capacity independently from speech processing (MacDonald & Christiansen, 2002) and because it does not require complex instructions. This measure of working memory capacity has not previously been used in the studies of speech recognition difficulties in elderly listeners. Based on the hypothesis that differences in working memory capacity between young and elderly listeners could be attributed to

8, 8 18, 18 18 45 12, 18 10 8, 12 18 13, 13 8, 8 11, 12 8 28 11, 15 29 12 12, 12 36 16 10 to 16

Mean age (elderly groups)

70.1, 70.8 69.1 64.4 48.8, 69.1 72.1, 73.1 69.9 67.3, 70.3 54.9, 73.7 64.6 67.3 68.7 69.6 71.8 72.9, 73, 4 69.7, 74 72.4

the sensory degradation of the stimuli caused by the decrease in hearing acuity (McCoy et al., 2005; Murphy et al., 2000; Schneider et al., 2002; Wingfield et al., 2005), we expect that the differences in hearing thresholds of the listeners will account for the differences between the two groups of listeners. Finally, the relationships between speech recognition and working memory measures were examined. Individual differences in

SPEECH RECOGNITION AND WORKING MEMORY

219

working memory capacity were expected to be related to the performance on the sentence recognition, because successive elements have to be identified and retained until a meaning is obtained in conditions of highly demanding (or “effortful”) bottom-up processing (time-compression or rapid speech). However the recognition of VCV stimuli was not expected to be related to working memory capacity.

ogy or a history of prolonged exposure to high-level environmental noise. All of them were considered to have a good general cognitive functioning, as they successfully participated in the courses at the University of Valencia for elderly people. All the participants were native Castilian-Spanish speakers.

Method

Prior to the experiment, the participants’ hearing was measured. Pure-tone air conduction thresholds were measured at 250, 500, 1000, 2000, 4000, and 8000 Hz in both ears using an Audiotest 330 audiometer in a sound-proof room. As a measure of hearing sensitivity, we used the pure-tone average (PTA) of the 500, 1000, 2000, and 4000 Hz frequencies (PTA4) in both ears for each subject. This measure is similar to those previously used in other studies of speech recognition in the elderly, where sensory acuity was considered in the experimental design. Divenyi et al. (2005) used the average thresholds of the 125 to 4000 Hz frequencies. Helfer and Huntley (1991) used two different measures: PTA1, which was calculated across 500, 1000, and 2000 frequencies; and PTA2, which was calculated for higher frequencies (2, 3, 4, and 6 kHz).

Participants The group of young listeners included 28 participants, ranging in age from 19 to 25 years with a mean of 21.65 (SD ⫽ 1.7). They were undergraduate students at the University of Valencia who participated voluntarily in the study. All the participants had normal audiograms (pure-tone air-conduction thresholds in the better ear of less than 20 dB SPL from 250 to 8000 Hz (American National Standards Institute, 1996). See Figure 1. The group of young-elderly listeners were 27 participants enrolled in special courses that the University of Valencia offers people over 55 years of age and were selected after an audiometrical testing, conducted to filter out those participants with more than moderate hearing loss. Their ages ranged from 55 to 65 (with a mean age of 60 and SD ⫽ 5). They participated voluntarily in the study. Their audiograms were variable, ranging from near normal pure-tone thresholds (although the thresholds at the higher frequencies could be greater than those of the young participants) to mild-to-moderate hearing losses (pure-tone air-conduction thresholds in the better ear in the range of 25 to 55 dB SPL from 250 to 8000 Hz (see Figure 1). Thus, the group of young-elderly listeners presented a wide range of the typical progressive mild to moderate hearing loss, especially in high frequencies. All the older participants had approximately bilateral symmetrical pure-tone thresholds. The etiology of their hearing was presbycusis, characterized by a progressive hearing loss in the last five to 10 years. None of these participants used hearing aids. None of them had a history of neurological disorders or middle ear pathol-

KHz 0.5

1

2

4

8

0 10 20

dB

30 40 50 60

young elderly

70

Figure 1. Mean (and standard deviation) pure-tone air conduction thresholds for young and young-elderly listeners.

Audiological Testing

Apparatus and Stimuli Speech stimuli: Consonants. The consonants were presented in VCV format (vowel-consonant-vowel). They consisted of the Spanish consonants /b/, /d/, /g/, /p/, /t/, /k/, /s/, /␪/, /f/, /␹ /, /r/, //␭, /L/, /␭/, /m/, and /n/ combined with the vowel /a/ forming /aba/, /ada/, and so forth sequences. The VCV format was selected over the CVC format, frequently used in the studies with the English language, because it is more representative of the syllabic structure of the Spanish language. The VCV stimuli were spoken by a native female speaker of the Spanish language. Three repetitions of each of the CVC stimuli were required from the speaker. The clearest production of each of them was selected. In addition, there was a requirement that the duration of the utterance had to be approximately 400 ms. The recording was made in a sound-proof room using a Sennheiser HMD 224 microphone set at 25 cm from the lips and directly digitalized in the computer using an Edirol UA-5 sound card, with a sampling frequency of 11.025 kHz, and then low-pass filtered at 5.5 kHz to prevent aliasing. The speech materials were edited with Adobe Audition sound editor software. First, the 16 speech stimuli were made to have an equal duration of 400 ms. Visual inspection of the waveform and the spectrogram was used to determine optimal points at which to excise the VCV segment. Thus, some milliseconds of the last vowel of some of the VCV stimuli could be removed. Then, the intensity of each stimulus was also adjusted to make them have an equal root mean square (RMS) across all the VCV stimuli. To create the masking noise condition, we used flat spectrum noise, (low-pass 5.5 kHz at a sampling frequency of 11.025 kHz, using a 200 order FIR filter). The noise was added to each speech stimulus to make the SNR during the VCV segment equal to 6 dB SPL. These manipulations were performed using MATLAB 5.3 routines (Math Works, Inc., 1990). Speech stimuli: Sentences. Ten sentences were used in the experiment, two of which were used in the practice session and eight in the experimental session. They were syntactically correct

220

CERVERA, SOLER, DASI, AND RUIZ

with a low semantic predictability. In the Spanish language, there are not standardized sentence materials where predictability is controlled as it is in the Speech Perception in Noise sentences (SPIN; Kalikow, Stevens, & Elliot, 1977) used in most of the studies of speech perception in elderly listeners with the English language. Therefore, a previous study had been carried out to select the sentence materials to be used in the experiment. An initial pool of 20 sentences was created. As a criterion of low predictability, it was determined that the content words in the sentence (nouns, adjectives, verbs, and adverbs) had no semantic relationship (i.e., “Ese pez suelto podria ser un gran problema” “This loose fish could be a big problem”). Each sentence had between five and nine words, four to seven of which were content words. We created four versions of each sentence. In each version, one of the content words in the sentence was deleted (i.e., “Ese pez ____ podria ser un gran problema”), creating four incomplete forms of the same sentence, yielding a total of 20 sentences ⫻ 4 forms ⫽ 80 incomplete sentences. With this pool of sentences, we prepared four different sets, each consisting of a list of the 20 sentences, and each of them having one deleted word. The deleted word could be the first, second, third, or fourth content word in the sentence, so that each sentence appeared in each set with one different deleted content word randomly assigned each time. Each set, consisting of 20 sentences each, was presented to a group of 50 participants from the University of Valencia, on a paper-and-pencil test. A total of 200 subjects participated in this task (4 sets ⫻ 50 participants). The versions of the sentences assigned to each set were counterbalanced across listeners, so that each version was presented an equal number of times. The participants were asked to fill in the word they thought was most likely to occur in this sentence. The criterion for selecting the sentences to be used in the present study was that none of the participants guessed the deleted key word. Ten sentences from the initial pool were selected. The selected sentences were spoken by a female native Spanish speaker. She was asked to speak at a speaking rate typical of “clear speech,” in other words, as if she were talking to an audience. Several repetitions of each sentence were required until the one that met the requirements was selected. The utterances were recorded directly into the computer. All the digitalized sentences were edited with sound editing software. They were low-pass filtered and equated in intensity. The recording, editing, and manipulating procedure was the same as the one followed for the VCV stimuli. The sentences were manipulated to create two different compression ratios or speaking rate conditions for each sentence: “normal” and “fast.” The normal speaking rate was the original sentence with no modifications. The speaking rate, determined by dividing the number of syllables in the sentence by the total duration in seconds, yielded 5.3 syllables per second. The fast speaking rate was created by a 50% time-compression of the total duration of the sentence (this was equivalent to 10.6 syllables per second). To create the 50% time-compression, each sentence was artificially time compressed using the Adobe Audition sound editor, which uses the Pitch-Synchronous Overlap and Add algorithm (PSOLA; Moulines & Laroche, 1995). This algorithm performs uniform duration reduction in the time domain with small changes in the frequency domain. Each digitized version of the sentence (normal and fast) was stored separately on a computer hard disk, after the sentences were equated in their intensity (RMS level).

The purpose of the 50% compression condition used in the present experiment was to find a significant decrease in the recognition of the sentences, without creating a floor effect. Thus, our procedure was based on prior studies that used compressed sentences. Tun (1998) used sentences time-compressed to 60% and 80% of their original time. Wingfield et al. (1999) used passages with 55% and 68% time compressions. Gordon-Salant and Fitzgibbons (2001) used a 50% time compression. Schneider, Daneman, and Murphy (2005) used 33% and 50% time-compressions, and Humes, Burk, Coughlin, Busey, and Strauser (2007) used conditions of 45% and 33% time-compression. Stimuli for the working memory tasks: Serial recall. The stimuli used in the serial recall task were 28 Spanish words from Alameda and Cuetos’s (1995) dictionary of frequencies. All the words were bisyllabic with a mean frequency of 16.7 occurrences per million. Seven sets of stimuli ranging from one to seven words were used. The words were produced by a female native Spanish speaker, using the same recording, editing, and manipulating procedures as in the previous sections. The stimuli were organized into the different lists using a stimuli presentation computer program. The words were presented continuously at a rate of one word per second. Stimuli for the working memory tasks: Digit ordering. In the digit ordering task, digits from 1 to 9 were used. Six lists ranging from two to seven digits were formed (one for each sequence length). Each list had one digit more than the preceding one. The two digits in the initial list were randomly assigned, as well as the following digits in each successive list. The recording, digitalizing, and organisation procedures of the sets of stimuli were the same as in the previous task.

Procedure The listeners performed the tasks individually in a sound-proof room in two sessions. Session 1 contained all the speech recognition tasks, and Session 2 contained the verbal working memory tasks. Previously, each individual’s pure-tone hearing threshold was determined. Session 1 began with the consonant recognition tasks, which included the 16 VCV stimuli. Each was presented three times, yielding a total of 48 stimuli. Each stimulus was presented in two listening conditions: in quiet and in background noise. The order of the stimuli and conditions was randomized across the listeners. The stimuli were presented via a computer routed to the input of the clinical audiometer Audiotest 330 to control the level of the signal output. The participants listened to the stimuli diotically through TDH39 Audiometric headsets. The stimuli were presented to the young participants at 70 dB SPL. For the elderly participants, the level was individually adjusted so that it was 15dB SPL above their hearing pure-tone thresholds at 4000 Hz, to ensure audibility. The presentation of the stimuli was controlled by the experimenter by pressing the space bar on the computer. The participants were required to listen to the stimuli, identify the consonant in the VCV sequence, and then mark their response on a form containing 17 letters corresponding to the 17 consonants. No special training in phonetics was required for this task, because in the Spanish language the correspondence between phonemes and letters is biunivocal, at least for the stimuli presented in this task. After each response, the next stimulus was presented. Prior to

SPEECH RECOGNITION AND WORKING MEMORY

Results Speech Recognition Consonants. The mean percent correct recognition scores obtained in quiet and noise conditions for the two groups of listeners are shown in Figure 2. This figure shows that the young-elderly listeners had lower scores than the young listeners in both conditions as expected. While the recognition was nearly perfect (98.95%) in the younger group, the performance was about 11% lower in the older group (with 89.7% recognition scores). A similar difference of about 10% was found in the noise condition (72% and 62% recognition scores for the younger and the older

young 100 Correct Recognition (%)

the experimental session, the listeners performed a practice session in order to become familiarized with the stimuli and the task. Feedback was provided during the practice, but not during the experiment. A percent correct score was derived for each condition based on the total number of correctly recognized consonants relative to the total number of stimuli presented. Second, the participants were presented with the sentence materials. The eight sentences on the test were presented in the two compression conditions as follows: Each listener was presented with two blocks of stimuli. One block contained four of the sentences in the no compression condition, whereas the other block contained the other four sentences in the compression condition. The sentences assigned to each block were counterbalanced across listeners, so that each sentence and each compression condition was presented an equal number of times. The order of presentation of the two blocks was the same for all the listeners. The presentation of the sentences was controlled by the experimenter through the computer. The task of the listeners was, after listening to the sentence, to repeat verbatim as many words as possible from the sentence. Listeners were encouraged to guess any words that were not clear or not intelligible. The experimenter registered the response, and then the next stimulus was presented. The listener’s responses were scored as the number of content words accurately identified. Omissions of plurals or changes in verb tense were also accepted. Percent scores were derived for the total number of words correctly repeated from the four sentences in each condition, based on the total number of content words in the four sentences. Prior to the experimental condition, the participants were given adequate practice. For this purpose, two sentences were presented in both time-compression conditions. They received feedback in this practice session but not in the experimental session. Session 2 contained the two working memory tasks. All the participants performed them in the same order: serial order recall and then the digit ordering task. In the serial recall task, the participants had to recall each list of words from the first item to the last item in the same order in which they were presented. No time restrictions were given. The number of lists correctly recalled was scored, and the corresponding percentage out of the total pool of the seven lists was calculated. In the digit ordering task, six random sequences of digits were presented. The subjects were asked to recall each sequence of digits but in ascending order. The number of series correctly recalled was scored, and then the percentage out of all the lists (six lists) was calculated.

221

elderly

80

60

40

20

0 Quiet

Noise Conditions

Figure 2. Mean (and standard deviation) correct recognition scores (%) of consonants (in VCV sequences) in quiet and in noise conditions for young and young-elderly listeners.

group, respectively). Thus, it seems that noise produced a similar decrease in the recognition of the consonants in both young and young-elderly listeners. To evaluate the effects of age and listening conditions, an ANOVA was performed. The scores obtained in the two listening conditions were used as a dependent variable for a mixed design, with age as a between-subjects factor (young and young-elderly) and listening condition (quiet and noise) as a within-subject factor. In this analysis, the main effects for both age, F(1, 53) ⫽ 14.52, p ⬍ .01, partial ␩2 ⫽ 0.21, and listening conditions, F(1, 53) ⫽ 27 8.67, p ⬍ .01, partial ␩2 ⫽ 0.84, were significant. The interaction was not significant. The demonstration that young and young-elderly listeners had significant differences in the recognition of consonants in both conditions, quiet and noise, raised the question of whether the different hearing status of the listeners could have accounted for this result. Thus, we conducted an ANCOVA analysis, using hearing sensitivity (pure-tone thresholds from 500 to 4000 Hz PTA4) as a covariate. The relationships between the performance on each of the consonant recognition tasks (in quiet and in noise conditions) and PTA4 were calculated by means of Pearson correlation analyses. Table 2 shows the correlation matrix (which included the other speech and working memory tasks as well). The performance in both consonant recognition conditions had a negative significant correlation with hearing sensitivity ( p ⬍ .01). The ANCOVA with the recognition of consonants as a dependent variable and age (young and young-elderly) and listening conditions (quiet and noise) as independent factors, with hearing sensitivity (PTA4) as a covariable, showed that the listening conditions had significant effects, F(1, 52) ⫽ 32.17, p ⬍ .01, partial

CERVERA, SOLER, DASI, AND RUIZ

222

Table 2 Matrix of Correlations Between Speech Recognition and Working Memory Measures Measures 1. 2. 3. 4. 5. 6. 7. ⴱ

1

PTA4 VCV quiet VCV noise Normal speaking rate Fast speaking rate Serial recall Digit ordering

p ⬍ .05.

ⴱⴱ

— — — — — — —

2

3 ⴱⴱ

⫺.54

⫺.46 .53ⴱⴱ

⫺.17 .29 .03

5

6 ⴱⴱ

⫺.58 .53ⴱⴱ .40ⴱⴱ .59ⴱⴱ

7 ⴱ

⫺.33 .12 .05 .12 .37ⴱ

⫺.39ⴱ .15 .19 .13 .36ⴱ .43ⴱⴱ

p ⬍ .01.

␩2 ⫽ 0.38. In contrast, no significant differences were observed for age or for the interaction. Sentences. Mean percent recognition scores of the two groups of listeners in the two speech rate conditions (original or “normal” rate, and time-compression of 50% or “fast” rate) are shown in Figure 3. The normal speech condition produced very high recognition scores (96.8% and 93.7% for the young and young-elderly listeners, respectively). The fast rate condition produced 53% recognition scores in the young-elderly group and a higher value of 74% in the young group. The differences in the recognition scores of the sentences between the normal and fast conditions were about 22% in the young group and higher (40%) in the elderly group. The data obtained from the two conditions of time compression or speech rate, normal and fast, were submitted to a mixed design ANOVA in which age (young and young-elderly) was a betweensubjects factor and speech rate (normal and fast) was a within-

young 100 Correct Recognition (%)

4 ⴱⴱ

elderly

80

60

40

20

0 Normal

Fast Speech rate

Figure 3. Mean (and standard deviation) correct recognition scores (%) of sentences in normal and fast speaking rate conditions for young and young-elderly listeners.

subject factor. Significant effects for both age, F(1, 53) ⫽ 13.64, p ⬍ .01, partial ␩2 ⫽ 0.20, and speech rate, F(1, 53) ⫽ 163.55, p ⬍ .01, partial ␩2 ⫽ 0.75, were found. The interaction was also significant, F(1, 53) ⫽ 8.44, p ⬍ .05, partial ␩2 ⫽ 0.13. As the elderly listeners differed on hearing thresholds (PTA4), we conducted an ANCOVA with PTA4 as a covariable, in order to determine whether hearing sensitivity could account for the differences between younger and older adults in the recognition of the sentences. The correlations between the performance in each speech rate condition (normal and fast) are shown in Table 2. This table shows that there were negative significant correlations between the fast speech rate condition and PTA4 ( p ⬍ .01), but not between the normal speech rate condition and PTA4. Thus, an ANCOVA was conducted to partial out the effects of hearing sensitivity with the recognition scores of sentences as a dependent variable and age (young and young-elderly) and speech rate (normal and fast) as independent factors, with hearing sensitivity (PTA4) as a covariate. It showed significant effects for speech rate conditions, F(1, 52) ⫽ 4.41, p ⬍ .05, partial ␩2 ⫽ 0.07. But there were no significant effects for age and speech rate or for the interaction.

Working Memory Serial recall and digit ordering. The data obtained on both working memory tasks by young and young-elderly participants are shown in Figure 4. Young-elderly participants obtained lower percent scores than younger ones, both on serial recall and digit ordering tasks. In order to assess whether the observed differences between the two groups of participants were significant, we conducted a t test for each task. The t test for serial recall, using the percentage of lists correctly repeated as the dependent variable, showed significant differences between younger and older subjects, t(53) ⫽ 2.05, p ⬍ .05. The same analysis was performed for the digit ordering task, in which the dependent measure was also the percentage of series correctly recalled. The t test also showed significant differences, t(53) ⫽ 2.49, p ⬍ .01. As Table 2 shows, the performance on the serial recall task showed a significant negative correlation with hearing sensitivity (PTA4; p ⬍ .05). The performance on digit ordering showed a negative significant correlation with PTA4 as well ( p ⬍ .05). Thus, an ANCOVA using hearing sensitivity (PTA4) as a covariate factor was performed to partial out the influence of hearing sensitivity on the performance on each of the working memory tasks. The ANCOVA on the serial recall data showed no signifi-

SPEECH RECOGNITION AND WORKING MEMORY

young

Correct Recognition (%)

100

elderly

80

60

40

20

0 Serial recall Digit ordering Working memory Figure 4. Mean (and standard deviation) correct recognition scores (%) in serial recall and digit ordering tasks for young and young-elderly listeners.

cant differences between young and elderly subjects. The same thing occurred for the digit ordering data. The correlation matrix (see Table 2) shows the Pearson correlation values amongst the different speech recognition and working memory measures. The individual scores of the listeners in the two consonant recognition conditions (quiet and noise) correlated positively and significantly ( p ⬍ .01). The performance in the two conditions of sentence recognition (normal and fast speech rate) correlated positively and significantly ( p ⬍ .01). On the other hand, the performance on the two working memory measures correlated positively and significantly ( p ⬍ .01). There were no significant correlations between the performance on each of the two consonant recognition tasks (in quiet and in noise) and on each of the two working memory tasks. However, the performance on the two working memory task measures correlated positively and significantly with the performance on the recognition of sentences presented at the fast speech rate ( p ⬍ .01).

Discussion The research on speech recognition in the elderly has been lacking in subjects between 55 and 65 years (young-elderly). The present study reveals many interesting results with this group of participants. 1) Young-elderly listeners with different degrees of age-related hearing loss performed worse than young listeners on both speech recognition and working memory tasks, although these differences can be attributed mainly to the differences in hearing thresholds. 2) The recognition of sentences in youngelderly listeners in ideal listening conditions was very high and similar to that of younger listeners. 3) Consonants, even in quiet conditions, were more poorly perceived by the young-elderly than

223

by the young listeners. 4) The effects of background noise on consonant recognition were similar in the young-elderly and younger groups of listeners. 5) Fast speaking rate produced a greater decrease in sentence recognition in the group of youngelderly listeners than in the group of younger listeners. 6) The group of young-elderly listeners showed less working memory capacity than the group of young listeners, as was expected. Again, the differences in hearing thresholds seemed to explain these differences. The worse performance on the perception of consonants in quiet and noise by young-elderly listeners, and its relationship with hearing thresholds, agrees with the results obtained in previous studies with listeners above 60 years of age. Philips et al. (2000) found significant differences between elderly listeners with normal hearing and those with moderate hearing loss (which were subdivided into two groups according to their word recognition abilities) both in quiet and in noise. Helfer and Wilber (1990) and Helfer and Huntley (1991) employed two groups of elderly listeners according to their hearing sensitivity (with near-normal hearing and with hearing loss typical of presbycusis). They found effects of both age and hearing-thresholds of the listeners in noise conditions, although they did not find differences between groups in the quiet condition. The study by Gelfand et al. (1985), which examined listeners in their 50s and 60s, found significant differences between them and the younger groups of listeners (in their 20s, 30s and 40s), but all the listeners had similar normal hearing thresholds. On the other hand, in the present study, the sentences were well recognized by young-elderly listeners in the normal speech rate condition. This finding may be due to the fact that contextual linguistic information (even when it is only syntactic and not semantic) can be used just as well by young-elderly listeners as it can by younger listeners. Previous studies have confirmed that this ability is not impaired in elderly listeners (Dubno et al., 2000; Wingfield, Aberdeen, & Stine, 1991). However, time-compression made the sentences more difficult to recognize for the young-elderly than for the young listeners, as was expected (Gordon-Salant & Fitzgibbons, 1993, 1999; Humes et al., 2007; Schneider et al., 2005, 2002; Tun, 1998; Wingfield et al., 1999). However, the results on the recognition of compressed sentences in the present study with the Spanish language are not directly comparable to those obtained in previous studies with the English language, because the compression ratio varies across the studies and because English and Spanish are languages with different syllabic structures and prosodies. This disadvantage in the elderly listeners has been explained as reflecting the difficulty in processing brief acoustic cues imposed by compressed techniques (or by rapid speech rates; GordonSalant & Fitzgibbons, 2001; Gordon-Salant et al., 2007). This difficulty may be the result of a poorer auditory system in the older listeners compared to the younger listeners (Humes et al., 2007). The results in the present study support this hypothesis. When the effects of hearing thresholds were statistically partialled out, there were no significant differences between the two groups of listeners. The association between recognition of compressed sentences and PTA is not surprising. The recognition of rapid speech is thought to be related to temporal processing abilities, and some studies have shown that a decrease in hearing thresholds covary with loss of spectral and temporal resolution (Halling & Humes, 2000; Humes, Espinoza-Varas, & Watson, 1988). Other studies,

224

CERVERA, SOLER, DASI, AND RUIZ

however, have suggested that age-related slowing of processing may be the factor responsible for the difficulties in the recognition of fast speech in elderly listeners (Tun, 1998; Wingfield et al., 1999). The role of hearing loss in the recognition of rapid speech has been studied by Schneider et al. (2005), who compared different methods of speeding up speech. When rapid speech was obtained by deletion of steady-state segments and pauses (which did not affect the temporal acoustical cues relevant in consonant perception), instead of a uniform time deletion of every nth segments, the differences in the recognition scores between young and older listeners were not significant. The present study used uniform time-compression as a method to simulate fast speech. This method produces distortion of some temporal acoustic cues, and it is possible that the auditory system of the young-elderly listeners is not capable of coping with these distortions. This situation could explain the association between recognition of fast speech and PTA found in the present study. Together with these perceptual difficulties, it is possible that the decreases in working memory capacity shown by the elderly listeners are also involved in the recognition of compressed sentences in the present study. When listeners with poorer auditory acuity are presented with a degraded acoustic signal, such as time-compressed sentences, the demands on the working memory increase; the temporary retention of the incoming speech signal is disrupted, and the performance on sentence recognition decreases. The correlations amongst the performances on compressed sentence recognition, working memory capacity, and PTA suggest the involvement of perceptual and cognitive (working memory capacity) difficulties in the recognition of rapid speech. The association between age-related hearing loss and performance on the working memory tasks (serial word recall and digit ordering) found in the present study, agrees with previous studies (McCoy et al., 2005; Pichora-Fuller et al., 1995; Schneider et al., 2002; Wingfield et al., 2005). These authors explain their results in concordance with the “effortful hypotheses” (Rabbit, 1968, 1991). From this point of view, the working memory is an active system that must allocate both storing and processing information. If the process of recognising the speech material is effortful (even when the reception is successful), due to age-related hearing loss, most of the resources must be allocated to the perception of the speech signal, and fewer resources may be available to accomplish other tasks such as retaining or manipulating the stored materials. The results in the present study can be interpreted based on this hypothesis, and they provide additional data for the age range from 55 to 65, which had not been evaluated before. Thus, we can conclude that for listeners in this age range and with typical presbycusis audiograms, decreases in working memory capacity can be expected, although these decreases seem to be associated with hearing acuity in conditions of degradation of the sensory input, which require allocation of working memory, which, consequently, decreases the speech processing performance. Although hearing acuity loss at these ages could explain the decrease in both speech recognition and working memory capacity, this explanation should not rule out the possibility that beyond the age of 65, the involvement of cognitive factors might be greater. Previous studies have suggested that the role of cognitive deficits increases with age. The study by Divenyi et al. (2005) showed that the decline in speech processing is faster than the

decline in the hearing threshold from the seventh decade on. In addition, the studies by Lindenberger and Baltes (1994) and Baltes and Lindenberger (1997) showed that the sensory decline precedes, and might be the cause of, the cognitive decline in elderly subjects.

Re´sume´ Des jeunes participants avec des capacite´s auditives normales ainsi que des jeunes aıˆne´s aˆge´s de 55 a` 65 ans, dont les capacite´s auditives se situent entre une audition quasi normale et une perte mode´re´e de l’audition, ont e´te´ compare´s en fonction de diffe´rentes taˆches de reconnaissance du langage (reconnaissance de la consonne dans le silence et le bruit et phrases a` compression temporelle) et de taˆches de me´moire de travail (rappel se´riel de mots et mise en ordre de nombres). Les re´sultats ont montre´ que les jeunes aıˆne´s obtiennent des re´sultats infe´rieurs a` ceux des aıˆne´s aux taˆches de reconnaissance du langage et aux taˆches de me´moire de travail. Cependant, lorsque des seuils audiome´triques a` tonalite´ pure e´taient place´s en covariable, la diffe´rence significative entre les groupes disparaissait. Ces re´sultats appuient l’hypothe`se selon laquelle le de´clin sensoriel chez les jeunes aıˆne´s serait un facteur important afin d’expliquer la diminution des capacite´s de traitement du langage et de la me´moire de travail observe´e a` cet aˆge. Mots-cle´s : reconnaissance de la consonne, langage compresse´, me´moire de travail, personnes aˆge´es

References Alameda, J. R., & Cuetos, F. (1995). Diccionario de frecuencias de las unidades linguisticas del castellano. Oviedo: Servicio de publicaciones de la Universidad de Oviedo. American National Standards Institute. (1996). Specification for audiometers (ANSI S3.6 –1996). New York: Author. Amos, N. E., & Humes, L. E. (2007). Contribution of high frequencies to speech recognition in quiet and noise in listeners with varying degrees of high-frequency sensorineural hearing loss. Journal of Speech, Language, and Hearing Research, 50, 819 – 834. Baddeley, A. D. (1986). Working memory. New York: Oxford University Press. Baddeley, A. D., & Hitch, G. J. (1974). Working memory. In G. A. Bower (Ed.), Recent advances in learning and motivation (Vol. VIII, pp. 47–90). New York: Academic Press. Baltes, P. B., & Lindenberger, U. (1997). Emergence of a powerful connection between sensory and cognitive functions across the adult life span: A new window to the study of cognitive aging?. Psychology and Aging, 12, 12–21. Committee on Hearing, Bioacoustics, and Biomechanics. (1988). Speech understanding and aging. Journal of the Acoustical Society of America, 83, 859 – 895. Cooper, J. A., Sagar, H. J., Jordan, N., Harvey, N. S., & Sullivan, E. V. (1991). Cognitive impairment in early, untreated Parkinson Disease and its relationship to motor disability. Brain, 114, 2095–2122. Daneman, M., & Carpenter, P. A. (1980). Individual differences in working memory and reading. Journal of Verbal Learning and Verbal Behaviour, 19, 450 – 466. Davis, A. (1994). Hearing in adults. London: Whurr Publishers Ltd. Davis, A. C., Ostri. B., & Parving, A. (1991). Longitudinal study of hearing. Acta Oto-Laryngologica, 476(Suppl.), 12–22. Divenyi, P. L., & Haupt, K. M. (1997). Audiological correlates of speech understanding deficits in elderly listeners with mild-to-moderate hearing loss. II. Correlation analysis. Ear and Hearing, 18, 100 –113.

SPEECH RECOGNITION AND WORKING MEMORY Divenyi, P. L., Stark, P. B., & Haupt, K. M. (2005). Decline of speech understanding and auditory thresholds in the elderly. Journal of the Acoustical Society of America, 118, 1089 –1100. Dubno, J. R., Ahlstrom, J. B., & Horwitz, A. R. (2000). Use of context by young and aged adults with normal hearing. Journal of the Acoustical Society of America, 107, 538 –546. Frisina, D. R., & Frisina, R. D. (1997). Speech recognition in noise and presbycusis. Relations to possible neural mechanisms. Hearing Research, 106, 95–104. Gates, G. A., Cooper, J. C., Kannel, W. B., & Miller, N. J. (1990). Hearing in the elderly: The Framingham Cohort, 1983–1985. Part I: Basic audiometric tests results. Ear and Hearing, 11, 247–256. Gelfand, S. A., Piper, N., & Silman, S. (1985). Consonant recognition in quiet as a function of aging among normal hearing subjects. Journal of the Acoustical Society of America, 78, 1198 –1206. Gelfand, S. A., Piper, N., & Silman, S. (1986). Consonant recognition in quiet and in noise with aging among normal hearing listeners. Journal of the Acoustical Society of America, 80, 1589 –1598. Golom, J. D., Peelle, J. E., & Wingfield, A. (2007). Effects of stimulus variability and adult aging on adaptation to time-compressed speech. Journal of the Acoustical Society of America, 121, 1710 –1708. Gordon-Salant, S., & Fitzgibbons, P. J. (1993). Temporal factors and speech recognition performance in young and elderly listeners. Journal of Speech, Language, and Hearing Research, 36, 1276 –1285. Gordon-Salant, S., & Fitzgibbons, P. J. (1997). Selected cognitive factors and speech recognition performance among young and elderly listeners. Journal of Speech, Language, and Hearing Research, 40, 423– 431. Gordon-Salant, S., & Fitzgibbons, P. J. (1999). Profile of auditory temporal processing in older adults. Journal of Speech, Language, and Hearing Research, 42, 300 –311. Gordon-Salant, S., & Fitzgibbons, P. J. (2001). Sources of age-related recognition difficulty for time-compressed speech. Journal of Speech, Language, and Hearing Research, 44, 709 –719. Gordon-Salant, S., & Fitzgibbons, P. J. (2004). Effects of stimulus and noise rate variability on speech perception by younger and older adults. Journal of the Acoustical Society of America, 115, 1808 –1817. Gordon-Salant, S., Fitzgibbons, P. J., & Friedman, S. A. (2007). Recognition of time-compressed and natural speech with selective temporal enhancements by young and elderly listeners. Journal of Speech and Hearing Research, 50, 1181–1193. Ha¨llgren, M., Larsby, B., Lyxell, B., & Arlinger, S. (2005). Speech understanding in quiet and noise, with and without hearing aids. International Journal of Audiology, 44, 574 –583. Halling, D. C., & Humes, L. E. (2000). Factors affecting the recognition of reverberant speech by elderly listeners. Journal of Speech, Language, and Hearing Research, 43, 414 – 431. Helfer, K. S., & Huntley, R. A. (1991). Aging and consonant errors in reverberation and noise. Journal of the Acoustical Society of America, 90, 1786 –1796. Helfer, K. S., & Wilber, L. A. (1990). Hearing loss, aging, and speech perception in reverberation and noise. Journal of Speech, Language, and Hearing Research, 33, 149 –155. Humes, L. E. (1996). Speech understanding in the elderly. Journal of the American Academy of Audiology, 7, 161–167. Humes, L. E. (2002). Factors underlying the speech-recognition performance of elderly hearing-aid wearers. Journal of the Acoustical Society of America, 112, 1112–1132. Humes, L. E. (2005). Do “auditory processing” tests measure auditory processing in the elderly? Ear and Hearing, 26, 109 –119. Humes, L. E., Burk, M. H., Coughlin, M. P., Busey, T. A., & Strauser, L. E. (2007). Auditory speech recognition and visual text recognition in younger and older adults: Similarities and differences between modalities and the effects of presentation rate. Journal of Speech, Language, and Hearing Research, 50, 283–303.

225

Humes, L. E., & Christopherson, L. (1991). Speech identification of hearing-impaired persons. Journal of Speech and Hearing Research, 34, 686 – 693. Humes, L. E., Espinoza-Varas, B., & Watson, C. S. (1988). Modeling sensorineural hearing loss. I. Model and retrospective evaluation. Journal of the Acoustical Society of America, 83, 188 –202. Humes, L. E., & Floyd, S. S. (2005). Measures of working memory, sequence learning and speech recognition in the elderly. Journal of Speech, Language, and Hearing Research, 48, 224 –235. Humes, L. E., Watson, B. U., Christensen, L. A., Cokely, C. G., Halling, D. C., & Lee, L. (1994). Factors associated with individual differences in clinical measures of speech recognition among the elderly. Journal of Speech and Hearing Research, 37, 465– 474. Jenstad, L. M., & Souza, P. M. (2007). Temporal envelope changes of compression and speech rate: Combined effects on recognition for older adults. Journal of Speech, Language, and Hearing Research, 50, 1123– 1138. Jerger, J., Jerger, S., Oliver, T., & Pirozzolo, F. (1989). Speech understanding in the elderly. Ear and Hearing, 10, 79 – 89. Jerger, J., Jerger, S., & Pirozzolo, F. (1991). Correlational analysis of speech audiometric scores, hearing loss, age, and cognitive abilities in the elderly. Ear and Hearing, 12, 103–109. Kalikow, D. N., Stevens, K. N., & Eliot, L. L. (1977). Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability. Journal of the Acoustical Society of America, 61, 1337–1351. Kim, S., Frisina, R. D., Mapes, F. M., Hickman, E. D., & Frisina, D. R. (2006). Effects of age on binaural speech intelligibility in normal hearing adults. Speech Communication, 48, 591–597. Lee, L. W., & Humes, L. E. (1992). Factors associated with speechrecognition ability of the hearing impaired elderly. Asha, 34, 212. Lindenberg, U., & Baltes, P. B. (1994). Sensory functioning and intelligence in old age: A strong connection. Psychology and Aging, 9, 339 –355. MacDonald, M. C., & Christiansen, M. H. (2002). Reassessing working memory: A comment on Just & Carpenter (1992) and Waters & Caplan (1996). Psychological Review, 109, 35–54. Magnusson, L. (1996). Predictions the speech recognition performance of elderly individuals with sensorineural hearing impairment: A procedure based on the Speech Intelligibility Index. Scandinavian Audiology, 25, 215–222. Math Works, Inc. (1990). 386-Matlab user’s guide. South Natick, MA: Math Works, Inc. McCoy, S. L., Tun, P. A., Cox, L. C., Colangelo, M., Stewart, R. A., & Wingfield, A. (2005). Hearing loss and perceptual effort: Downstream effects on elder adults’ memory for speech. Quarterly Journal of Experimental Psychology, 58A, 22–23. Morrell, C. H., Gordon-Salant, S., Pearson, J. D., Brant, L. J., & Fozard, J. L. (1996). Age- and gender-specific reference ranges for hearing level and longitudinal changes in hearing level. Journal of the Acoustical Society of America, 100, 1949 –1967. Moulines, E., & Laroche, J. (1995). Non-parametric techniques for pitch scale and time-scale modification of speech. Speech Communication, 16, 175–205. Murphy, D. R., Craik, F. I. M., Li, K. Z. H., & Schneider, B. A. (2000). Comparing the effects of aging and background noise on short term memory performance. Psychology and Aging, 15, 323–334. Ohde, R., & Abou-Khalil, R. (2001). Age differences for stop-consonant and vowel perception in adults. Journal of the Acoustical Society of America, 110, 2156 –2166. Philips, S. L., Gordon-Salant, S., Fitzgibbons, P. J., & Yeni-Komshian, G. (2000). Frequency and temporal resolution in elderly listeners with good and poor word recognition. Journal of Speech, Language, and Hearing Research, 43, 217–228.

226

CERVERA, SOLER, DASI, AND RUIZ

Pichora-Fuller, M. K., Schneider, B. A., & Daneman, M. (1995). How young and old adults listen to and remember speech in noise. Journal of the Acoustical Society of America, 97, 593– 608. Plomp, R. (1986). A signal-to noise ratio model for the speech reception threshold of the hearing impaired. Journal of the Speech and Hearing Research, 29, 146 –154. Rabbit, P. M. A. (1968). Channel capacity, intelligibility and immediate memory. Quarterly Journal of Experimental Psychology, 20, 241–248. Rabbit, P. M. A. (1991). Mild hearing loss can cause apparent memory failures with increase with age and reduce with IQ. Acta Otolaryngologica, 476(Suppl.), 167–176. Salthouse, T. A. (1991). Theoretical perspectives on aging. Hillsdale, NY: Erlbaum. Schneider, B. A., Daneman, M., & Murphy, D. R. (2005). Speech comprehension difficulties in older adults: Cognitive slowing or age related changes in hearing. Psychology and Aging, 20, 261–271. Schneider, B. A., Daneman, M., & Pichora-Fuller, M. K. (2002). Listening in aging adults. From discourse to psychoacoustics. Canadian Journal of Experimental Psychology, 56, 139 –152. Schneider, B. A., & Pichora-Fuller, M. K. (2000). Implications of perceptual deterioration for cognitive aging research. In F. I. M. Craik & T. A. Salthouse (Eds.), Handbook of aging and cognition (2nd ed., pp. 155– 220). Mahwah, NJ: Erlbaum. Shebercoe, R. L., & Studebaker, G. A. (2003). Audibility-index predictions of normal-hearing and hearing-impaired listeners’ performance on the connected speech test. Ear and Hearing, 24, 71– 88. Snell, K. B., Mapes, F. M., Hickman, E. D., & Frisina, R. (2002). Word recognition in competing babble and the effects of age, temporal processing, and absolute sensitivity. Journal of the Acoustical Society of America, 112, 720 –727. Souza, P. E., Boike, K. T., Witherell, K., & Tremblay, K. (2007). Prediction of speech recognition from audibility in older listeners with hearing loss: Effects of age, amplification, and background noise. Journal of the American Academy of Audiology, 18, 54 – 65. Souza, P. E., & Kitch, V. (2001). The contribution of amplitude envelope cues to sentence identification in young and aged listeners. Ear and Hearing, 22, 112–119. Tun, P. A. (1998). Fast noisy speech: Age differences in processing rapid speech with background noise. Psychology and Aging, 13, 424 – 434. Van Rooij, J. C. G. M., & Plomp, R. (1990). Auditive and cognitive factors in speech perception by elderly listeners II: Multivariate analysis. Journal of the Acoustical Society of America, 88, 2611–2624. Van Rooij, J. C. G. M., & Plomp, R. (1992). Auditive and cognitive factors in speech perception by elderly listeners III: Additional data and final

discussion. Journal of the Acoustical Society of America, 91, 1028 – 1033. Van Rooij, J. C. G. M., Plomp, R., & Oberleke, J. F. (1989). Auditive and cognitive factors in speech perception by elderly listeners I: Development of test battery. Journal of the Acoustical Society of America, 86, 1294 –1309. Vaugham, N. E., Furukawa, I., Balasinham, N., Mortz, M., & Fausti, S. A. (2002). Time-expanded speech and speech recognition in older adults. Journal of Rehabilitation Research and Development, 39, 559 –566. Vaugham, N. E., & Letowsky, T. (1997). Effects of age, speech rate, and type of test on temporal auditory processing. Journal of Speech, Language, and Hearing Research, 40, 1192–1200. Versfeld, N. J., & Dreschler, W. A. (2001). The relationship between the intelligibility of time-compressed speech and speech in noise in young and elderly listeners. Journal of the Acoustical Society of America, 111, 401– 408. Willot, J. F. (1991). Aging and the Auditory System: Anatomy, physiology, and psychophysics. San Diego, CA: Singular. Wingfield, A., Aberdeen, J. J., & Stine, E. A. L. (1991). Word onset gating and linguistic context in spoken word recognition by young and elderly adults. Journal of Gerontology: Psychological Sciences, 46, 127–129. Wingfield, A., Lindfield, K. C., & Goodglass, H. (2000). Effects of age and hearing sensitivity on the use of prosodic information in spoken word recognition. Journal of Speech, Language, and Hearing Research, 43, 915–925. Wingfield, A., Peelle, J. E., & Grossman, M. (2003). Speech rate and syntactic complexity as multiplicative factors in speech comprehension by young and older adults. Aging, Neuropsychology, and Cognition, 10, 310 –322. Wingfield, A., & Stine-Morrow, E. A. L. (2000). Language and speech. In F. I. M. Craik & T. A. Salthouse (Eds.), Handbook of aging and cognition (2nd ed., pp. 359 – 416). Mahwah, NJ: Erlbaum. Wingfield, A., Stine-Morrow, E. A. L., Lahar, C. J., & Aberdeen, J. S. (1988). Does the capacity of working memory change with age?. Experimental Aging Research, 14, 103–107. Wingfield, A., Tun, P. A., Koh, C. K., & Rosen, M. J. (1999). Regaining lost time: Adult aging and the effect of time restoration on recall of time-compressed speech. Psychology and Aging, 14, 380 –389. Wingfield, A., Tun, P. A., & McCoy, S. L. (2005). Hearing loss in older adulthood. What it is and how it interacts with cognitive performance. Current Directions in Psychological Science, 14, 144 –148.

Received September 18, 2007 Accepted October 8, 2008 䡲