Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters











Database
Language
Publication year range
1.
J Acoust Soc Am ; 110(1): 425-40, 2001 Jul.
Article in English | MEDLINE | ID: mdl-11508968

ABSTRACT

Although both perceived vocal effort and intensity are known to influence the perceived distance of speech, little is known about the processes listeners use to integrate these two parameters into a single estimate of talker distance. In this series of experiments, listeners judged the distances of prerecorded speech samples presented over headphones in a large open field. In the first experiment, virtual synthesis techniques were used to simulate speech signals produced by a live talker at distances ranging from 0.25 to 64 m. In the second experiment, listeners judged the apparent distances of speech stimuli produced over a 60-dB range of different vocal effort levels (production levels) and presented over a 34-dB range of different intensities (presentation levels). In the third experiment, the listeners judged the distances of time-reversed speech samples. The results indicate that production level and presentation level influence distance perception differently for each of three distinct categories of speech. When the stimulus was high-level voiced speech (produced above 66 dB SPL 1 m from the talker's mouth), the distance judgments doubled with each 8-dB increase in production level and each 12-dB decrease in presentation level. When the stimulus was low-level voiced speech (produced at or below 66 dB SPL at 1 m), the distance judgments doubled with each 15-dB increase in production level but were relatively insensitive to changes in presentation level at all but the highest intensity levels tested. When the stimulus was whispered speech, the distance judgments were unaffected by changes in production level and only decreased with increasing presentation level when the intensity of the stimulus exceeded 66 dB SPL. The distance judgments obtained in these experiments were consistent across a range of different talkers, listeners, and utterances, suggesting that voice-based distance cueing could provide a robust way to control the apparent distances of speech sounds in virtual audio displays.


Subject(s)
Distance Perception , Loudness Perception , Sound Localization , Speech Acoustics , Speech Perception , Adult , Female , Humans , Male , Middle Aged , Psychoacoustics , Voice Quality
3.
J Acoust Soc Am ; 109(3): 1101-9, 2001 Mar.
Article in English | MEDLINE | ID: mdl-11303924

ABSTRACT

Although most recent multitalker research has emphasized the importance of binaural cues, monaural cues can play an equally important role in the perception of multiple simultaneous speech signals. In this experiment, the intelligibility of a target phrase masked by a single competing masker phrase was measured as a function of signal-to-noise ratio (SNR) with same-talker, same-sex, and different-sex target and masker voices. The results indicate that informational masking, rather than energetic masking, dominated performance in this experiment. The amount of masking was highly dependent on the similarity of the target and masker voices: performance was best when different-sex talkers were used and worst when the same talker was used for target and masker. Performance did not, however, improve monotonically with increasing SNR. Intelligibility generally plateaued at SNRs below 0 dB and, in some cases, intensity differences between the target and masking voices produced substantial improvements in performance with decreasing SNR. The results indicate that informational and energetic masking play substantially different roles in the perception of competing speech messages.


Subject(s)
Perceptual Masking , Speech Perception/physiology , Verbal Behavior , Adult , Color Perception , Female , Humans , Male , Middle Aged , Time Factors
4.
J Acoust Soc Am ; 110(5 Pt 1): 2527-38, 2001 Nov.
Article in English | MEDLINE | ID: mdl-11757942

ABSTRACT

Although many researchers have examined the role that binaural cues play in the perception of spatially separated speech signals, relatively little is known about the cues that listeners use to segregate competing speech messages in a monaural or diotic stimulus. This series of experiments examined how variations in the relative levels and voice characteristics of the target and masking talkers influence a listener's ability to extract information from a target phrase in a 3-talker or 4-talker diotic stimulus. Performance in this speech perception task decreased systematically when the level of the target talker was reduced relative to the masking talkers. Performance also generally decreased when the target and masking talkers had similar voice characteristics: the target phrase was most intelligible when the target and masking phrases were spoken by different-sex talkers, and least intelligible when the target and masking phrases were spoken by the same talker. However, when the target-to-masker ratio was less than 3 dB, overall performance was usually lower with one different-sex masker than with all same-sex maskers. In most of the conditions tested, the listeners performed better when they were exposed to the characteristics of the target voice prior to the presentation of the stimulus. The results of these experiments demonstrate how monaural factors may play an important role in the segregation of speech signals in multitalker environments.


Subject(s)
Attention , Perceptual Masking , Speech Perception , Adult , Auditory Threshold , Dichotic Listening Tests , Female , Humans , Male , Middle Aged , Reference Values , Sound Spectrography , Speech Acoustics
5.
Percept Psychophys ; 62(1): 48-65, 2000 Jan.
Article in English | MEDLINE | ID: mdl-10703255

ABSTRACT

Four response methods for indicating the perceived locations of nearby objects were evaluated: the direct-location (DL) method, where a response pointer is moved directly to the perceived location of the target; the large-head (LH) and small-head (SH) methods, where the pointer is moved to the target location relative to a full-scale or half-scale manikin head; and the verbal report (VR) method, where the spherical coordinates of the target location are indicated verbally. Measurements with a visual target indicated that the DL method was relatively unbiased and considerably more accurate than the other methods, which were all roughly equivalent. Correcting for bias improved accuracy in the LH, SH, and VR responses, but not to the level of the uncorrected DL responses. Replacing the visual target with an acoustic stimulus approximately doubled the errors with the DL response but indicated similar performance in the front and rear hemispheres. The results suggest that DL is the most appropriate response method for close-range localization experiments.


Subject(s)
Visual Perception/physiology , Adult , Auditory Perception/physiology , Female , Humans , Male , Reaction Time
6.
J Acoust Soc Am ; 106(4 Pt 1): 1956-68, 1999 Oct.
Article in English | MEDLINE | ID: mdl-10530020

ABSTRACT

Although many researchers have examined auditory localization for relatively distant sound sources, little is known about the spatial perception of nearby sources. In the region within 1 m of a listener's head, defined as the "proximal region," the interaural level difference increases dramatically as the source approaches the head, while the interaural time delay is roughly independent of distance. An experiment has been performed to evaluate proximal-region localization performance. An auditory point source was moved to a random position within 1 m of the subject's head, and the subject responded by pointing to the perceived location of the sound with an electromagnetic position sensor. The overall angular error (17 degrees) was roughly comparable to previously measured results in distal-region experiments. Azimuth error increased slightly as the sound source approached the head, but elevation performance was essentially independent of source distance. Distance localization performance was generally better than has been reported in distal-region experiments and was strongly dependent on azimuth, with the stimulus-response correlation ranging from 0.85 to the side of the head to less than 0.4 in the median plane. The results suggest that the enlarged binaural difference cues found in the head-related transfer function (HRTF) for nearby sources are important to auditory distance perception in the proximal region.


Subject(s)
Auditory Perception/physiology , Sound Localization/physiology , Adult , Humans , Male , Psychoacoustics
7.
J Acoust Soc Am ; 106(3 Pt 1): 1465-79, 1999 Sep.
Article in English | MEDLINE | ID: mdl-10489704

ABSTRACT

Although researchers have long recognized the unique properties of the head-related transfer function (HRTF) for nearby sources (within 1 m of the listener's head), virtually all of the HRTF measurements described in the literature have focused on source locations 1 m or farther from the listener. In this study, HRTFs for sources at distances from 0.12 to 1 m were calculated using a rigid-sphere model of the head and measured using a Knowles Electronic Manikin for Acoustic Research (KEMAR) and an acoustic point source. Both the calculations and the measurements indicate that the interaural level difference (ILD) increases substantially for lateral sources as distance decreases below 1 m, even at low frequencies where the ILD is small for distant sources. In contrast, the interaural time delay (ITD) is roughly independent of distance even when the source is close. The KEMAR measurements indicate that the direction of the source relative to the outer ear plays an important role in determining the high-frequency response of the HRTF in the horizontal plane. However, the elevation-dependent characteristics of the HRTFs are not strongly dependent on distance, and the contribution of the pinna to the HRTF is independent of distance beyond a few centimeters from the ear. Overall, the results suggest that binaural cues play an important role in auditory distance perception for nearby sources.


Subject(s)
Ear Canal , Sound Localization , Dichotic Listening Tests , Functional Laterality , Humans , Manikins , Models, Theoretical , Pitch Perception , Psychoacoustics
8.
J Acoust Soc Am ; 106(6): 3589-602, 1999 Dec.
Article in English | MEDLINE | ID: mdl-10615699

ABSTRACT

A series of experiments has examined the auditory localization of a nearby (< 1 m) sound source under four conditions: (1) a fixed-amplitude condition where loudness-based distance cues were available; (2) a monaural condition where the contralateral ear was occluded by an ear-plug and muff; (3) a high-pass condition where the stimulus bandwidth was 3 Hz to 15 kHz; and (4) a low-pass condition where the stimulus bandwidth was 200 Hz to 3 kHz. The results of these experiments were compared to those of a previous experiment that measured localization performance for a nearby broadband, random-amplitude source [Brungart et al., J. Acoust. Soc. Am. 106, 1956-1968 (1999)]. Directional localization performance in each condition was consistent with the results of previous far-field localization experiments. Distance localization accuracy improved slightly in the fixed-amplitude condition relative to the earlier broadband random-amplitude experiment, especially near the median plane, but was severely degraded in the monaural condition. Distance accuracy was also found to be highly dependent on the low-frequency energy of the stimulus: in the low-pass condition, distance accuracy was similar to that in the broadband condition, while in the high-pass condition, distance accuracy was significantly reduced. The results suggest that low-frequency interaural level differences are the dominant auditory distance cue in the proximal region.


Subject(s)
Auditory Perception/physiology , Sound Localization/physiology , Humans
SELECTION OF CITATIONS
SEARCH DETAIL