Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 37
Filter
Add more filters










Publication year range
1.
Exp Brain Res ; 221(4): 403-12, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22821079

ABSTRACT

The integration of visual and auditory inputs in the human brain occurs only if the components are perceived in temporal proximity, that is, when the intermodal time difference falls within the so-called subjective synchrony range. We used the midpoint of this range to estimate the point of subjective simultaneity (PSS). We measured the PSS for audio-visual (AV) stimuli in a synchrony judgment task, in which subjects had to judge a given AV stimulus using three response categories (audio first, synchronous, video first). The relevant stimulus manipulation was the duration of the auditory and visual components. Results for unimodal auditory and visual stimuli have shown that the perceived onset shifts to relatively later positions with increasing stimulus duration. These unimodal shifts should be reflected in changing PSS values, when AV stimuli with different durations of the auditory and visual components are used. The results for 17 subjects showed indeed a significant shift of the PSS for different duration combinations of the stimulus components. Because the shifts were approximately equal for duration changes in either of the components, no net shift of the PSS was observed as long as the durations of the two components were equal. This result indicates the need to appropriately account for unimodal timing effects when quantifying intermodal synchrony perception.


Subject(s)
Auditory Perception/physiology , Time Perception/physiology , Visual Perception/physiology , Acoustic Stimulation/methods , Adult , Aged , Feedback, Sensory/physiology , Female , Humans , Male , Middle Aged , Photic Stimulation/methods , Reaction Time/physiology , Time Factors , Young Adult
2.
J Acoust Soc Am ; 110(2): 1074-88, 2001 Aug.
Article in English | MEDLINE | ID: mdl-11519576

ABSTRACT

This article presents a quantitative binaural signal detection model which extends the monaural model described by Dau et al. [J. Acoust. Soc. Am. 99, 3615-3622 (1996)]. The model is divided into three stages. The first stage comprises peripheral preprocessing in the right and left monaural channels. The second stage is a binaural processor which produces a time-dependent internal representation of the binaurally presented stimuli. This stage is based on the Jeffress delay line extended with tapped attenuator lines. Through this extension, the internal representation codes both interaural time and intensity differences. In contrast to most present-day models, which are based on excitatory-excitatory interaction, the binaural interaction in the present model is based on contralateral inhibition of ipsilateral signals. The last stage, a central processor, extracts a decision variable that can be used to detect the presence of a signal in a detection task, but could also derive information about the position and the compactness of a sound source. In two accompanying articles, the model predictions are compared with data obtained with human observers in a great variety of experimental conditions.


Subject(s)
Auditory Perception/physiology , Dichotic Listening Tests , Functional Laterality/physiology , Models, Neurological , Neural Inhibition/physiology , Sound Localization/physiology , Attention/physiology , Cochlear Nerve/physiology , Humans , Psychoacoustics
3.
J Acoust Soc Am ; 110(2): 1089-104, 2001 Aug.
Article in English | MEDLINE | ID: mdl-11519577

ABSTRACT

This and two accompanying articles [Breebaart et al., J. Acoust. Soc. Am. 110, 1074-1088 (2001); 110, 1105-1117 (2001)] describe a computational model for the signal processing in the binaural auditory system. The model consists of several stages of monaural and binaural preprocessing combined with an optimal detector. In the present article the model is tested and validated by comparing its predictions with experimental data for binaural discrimination and masking conditions as a function of the spectral parameters of both masker and signal. For this purpose, the model is used as an artificial observer in a three-interval, forced-choice adaptive procedure. All model parameters were kept constant for all simulations described in this and the subsequent article. The effects of the following experimental parameters were investigated: center frequency of both masker and target, bandwidth of masker and target, the interaural phase relations of masker and target, and the level of the masker. Several phenomena that occur in binaural listening conditions can be accounted for. These include the wider effective binaural critical bandwidth observed in band-widening NoS(pi) conditions, the different masker-level dependence of binaural detection thresholds for narrow- and for wide-band maskers, the unification of IID and ITD sensitivity with binaural detection data, and the dependence of binaural thresholds on frequency.


Subject(s)
Dichotic Listening Tests , Functional Laterality/physiology , Neural Inhibition/physiology , Perceptual Masking/physiology , Pitch Discrimination/physiology , Auditory Threshold/physiology , Cochlear Nerve/physiology , Humans , Psychoacoustics , Reference Values , Sound Spectrography
4.
J Acoust Soc Am ; 110(2): 1105-17, 2001 Aug.
Article in English | MEDLINE | ID: mdl-11519578

ABSTRACT

This paper and two accompanying papers [Breebaart et al., J. Acoust. Soc. Am. 110, 1074-1088 (2001); 110, 1089-1104 (2001)] describe a computational model for the signal processing of the binaural auditory system. The model consists of several stages of monaural and binaural preprocessing combined with an optimal detector. Simulations of binaural masking experiments were performed as a function of temporal stimulus parameters and compared to psychophysical data adapted from literature. For this purpose, the model was used as an artificial observer in a three-interval, forced-choice procedure. All model parameters were kept constant for all simulations. Model predictions were obtained as a function of the interaural correlation of a masking noise and as a function of both masker and signal duration. Furthermore, maskers with a time-varying interaural correlation were used. Predictions were also obtained for stimuli with time-varying interaural time or intensity differences. Finally, binaural forward-masking conditions were simulated. The results show that the combination of a temporal integrator followed by an optimal detector in the time domain can account for all conditions that were tested, except for those using periodically varying interaural time differences (ITDs) and those measuring interaural correlation just-noticeable differences (jnd's) as a function of bandwidth.


Subject(s)
Dichotic Listening Tests , Functional Laterality/physiology , Models, Neurological , Neural Inhibition/physiology , Perceptual Masking/physiology , Pitch Discrimination/physiology , Time Perception/physiology , Auditory Threshold/physiology , Cochlear Nerve/physiology , Differential Threshold/physiology , Humans , Psychoacoustics , Sound Spectrography
5.
J Acoust Soc Am ; 109(5 Pt 1): 2085-97, 2001 May.
Article in English | MEDLINE | ID: mdl-11386560

ABSTRACT

Both in speech synthesis and in sound coding it is often beneficial to have a measure that predicts whether, and to what extent, two sounds are different. This paper addresses the problem of estimating the perceptual effects of small modifications to the spectral envelope of a harmonic sound. A recently proposed auditory model is investigated that transforms the physical spectrum into a pattern of specific loudness as a function of critical band rate. A distance measure based on the concept of partial loudness is presented, which treats detectability in terms of a partial loudness threshold. This approach is adapted to the problem of estimating discrimination thresholds related to modifications of the spectral envelope of synthetic vowels. Data obtained from subjective listening tests using a representative set of stimuli in a 3IFC adaptive procedure show that the model makes reasonably good predictions of the discrimination threshold. Systematic deviations from the predicted thresholds may be related to individual differences in auditory filter selectivity. The partial loudness measure is compared with previously proposed distance measures such as the Euclidean distance between excitation patterns and between specific loudness applied to the same experimental data. An objective test measure shows that the partial loudness measure and the Euclidean distance of the excitation patterns are equally appropriate as distance measures for predicting audibility thresholds. The Euclidean distance between specific loudness is worse in performance compared with the other two.


Subject(s)
Auditory Threshold/physiology , Speech Perception/physiology , Humans , Loudness Perception/physiology , Noise , Perceptual Masking/physiology , Phonetics , Psychoacoustics , Speech/physiology
6.
J Acoust Soc Am ; 109(1): 331-45, 2001 Jan.
Article in English | MEDLINE | ID: mdl-11206162

ABSTRACT

This paper investigated the influence of stimulus uncertainty in binaural detection experiments and the predictions of several binaural models for such conditions. Masked thresholds of a 500-Hz sinusoid were measured in an NrhoSpi condition for both running and frozen-noise maskers using a three interval, forced-choice (3IFC) procedure. The nominal masker correlation varied between 0.64 and 1, and the bandwidth of the masker was either 10, 100, or 1,000 Hz. The running-noise thresholds were expected to be higher than the frozen-noise thresholds because of stimulus uncertainty in the running-noise conditions. For an interaural correlation close to +1, no difference between frozen-noise and running-noise thresholds was expected for all values of the masker bandwidth. These expectations were supported by the experimental data: for interaural correlations less than 1.0, substantial differences between frozen and running-noise conditions were observed for bandwidths of 10 and 100 Hz. Two additional conditions were tested to further investigate the influence of stimulus uncertainty. In the first condition a different masker sample was chosen on each trial, but the correlation of the masker was forced to a fixed value. In the second condition one of two independent frozen-noise maskers was randomly chosen on each trial. Results from these experiments emphasized the influence of stimulus uncertainty in binaural detection tasks: if the degree of uncertainty in binaural cues was reduced, thresholds decreased towards thresholds in the conditions without any stimulus uncertainty. In the analysis of the data, stimulus uncertainty was expressed in terms of three theories of binaural processing: the interaural correlation, the EC theory, and a model based on the processing of interaural intensity differences (IIDs) and interaural time differences (ITDs). This analysis revealed that none of the theories tested could quantitatively account for the observed thresholds. In addition, it was found that, in conditions with stimulus uncertainty, predictions based on correlation differ from those based on the EC theory.


Subject(s)
Attention , Dichotic Listening Tests , Sound Localization , Auditory Threshold , Humans , Loudness Perception , Perceptual Masking , Psychoacoustics , Sound Spectrography
7.
J Acoust Soc Am ; 108(2): 723-34, 2000 Aug.
Article in English | MEDLINE | ID: mdl-10955639

ABSTRACT

This paper is concerned with modulation and beat detection for sinusoidal carriers. In the first experiment, temporal modulation transfer functions (TMTFs) were measured for carrier frequencies between 1 and 10 kHz. Modulation rates covered the range from 10 Hz to about the rate equaling the critical bandwidth at the carrier frequency. In experiment 2, TMTFs for three carrier frequencies were obtained as a function of the carrier level. In the final experiment, thresholds for the detection of either the lower or the upper modulation sideband (beat detection) were measured for "carrier" frequencies of 5 and 10 kHz, using the same range of modulation rates as in experiment 1. The TMTFs for carrier frequencies of 2 kHz and higher remained flat up to a modulation rate of about 100-130 Hz and had similar values across carrier frequencies. For higher rates, modulation thresholds initially increased and then decreased rapidly, reflecting the subjects' ability to resolve the sidebands spectrally. Detection thresholds generally improved with increasing carrier level, but large variations in the exact level dependence were observed, across subjects as well as across carrier frequencies. For beat rates up to about 70 Hz (at 5 kHz) and 100 Hz (at 10 kHz), beat detection thresholds were the same for the upper and the lower sidebands and were about 6 dB higher than the level per sideband at the modulation-detection threshold. At higher rates the threshold for both sidebands increased, but the increase was larger for the lower sideband. This reflects an asymmetry in masking with more masking towards lower frequencies. Only at rates well beyond the maximum of the TMTF did detection for the lower sideband start to be better than that for the upper sideband. The asymmetry at intermediate frequency separations can be explained by assuming that detection always takes place in filters centered above the stimulus spectrum. The shape of the TMTF and the beat-detection data reflects a limitation in resolving fast amplitude variations, which must occur central to the inner-ear filtering. Its characteristic resembles that of a first-order low-pass filter with a cutoff frequency of about 150 Hz.

8.
J Acoust Soc Am ; 106(5): 2752-60, 1999 Nov.
Article in English | MEDLINE | ID: mdl-10573891

ABSTRACT

A model is presented which calculates the intrinsic envelope power of a bandpass noise carrier within the passband of a hypothetical modulation filter tuned to a specific modulation frequency. Model predictions are compared to experimentally obtained amplitude modulation (AM) detection thresholds. In experiment 1, thresholds for modulation rates of 5, 25, and 100 Hz imposed on a bandpass Gaussian noise carrier with a fixed upper cutoff frequency of 6 kHz and a bandwidth in the range from 1 to 6000 Hz were obtained. In experiment 2, three noises with different spectra of the intrinsic fluctuations served as the carrier: Gaussian noise, multiplied noise, and low-noise noise. In each case, the carrier was spectrally centered at 5 kHz and had a bandwidth of 50 Hz. The AM detection thresholds were obtained for modulation frequencies of 10, 20, 30, 50, 70, and 100 Hz. The intrinsic envelope power of the carrier at the output of the modulation filter tuned to the signal modulation frequency appears to provide a good estimate for AM detection threshold. The results are compared with predictions on the basis of the more complex auditory processing model by Dau et al.


Subject(s)
Auditory Threshold/physiology , Noise , Signal Detection, Psychological/physiology , Humans , Models, Biological
9.
J Acoust Soc Am ; 106(4 Pt 1): 1940-7, 1999 Oct.
Article in English | MEDLINE | ID: mdl-10530018

ABSTRACT

Thresholds for sinusoidal signals masked by noise of various bandwidths were obtained for three binaural configurations: N0S0 (both masker and signal interaurally in phase), N0S pi (masker interaurally in phase and signal interaurally phase-reversed), and N pi S0 (masker interaurally phase-reversed and signal interaurally in phase). Signal frequencies of 125, 250, 500, 1000, 2000, and 4000 Hz were combined with masker bandwidths of 5, 10, 25, 50, 100, 250, 500, 1000, 2000, 4000, and 8000 Hz, with the restriction that masker bandwidths never exceeded twice the signal frequency. The overall noise power was kept constant at 70 dB SPL for all bandwidths. Results, expressed as signal-to-total-noise power ratios, show that N0S0 thresholds generally decrease with increasing bandwidth, even for subcritical bandwidths. Only at frequencies of 2 and 4 kHz do thresholds appear to remain constant for bandwidths around the critical bandwidth. N0S pi thresholds are generally less dependent on bandwidth up to two or three times the (monaural) critical bandwidth. Beyond this bandwidth, thresholds decrease with a similar slope as for the N0S0 condition. N pi S0 conditions show about the same bandwidth dependence as N0S pi, but thresholds in the former condition are generally higher. This threshold difference is largest at low frequencies and disappears above 2 kHz. An explanation for wider operational binaural critical bandwidth is given which assumes that binaural disparities are combined across frequency in an optimally weighted way.


Subject(s)
Auditory Perception/physiology , Hearing/physiology , Perceptual Masking , Auditory Threshold/physiology , Humans
10.
J Acoust Soc Am ; 106(2): 979-92, 1999 Aug.
Article in English | MEDLINE | ID: mdl-10462803

ABSTRACT

This paper investigates the relative contribution of various interaural cues to binaural unmasking in conditions with an interaurally in-phase masker and an out-of-phase signal (MoS pi). By using a modified version of multiplied noise as the masker and a sinusoid as the signal, conditions with only interaural intensity differences (IIDs), only interaural time differences (ITDs), or combinations of the two were realized. In addition, the experimental procedure allowed the presentation of specific combinations of static and dynamically varying interaural differences. In these conditions with multiplied noise as masker, the interaural differences have a bimodal distribution with a minimum at zero IID or ITD. Additionally, by using the sinusoid as masker and the multiplied noise as signal, a unimodal distribution of the interaural differences was realized. Through this variation in the shape of the distributions, the close correspondence between the change in the interaural cross correlation and the size of the interaural differences is no longer found, in contrast to the situation for a Gaussian-noise masker [Domnitz and Colburn, J. Acoust. Soc. Am. 59, 598-601 (1976)]. When analyzing the mean thresholds across subjects, the experimental results could not be predicted from parameters of the distributions of the interaural differences (the mean, the standard deviation, or the root-mean-square value). A better description of the subjects' performance was given by the change in the interaural correlation, but this measure failed in conditions which produced a static interaural intensity difference. The data could best be described by using the energy of the difference signal as the decision variable, an approach similar to that of the equalization and cancellation model.


Subject(s)
Auditory Perception/physiology , Noise , Auditory Threshold/physiology , Humans , Models, Theoretical , Normal Distribution , Perceptual Masking , Time Factors
11.
J Acoust Soc Am ; 105(1): 522-35, 1999 Jan.
Article in English | MEDLINE | ID: mdl-9921675

ABSTRACT

This article presents the results of listening experiments and psychoacoustical modeling aimed at evaluating the pitch synchronous overlap-and-add (PSOLA) technique. This technique can be used for simultaneous modification of pitch and duration of natural speech, using simple and efficient time-domain operations on the speech waveform. The first set of experiments tested the ability of subjects to discriminate double-formant stimuli, modified in fundamental frequency using PSOLA, from unmodified stimuli. Of the potential auditory discrimination cues induced by PSOLA, cues from the first formant were found to generally dominate discrimination performance. In the second set of experiments the influence of vocal perturbation, i.e., jitter and shimmer, on discriminability of PSOLA-modified single-formant stimuli was determined. The data show that discriminability deteriorates at most modestly in the presence of jitter and shimmer. With the exception of a few conditions, the trends in these data could be replicated by either using a modulation-discrimination or an intensity-discrimination model, dependent on the formant frequency. As a baseline experiment detection thresholds for jitter and shimmer were measured. Thresholds for jitter could be replicated by using either the modulation-discrimination or the intensity-discrimination model, dependent on the (mean) fundamental frequency of stimuli. The thresholds for shimmer could be accurately predicted for stimuli with a 250-Hz fundamental, but less accurately in the case of a 100-Hz fundamental.


Subject(s)
Pitch Discrimination/physiology , Psychoacoustics , Speech Perception/physiology , Vocal Cords/physiology , Adult , Auditory Threshold , Humans
12.
J Acoust Soc Am ; 103(4): 2100-10, 1998 Apr.
Article in English | MEDLINE | ID: mdl-9566331

ABSTRACT

Detection thresholds were measured with a multiplied-noise masker that was in phase in both ears and a sinusoidal signal which was either in phase or out of phase (NoSo and NoS pi conditions). The masker was generated by multiplying a low-pass noise with a sinusoidal carrier. The signal was a sinusoid with the same frequency as the carrier and a constant phase offset, theta, with respect to the carrier. By adjusting the phase offset, the stimulus properties were varied in such a way that only interaural time delays (theta = pi/2) or interaural intensity differences (theta = 0) were present within the NoS pi stimulus. Thresholds were measured at a center frequency of 4 kHz as a function of bandwidth for theta = pi/2 and for theta = 0. In a second experiment thresholds were measured for a bandwidth of 25 Hz as a function of the center frequency. The results show that narrow-band BMLDs at 4 kHz can amount to 30 dB for the theta = 0 condition. For this condition, narrow-band BMLDs are also reasonably constant across frequency, in contrast to results obtained with standard Gaussian-noise maskers. For theta = pi/2, BMLDs are restricted to the frequency region below 2 kHz provided that the masker is narrow band, but BMLDs of up to 15 dB are found at 4 kHz if the masker is 50 Hz or wider. The frequency dependence of the binaural thresholds seems to be best explained by assuming that the stimulus waveforms are compressed before binaural interaction.


Subject(s)
Auditory Perception/physiology , Dichotic Listening Tests/methods , Noise , Perceptual Masking/physiology , Auditory Threshold , Humans , Models, Biological , Normal Distribution
13.
J Acoust Soc Am ; 103(3): 1573-9, 1998 Mar.
Article in English | MEDLINE | ID: mdl-9514021

ABSTRACT

Release of masking for a sinusoidal signal of 5 kHz masked by a 25-Hz-wide noise band centered around 5 kHz was measured. The masking release was provided by a second noise band that was comodulated with the on-frequency masker band. For CMR configurations the second noise band was centered at 3 kHz and presented to the ipsi-lateral or to the contra-lateral ear. For BMLD configurations the second band was centered at 5 kHz and presented to the contra-lateral ear. In another condition the second noise band also contained the signal presented with such a phase that maximal differences in the envelope resulted. For both the CMR and the BMLD paradigm, the masking release for the latter condition was larger than for the former condition. To assess further the similarity between monaural and binaural masking release, a sinusoidal masker and either a noise or a sinusoidal signal were used. The data indicate that, at high frequencies, envelope correlation may be a valuable cue for CMR as well as for the BMLD.


Subject(s)
Auditory Perception/physiology , Perceptual Masking , Humans , Noise
14.
J Acoust Soc Am ; 102(5 Pt 1): 2892-905, 1997 Nov.
Article in English | MEDLINE | ID: mdl-9373976

ABSTRACT

This paper presents a quantitative model for describing data from modulation-detection and modulation-masking experiments, which extends the model of the "effective" signal processing of the auditory system described in Dau et al. [J. Acoust. Soc. Am. 99, 3615-3622 (1996)]. The new element in the present model is a modulation filterbank, which exhibits two domains with different scaling. In the range 0-10 Hz, the modulation filters have a constant bandwidth of 5 Hz. Between 10 Hz and 1000 Hz a logarithmic scaling with a constant Q value of 2 was assumed. To preclude spectral effects in temporal processing, measurements and corresponding simulations were performed with stochastic narrow-band noise carriers at a high center frequency (5 kHz). For conditions in which the modulation rate (fmod) was smaller than half the bandwidth of the carrier (delta f), the model accounts for the low-pass characteristic in the threshold functions [e.g., Viemeister, J. Acoust. Soc. Am. 66, 1364-1380 (1979)]. In conditions with fmod > delta f/2, the model can account for the high-pass characteristic in the threshold function. In a further experiment, a classical masking paradigm for investigating frequency selectivity was adopted and translated to the modulation-frequency domain. Masked thresholds for sinusoidal test modulation in the presence of a competing modulation masker were measured and simulated as a function of the test modulation rate. In all cases, the model describes the experimental data to within a few dB. It is proposed that the typical low-pass characteristic of the temporal modulation transfer function observed with wide-band noise carriers is not due to "sluggishness" in the auditory system, but can instead be understood in terms of the interaction between modulation filters and the inherent fluctuations in the carrier.


Subject(s)
Auditory Perception , Perceptual Masking , Signal Detection, Psychological , Acoustics , Auditory Threshold , Humans , Noise
15.
J Acoust Soc Am ; 102(5 Pt 1): 2906-19, 1997 Nov.
Article in English | MEDLINE | ID: mdl-9373977

ABSTRACT

A multi-channel model, describing the effects of spectral and temporal integration in amplitude-modulation detection for a stochastic noise carrier, is proposed and validated. The model is based on the modulation filterbank concept which was established in the accompanying paper [Dau et al., J. Acoust. Soc. Am. 102, 2892-2905 (1997)] for modulation perception in narrow-band conditions (single-channel model). To integrate information across frequency, the detection process of the model linearly combines the channel outputs. To integrate information across time, a kind of "multiple-look" strategy, is realized within the detection stage of the model. Both data from the literature and new data are used to validate the model. The model predictions agree with the results of Eddins [J. Acoust. Soc. Am. 93, 470-479 (1993)] that the "time constants" associated with the temporal modulation transfer functions (TMTF) derived for narrow-band stimuli do not vary with carrier frequency region and that they decrease monotonically with increasing stimulus bandwidth. The model is able to predict masking patterns in the modulation-frequency domain, as observed experimentally by Houtgast [J. Acoust. Soc. Am. 85, 1676-1680 (1989)]. The model also accounts for the finding by Sheft and Yost [J. Acoust. Soc. Am. 88, 796-805 (1990)] that the long "effective" integration time constants derived from the data are two orders of magnitude larger than the time constants derived from the cutoff frequency of the TMTF. Finally, the temporal-summation properties of the model allow the prediction of data in a specific temporal paradigm used earlier by Viemeister and Wakefield [J. Acoust. Soc. Am. 90, 858-865 (1991)]. The combination of the modulation filterbank concept and the optimal decision algorithm proposed here appears to present a powerful strategy for describing modulation-detection phenomena in narrow-band and broadband conditions.


Subject(s)
Auditory Perception/physiology , Perceptual Masking , Auditory Threshold , Humans , Noise , Time Factors
16.
J Acoust Soc Am ; 102(5 Pt 1): 2954-65, 1997 Nov.
Article in English | MEDLINE | ID: mdl-9373982

ABSTRACT

Thresholds for the detection of increments and decrements in level of 70 dB SPL sinusoidal signals were measured as a function signal duration (10, 20, or 200 ms), pedestal duration before the signal (10 ms, 200 ms, or pedestal on continuously) and frequency (250, 1000, or 4000 Hz). The sinusoids were presented in a low-pass filtered background noise with an overall level of 68-69 dB SPL which had two purposes: (1) to mask spectral splatter; (2) to induce an adaptation effect, which caused the continuous 4000-Hz pedestal (but not the other two pedestals) to decay to inaudibility (adaptation). We were particularly interested in determining whether the difference in noise-induced adaptation across frequency would influence the pattern of results. Seven normal-hearing subjects were used. Thresholds improved with increasing frequency and with increasing duration for both increments and decrements. However, the effect of increment/decrement duration decreased with increasing frequency; at 4000 Hz thresholds were almost the same for increment durations of 10 and 20 ms. The energy of the increments at threshold increased markedly with increasing increment duration (especially from 20 to 200 ms), suggesting a dominant role for the onsets of the increments as opposed to ongoing differences in level. Increasing the pedestal duration before the increment from 10 to 200 ms slightly improved thresholds for increment and decrement durations of 10 and 20 ms. Increment thresholds were similar for the gated and continuous pedestals at all frequencies, even though the 4000-Hz continuous pedestal decayed to inaudibility. However, thresholds for 200-ms increments were somewhat lower for continuous than for gated pedestals, and supplementary experiments found a larger gated-continuous difference for pedestals presented in quiet. Making the pedestal continuous adversely affected performance for the 10- and 20-ms decrements, but not for the 200-ms decrement. We suggest that the results for decrement detection may be affected by neural long-term adaptation, although they are not clearly related to loudness adaptation.


Subject(s)
Auditory Perception , Auditory Threshold , Adult , Aged , Humans , Loudness Perception , Male , Middle Aged , Time Factors
17.
J Acoust Soc Am ; 101(4): 2202-13, 1997 Apr.
Article in English | MEDLINE | ID: mdl-9104022

ABSTRACT

This article presents two experiments dealing with a psychoacoustical evaluation of the pitch-synchronous overlap-and-add (PSOLA) technique. This technique has been developed for modification of duration and fundamental frequency of speech and is based on simple waveform manipulations. Both experiments were aimed at deriving the sensitivity of the auditory system to the basic distortions introduced by PSOLA. In experiment I, manipulation of fundamental frequency was applied to synthetic single-formant stimuli under minimal stimulus uncertainty, level roving, and formant-frequency roving. In experiment II, the influence of the positioning of the so-called "pitch markers" was studied. Depending on the formant and fundamental frequency, experimental data could be described reasonably well by either a spectral intensity-discrimination model or a temporal model based on detecting changes in modulation of the output of a single auditory filter. Generally, the results were in line with psychoacoustical theory on the auditory processing of resolved and unresolved harmonics.


Subject(s)
Pitch Perception , Psychoacoustics , Speech Perception , Adult , Audiometry, Pure-Tone , Humans , Models, Theoretical , Psychometrics , Terminology as Topic
18.
J Acoust Soc Am ; 101(3): 1671-80, 1997 Mar.
Article in English | MEDLINE | ID: mdl-9069634

ABSTRACT

A new experimental technique for studying binaural processing at high frequencies is introduced. Binaural masking level differences (BMLDs) for the conditions N0S pi and N pi S0 were measured for a tonal signal in narrow-band noise at 125, 250, and 4000 Hz. In addition, "transposed" stimuli were generated, which were centered at 4000 Hz, but were designed to preserve within the envelope the temporal "fine-structure" information available at the two lower frequencies. The BMLDs measured with the 125-Hz transposed stimuli were essentially the same as BMLDs from the regular 125-Hz condition. The transposed 250-Hz stimuli generally produced smaller BMLDs than the stimuli centered at 250 Hz, but the pattern of results as a function of masker bandwidth was the same. The patterns of results from the transposed stimuli are different from those of the 4000-Hz condition and, consistent with the low-frequency masker data, generally show higher BMLDs. The results indicate that the mechanisms underlying binaural processing at low and high frequencies are similar, and that frequency-dependent differences in BMLDs probably reflect the inability of the auditory system to encode the temporal fine structure of high-frequency stimuli.


Subject(s)
Auditory Perception , Perceptual Masking , Humans , Noise , Time Factors
19.
J Acoust Soc Am ; 99(6): 3615-22, 1996 Jun.
Article in English | MEDLINE | ID: mdl-8655793

ABSTRACT

This paper describes a quantitative model for signal processing in the auditory system. The model combines a series of preprocessing stages with an optimal detector as the decision device. The present paper gives a description of the various preprocessing stages and of the implementation of the optimal detector. The output of the preprocessing stages is a time-varying activity pattern to which "internal noise" is added. In the decision process, a stored temporal representation of the signal to be detected (template) is compared with the actual activity pattern. The comparison amounts to calculating the correlation between the two temporal patterns and is comparable to a "matched filtering" process. The detector itself derives the template at the beginning of each simulated threshold measurement from a suprathreshold value of the stimulus. The model allows one to estimate thresholds with the same signals and psychophysical procedures as those used in actual experiments. In the accompanying paper [Dau et al., J. Acoust. Soc. Am. 99, 3623-3631 (1996)] data obtained for human observers are compared with the optimal-detector model for various masking conditions.


Subject(s)
Auditory Perception/physiology , Models, Theoretical , Signal Detection, Psychological/physiology , Basilar Membrane/physiology , Feedback , Hair Cells, Auditory/physiology , Humans
20.
J Acoust Soc Am ; 99(6): 3623-31, 1996 Jun.
Article in English | MEDLINE | ID: mdl-8655794

ABSTRACT

This and the accompanying paper [Dau et al., J. Acoust. Soc. Am. 99, 3615-3622 (1996)] describe a quantitative model for signal processing in the auditory system. The model combines several stages of preprocessing with a decision device that has the properties of an optimal detector. The present paper compares model predictions for a variety of experimental conditions with the performance of human observers. Simulated and psychophysically determined thresholds were estimated with a three-interval forced-choice adaptive procedure. All model parameters were kept constant for all simulations discussed in this paper. For frozen-noise maskers, the effects of the following stimulus parameters were examined: signal frequency, signal phase, temporal position and duration of the signal within the masker under conditions of simultaneous masking, masker level, and masker duration under conditions of forward masking, and backward masking. The influence of signal phase and the temporal position of the signal, including positions at masker onset, was determined for a random-noise masker and compared with corresponding results obtained for a frozen noise. The model describes all the experimental data with an accuracy of a few dB with the following exceptions: forward-masked thresholds obtained with brief maskers are too high and the change in threshold with a change in signal duration is too small. Both discrepancies have their origin in the adaptation stages in the preprocessing part of the model. On the basis of the wide range of simulated conditions we conclude that the present model is a successful approach to describing the detection process in the human auditory system.


Subject(s)
Auditory Perception/physiology , Signal Detection, Psychological/physiology , Adult , Auditory Threshold , Humans , Noise , Perceptual Masking , Psychophysics , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL
...