Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
J Acoust Soc Am ; 141(4): EL420, 2017 04.
Article in English | MEDLINE | ID: mdl-28464674

ABSTRACT

This letter investigates the use of subglottal resonances (SGRs) for noise-robust speaker identification (SID). It is motivated by the speaker specificity and stationarity of subglottal acoustics, and the development of noise-robust SGR estimation algorithms which are reliable at low signal-to-noise ratios for large datasets. A two-stage framework is proposed which combines the SGRs with different cepstral features. The cepstral features are used in the first stage to reduce the number of target speakers for a test utterance, and then SGRs are used as complementary second-stage features to conduct identification. Experiments with the TIMIT and NIST 2008 databases show that SGRs, when used in conjunction with power-normalized cepstral coefficients and linear prediction cepstral coefficients, can improve the performance significantly (2%-6% absolute accuracy improvement) across all noise conditions in mismatched situations.


Subject(s)
Acoustics , Glottis/physiology , Phonation , Speech Acoustics , Speech Production Measurement/methods , Voice Quality , Accelerometry , Algorithms , Automation , Female , Humans , Male , Signal Processing, Computer-Assisted , Sound Spectrography , Time Factors
2.
J Acoust Soc Am ; 137(6): 3436-46, 2015 Jun.
Article in English | MEDLINE | ID: mdl-26093432

ABSTRACT

This paper offers a re-evaluation of the mechanical properties of the tracheo-bronchial soft tissues and cartilage and uses a model to examine their effects on the subglottal acoustic input impedance. It is shown that the values for soft tissue elastance and cartilage viscosity typically used in models of subglottal acoustics during phonation are not accurate, and corrected values are proposed. The calculated subglottal acoustic input impedance using these corrected values reveals clusters of weak resonances due to soft tissues (SgT) and cartilage (SgC) lining the walls of the trachea and large bronchi, which can be observed empirically in subglottal acoustic spectra. The model predicts that individuals may exhibit SgT and SgC resonances to variable degrees, depending on a number of factors including tissue mechanical properties and the dimensions of the trachea and large bronchi. Potential implications for voice production and large pulmonary airway tissue diseases are also discussed.


Subject(s)
Acoustics , Bronchi/physiology , Cartilage/physiology , Glottis/physiology , Phonation , Speech , Trachea/physiology , Voice , Biomechanical Phenomena , Bronchi/anatomy & histology , Cartilage/anatomy & histology , Elastic Modulus , Glottis/anatomy & histology , Humans , Models, Anatomic , Models, Biological , Pressure , Respiration , Sound Spectrography , Trachea/anatomy & histology , Vibration , Viscosity
3.
J Acoust Soc Am ; 132(4): 2592-602, 2012 Oct.
Article in English | MEDLINE | ID: mdl-23039452

ABSTRACT

This paper presents a large-scale study of subglottal resonances (SGRs) (the resonant frequencies of the tracheo-bronchial tree) and their relations to various acoustical and physiological characteristics of speakers. The paper presents data from a corpus of simultaneous microphone and accelerometer recordings of consonant-vowel-consonant (CVC) words embedded in a carrier phrase spoken by 25 male and 25 female native speakers of American English ranging in age from 18 to 24 yr. The corpus contains 17,500 utterances of 14 American English monophthongs, diphthongs, and the rhotic approximant [[inverted r]] in various CVC contexts. Only monophthongs are analyzed in this paper. Speaker height and age were also recorded. Findings include (1) normative data on the frequency distribution of SGRs for young adults, (2) the dependence of SGRs on height, (3) the lack of a correlation between SGRs and formants or the fundamental frequency, (4) a poor correlation of the first SGR with the second and third SGRs but a strong correlation between the second and third SGRs, and (5) a significant effect of vowel category on SGR frequencies, although this effect is smaller than the measurement standard deviations and therefore negligible for practical purposes.


Subject(s)
Glottis/physiology , Language , Phonation , Speech Acoustics , Voice Quality , Accelerometry , Adolescent , Age Factors , Biomechanical Phenomena , Body Height , Female , Humans , Male , Sex Factors , Sound Spectrography , Speech Production Measurement , Vibration , Young Adult
4.
J Acoust Soc Am ; 130(4): 2108-15, 2011 Oct.
Article in English | MEDLINE | ID: mdl-21973365

ABSTRACT

Previous studies of subglottal resonances have reported findings based on relatively few subjects, and the relations between these resonances, subglottal anatomy, and models of subglottal acoustics are not well understood. In this study, accelerometer signals of subglottal acoustics recorded during sustained [a:] vowels of 50 adult native speakers (25 males, 25 females) of American English were analyzed. The study confirms that a simple uniform tube model of subglottal airways, closed at the glottis and open at the inferior end, is appropriate for describing subglottal resonances. The main findings of the study are (1) whereas the walls may be considered rigid in the frequency range of Sg2 and Sg3, they are yielding and resonant in the frequency range of Sg1, with a resulting ~4/3 increase in wave propagation velocity and, consequently, in the frequency of Sg1; (2) the "acoustic length" of the equivalent uniform tube varies between 18 and 23.5 cm, and is approximately equal to the height of the speaker divided by an empirically determined scaling factor; (3) trachea length can also be predicted by dividing height by another empirically determined scaling factor; and (4) differences between the subglottal resonances of males and females can be accounted for by height-related differences.


Subject(s)
Phonation , Speech Acoustics , Trachea/physiology , Adolescent , Adult , Biomechanical Phenomena , Body Height , Female , Humans , Male , Models, Biological , Pressure , Sex Factors , Sound Spectrography , Time Factors , Trachea/anatomy & histology , Vibration , Young Adult
5.
J Acoust Soc Am ; 129(5): EL197-203, 2011 May.
Article in English | MEDLINE | ID: mdl-21568375

ABSTRACT

This letter focuses on the automatic estimation of the first subglottal resonance (Sg1). A database comprising speech and subglottal data of native American English speakers and bilingual Spanish/English speakers was used for the analysis. Data from 11 speakers (five males and six females) were used to derive an empirical relation among the first formant frequency, fundamental frequency, and Sg1. Using the derived relation, Sg1 was automatically estimated from voiced sounds in English and Spanish sentences spoken by 22 different speakers (11 males and 11 females). The error in estimating Sg1 was less than 50 Hz, on average.


Subject(s)
Algorithms , Phonetics , Sound Spectrography/methods , Acceleration , Acoustics , Adult , Copying Processes , Databases, Factual , Female , Humans , Language , Larynx , Male , Sex Characteristics , Speech Recognition Software
SELECTION OF CITATIONS
SEARCH DETAIL
...