Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 2 de 2
Filter
Add more filters










Database
Language
Publication year range
1.
J Acoust Soc Am ; 100(3): 1819-34, 1996 Sep.
Article in English | MEDLINE | ID: mdl-8817906

ABSTRACT

Vocal tract models are often used to study the problem of mapping from the acoustic transfer function to the vocal tract area function (inverse mapping). Unfortunately, results based on vocal tract models are strongly affected by the assumptions underlying the models. In this study, the mapping from acoustics (digitized speech samples) to articulation (measurements of the positions of receiver coils placed on the tongue, jaw, and lips) is examined using human data from a single speaker: Simultaneous acoustic and articulator measurements made for vowel-to-vowel transitions, /g/ closures, and transitions into and out of /g/ closures. Articulator positions were measured using an EMMA system to track coils placed on the lips, jaw, and tongue. Using these data, look-up tables were created that allow articulator positions to be estimated from acoustic signals. On a data set not used for making look-up tables, correlations between estimated and actual coil positions of around 94% and root-mean-squared errors around 2 mm are common for coils on the tongue. An error source evaluation shows that estimating articulator positions from quantized acoustics gives root-mean-squared errors that are typically less than 1 mm greater than the errors that would be obtained from quantizing the articulator positions themselves. This study agrees with and extends previous studies of human data by showing that for the data studied, speech acoustics can be used to accurately recover articulator positions.


Subject(s)
Phonetics , Speech Acoustics , Speech/physiology , Humans , Speech Production Measurement
2.
J Acoust Soc Am ; 99(6): 3707-17, 1996 Jun.
Article in English | MEDLINE | ID: mdl-8655802

ABSTRACT

Desirable characteristics of a vocal-tract parametrization include accuracy, low dimensionality, and generalizability across speakers and languages. A low-dimensional, speaker-independent linear parametrization of vowel tongue shapes can be obtained using the PARAFAC three-mode factor analysis procedure [Harshman et al., J. Acoust. Soc. Am. 62, 693-707 (1977)]. Harshman et al. applied PARAFAC to midsagittal x-ray vowel data from five English speakers, reporting that two speaker-independent factors are required to accurately represent the tongue shape measured along anatomically normalized vocal-tract diameter grid lines. Subsequently, the cross-linguistic generality of this parametrization was brought into question by the application of PARAFAC to Icelandic vowel data, where three nonorthogonal factors were reported [Jackson, J. Acoust. Soc. Am. 84, 124-143 (1988)]. This solution is shown to be degenerate; a reanalysis of Jackson's Icelandic data produces two factors that match Harshman et al.'s factors for English vowels, contradicting Jackson's distinction between English and Icelandic language-specific "articulatory primes". To obtain vowel factors not constrained by artificial measurement grid lines, x-ray tongue shape traces of six English speakers were marked with 13 equally spaced points. PARAFAC analysis of this unconstrained (x,y) coordinate data results in two factors that are clearly interpretable in terms of the traditional vowel quality dimensions front/back, high/low.


Subject(s)
Phonetics , Speech/physiology , Tongue/physiology , Cross-Cultural Comparison , Humans , Iceland , Linguistics , Speech Production Measurement , United States
SELECTION OF CITATIONS
SEARCH DETAIL
...