Search | Portal Regional da BVS (test)

1.

The relation of velopharyngeal coupling area and vocal tract scaling to identification of stop-nasal cognates.

Story, Brad H; Bunton, Kate.

J Acoust Soc Am ; 154(6): 3741-3759, 2023 12 01.

Article in English | MEDLINE | ID: mdl-38099832

ABSTRACT

The purpose of this study was to determine whether the threshold of velopharyngeal (VP) coupling area at which listeners switch from identifying a consonant as a stop to a nasal in North American English was different for speech produced by a model based on an adult male, an adult female, and a 4-year-old child. V1CV2 stimuli were generated with a speech production model that encodes phonetic segments as relative acoustic targets imposed on an underlying vocal tract and laryngeal structure that can be scaled according to sex and age. Each V1CV2 was synthesized with a set of VP coupling functions whose maximum area ranged from 0 to 0.1 cm². Results showed that scaling the vocal tract and vocal folds had essentially no effect on the VP coupling area at which listener identification shifted from stop to nasal. The range of coupling areas at which the crossover occurred was 0.037-0.049 cm² for the male model, 0.040-0.055 cm² for the female model, and 0.039-0.052 cm² for the 4-year-old child model, and the overall mean was 0.044 cm². Calculations of band-limited peak nasalance indicated that 85% peak nasalance during the consonant was well aligned with listener responses.

Subjects

Larynx; Speech; Adult; Female; Male; Humans; Child, Preschool; Acoustics; Language; Nose
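
The crossover described in this abstract, the coupling area at which listener identification shifts from stop to nasal, can be estimated from an identification function by finding where the proportion of "nasal" responses passes a criterion. The sketch below is illustrative only: the response proportions are invented, the 50% criterion and the linear interpolation are assumptions, and `crossover_area` is a hypothetical helper rather than the authors' analysis.

```python
def crossover_area(areas, prop_nasal, criterion=0.5):
    """Return the coupling area (cm^2) where prop_nasal first rises through criterion.

    Linear interpolation is used between adjacent stimulus steps.
    Returns None if the responses never cross the criterion.
    """
    steps = list(zip(areas, prop_nasal))
    for (a0, p0), (a1, p1) in zip(steps, steps[1:]):
        if p0 < criterion <= p1:
            return a0 + (criterion - p0) * (a1 - a0) / (p1 - p0)
    return None


# Hypothetical identification data: maximum VP coupling area (cm^2) and the
# proportion of listeners who labeled the consonant as a nasal.
areas = [0.00, 0.02, 0.04, 0.06, 0.08, 0.10]
prop_nasal = [0.00, 0.05, 0.30, 0.90, 1.00, 1.00]

threshold = crossover_area(areas, prop_nasal)
print(f"estimated crossover: {threshold:.3f} cm^2")
```

With real data a fitted psychometric function would probably be preferable to straight-line interpolation; the simple version is used here only to make the idea of a crossover concrete.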

2.

Acoustical Theory of Vowel Modification Strategies in Belting.

Herbst, Christian T; Story, Brad H; Meyer, David.

J Voice ; 2023 Apr 18.

Article in English | MEDLINE | ID: mdl-37080890

ABSTRACT

Various authors have argued that belting is to be produced by "speech-like" sounds, with the first and second supraglottic vocal tract resonances (fR1 and fR2) at frequencies of the vowels determined by the lyrics to be sung. Acoustically, the hallmark of belting has been identified as a dominant second harmonic, possibly enhanced by first resonance tuning (fR1≈2fo). It is not clear how both these concepts - (a) phonating with "speech-like," unmodified vowels; and (b) producing a belting sound with a dominant second harmonic, typically enhanced by fR1 - can be upheld when singing across a singer's entire musical pitch range. For instance, anecdotal reports from pedagogues suggest that vowels with a low fR1, such as [i] or [u], might have to be modified considerably (by raising fR1) in order to phonate at higher pitches. These issues were systematically addressed in silico with respect to treble singing, using a linear source-filter voice production model. The dominant harmonic of the radiated spectrum was assessed in 12987 simulations, covering a parameter space of 37 fundamental frequencies (fo) across the musical pitch range from C3 to C6; 27 voice source spectral slope settings from -4 to -30 dB/octave; computed for 13 different IPA vowels. The results suggest that, for most unmodified vowels, the stereotypical belting sound characteristics with a dominant second harmonic can only be produced over a pitch range of about a musical fifth, centered at fo≈0.5fR1. In the [É] and [É] vowels, that range is extended to an octave, supported by a low second resonance. Data aggregation - considering the relative prevalence of vowels in American English - suggests that, historically, belting with fR1≈2fo was derived from speech, and that songs with an extended musical pitch range likely demand considerable vowel modification. We thus argue that - on acoustical grounds - the pedagogical commandment for belting with unmodified, "speech-like" vowels can not always be fulfilled.
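
The simulation count reported in this abstract follows directly from the size of the parameter grid (37 × 27 × 13 = 12987). The sketch below rebuilds a grid of that size under stated assumptions: semitone steps from C3 to C6, 1 dB/octave steps for the spectral slope, an A4 = 440 Hz tuning reference, and placeholder vowel labels, since the abstract gives only counts and ranges. The helper `fifth_around_half_fr1` is hypothetical and reads "centered" on a log-frequency scale, which is also an assumption.

```python
from itertools import product

A4_HZ = 440.0  # assumed tuning reference


def midi_to_hz(midi_note):
    """Equal-tempered frequency of a MIDI note number (A4 = 69)."""
    return A4_HZ * 2.0 ** ((midi_note - 69) / 12.0)


# 37 fundamental frequencies: C3 (MIDI 48) to C6 (MIDI 84), semitone steps (assumed).
fo_values = [midi_to_hz(n) for n in range(48, 85)]
# 27 spectral slopes: -4 to -30 dB/octave in 1 dB steps (assumed).
slopes = list(range(-4, -31, -1))
# 13 vowels; placeholder labels, because the abstract does not list them.
vowels = [f"vowel_{i}" for i in range(13)]

grid = list(product(fo_values, slopes, vowels))
print(len(grid))  # size of the parameter space


def fifth_around_half_fr1(fr1_hz):
    """fo range spanning a perfect fifth (7 semitones) centered on fR1 / 2."""
    center = 0.5 * fr1_hz
    half_span = 2.0 ** (3.5 / 12.0)
    return center / half_span, center * half_span


low, high = fifth_around_half_fr1(700.0)  # 700 Hz is an arbitrary example fR1
print(f"{low:.1f} Hz to {high:.1f} Hz")
```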

3.

Computer simulation of vocal tract resonance tuning strategies with respect to fundamental frequency and voice source spectral slope in singing.

Herbst, Christian T; Story, Brad H.

J Acoust Soc Am ; 152(6): 3548, 2022 12.

Article in English | MEDLINE | ID: mdl-36586864

ABSTRACT

A well-known concept of singing voice pedagogy is "formant tuning," where the lowest two vocal tract resonances (fR1, fR2) are systematically tuned to harmonics of the laryngeal voice source to maximize the level of radiated sound. A comprehensive evaluation of this resonance tuning concept is still needed. Here, the effect of fR1, fR2 variation was systematically evaluated in silico across the entire fundamental frequency range of classical singing for three voice source characteristics with spectral slopes of -6, -12, and -18 dB/octave. Respective vocal tract transfer functions were generated with a previously introduced low-dimensional computational model, and resultant radiated sound levels were expressed in dB(A). Two distinct strategies for optimized sound output emerged for low vs high voices. At low pitches, spectral slope was the predominant factor for sound level increase, and resonance tuning only had a marginal effect. In contrast, resonance tuning strategies became more prevalent and voice source strength played an increasingly marginal role as fundamental frequency increased to the upper limits of the soprano range. This suggests that different voice classes (e.g., low male vs high female) likely have fundamentally different strategies for optimizing sound output, which has fundamental implications for pedagogical practice.

Subjects

Singing; Voice; Male; Female; Humans; Computer Simulation; Sound; Vibration

4.

Anatomic development of the upper airway during the first five years of life: A three-dimensional imaging study.

Chuang, Ying Ji; Hwang, Seong Jae; Buhr, Kevin A; Miller, Courtney A; Avey, Gregory D; Story, Brad H; Vorperian, Houri K.

PLoS One ; 17(3): e0264981, 2022.

Article in English | MEDLINE | ID: mdl-35275939

ABSTRACT

PURPOSE: Normative data on the growth and development of the upper airway across the sexes is needed for the diagnosis and treatment of congenital and acquired respiratory anomalies and to gain insight on developmental changes in speech acoustics and disorders with craniofacial anomalies. METHODS: The growth of the upper airway in children ages birth to 5 years, as compared to adults, was quantified using an imaging database with computed tomography studies from typically developing individuals. Methodological criteria for scan inclusion and airway measurements included: head position, histogram-based airway segmentation, anatomic landmark placement, and development of a semi-automatic centerline for data extraction. A comprehensive set of 2D and 3D supra- and sub-glottal measurements from the choanae to tracheal opening were obtained including: naso-oro-laryngo-pharynx subregion volume and length, each subregion's superior and inferior cross-sectional-area, and antero-posterior and transverse/width distances. RESULTS: Growth of the upper airway during the first 5 years of life was more pronounced in the vertical and transverse/lateral dimensions than in the antero-posterior dimension. By age 5 years, females have larger pharyngeal measurement than males. Prepubertal sex-differences were identified in the subglottal region. CONCLUSIONS: Our findings demonstrate the importance of studying the growth of the upper airway in 3D. As the lumen length increases, its shape changes, becoming increasingly elliptical during the first 5 years of life. This study also emphasizes the importance of methodological considerations for both image acquisition and data extraction, as well as the use of consistent anatomic structures in defining pharyngeal regions.

Subjects

Imaging, Three-Dimensional; Larynx; Adult; Anatomic Landmarks; Child; Child, Preschool; Cross-Sectional Studies; Female; Humans; Imaging, Three-Dimensional/methods; Male; Pharynx/diagnostic imaging

5.

Apraxia of speech and the study of speech production impairments: Can we avoid further confusion? Reply to Romani (2021).

Mailend, Marja-Liisa; Maas, Edwin; Story, Brad H.

Cogn Neuropsychol ; 38(4): 309-317, 2021.

Article in English | MEDLINE | ID: mdl-34881683

ABSTRACT

We agree with Cristina Romani (CR) about reducing confusion and agree that the issues raised in her commentary are central to the study of apraxia of speech (AOS). However, CR critiques our approach from the perspective of basic cognitive neuropsychology. This is confusing and misleading because, contrary to CR's claim, we did not attempt to inform models of typical speech production. Instead, we relied on such models to study the impairment in the clinical category of AOS (translational cognitive neuropsychology). Thus, the approach along with the underlying assumptions is different. This response aims to clarify these assumptions, broaden the discussion regarding the methodological approach, and address CR's concerns. We argue that our approach is well-suited to meet the goals of our recent studies and is commensurate with the current state of the science of AOS. Ultimately, a plurality of approaches is needed to understand a phenomenon as complex as AOS.

Subjects

Aphasia; Apraxias; Aphasia/complications; Apraxias/etiology; Confusion/complications; Female; Humans; Speech; Speech Disorders; Speech Production Measurement

6.

The relation of velopharyngeal coupling area to the identification of stop versus nasal consonants in North American English based on speech generated by acoustically driven vocal tract modulations.

Story, Brad H; Bunton, Kate.

J Acoust Soc Am ; 150(5): 3618, 2021 11.

Article in English | MEDLINE | ID: mdl-34852618

ABSTRACT

The purpose of this study was to determine the threshold of velopharyngeal coupling area at which listeners switch from identifying a consonant as a stop to a nasal in North American English, based on V1CV2 stimuli generated with a speech production model that encodes phonetic segments as relative acoustic targets. Each V1CV2 was synthesized with a set of velopharyngeal coupling functions whose area ranged from 0 to 0.1 cm². Results show that consonants were identified by listeners as a stop when the coupling area was less than 0.035-0.057 cm², depending on place of articulation and final vowel. The smallest coupling area (0.035 cm²) at which the stop-to-nasal switch occurred was found for an alveolar consonant in the /ÉCi/ context, whereas the largest (0.057 cm²) was for a bilabial in /ÉCÉ/. For each stimulus, the balance of oral versus nasal acoustic energy was characterized by the peak nasalance during the consonant. Stimuli with peak nasalance below 40% were mostly identified by listeners as stops, whereas those above 40% were identified as nasals. This study was intended to be a precursor to further investigations using the same model but scaled to represent the developing speech production system of male and female talkers.

Subjects

Speech Perception; Speech; Female; Humans; Male; North America; Phonetics; Speech Production Measurement
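
Peak nasalance, as used in this abstract, summarizes the balance of nasal and oral acoustic energy during the consonant. A minimal sketch follows, assuming the common definition of nasalance as nasal energy divided by the sum of nasal and oral energy, expressed as a percentage. The per-frame energy values are invented, the function names are hypothetical, and the 40% boundary is applied as a rough rule only, since the abstract says stimuli below it were "mostly" heard as stops.

```python
def nasalance_percent(nasal, oral):
    """Nasalance (%) for one analysis frame: nasal / (nasal + oral) * 100."""
    total = nasal + oral
    return 100.0 * nasal / total if total > 0 else 0.0


def peak_nasalance(nasal_frames, oral_frames):
    """Largest frame-wise nasalance across the consonant interval."""
    return max(nasalance_percent(n, o) for n, o in zip(nasal_frames, oral_frames))


def predicted_label(peak, boundary=40.0):
    """Rough prediction of the listener response from peak nasalance."""
    return "nasal" if peak > boundary else "stop"


# Hypothetical frame energies (arbitrary units) spanning one consonant.
nasal_frames = [0.1, 0.4, 0.9, 0.5, 0.1]
oral_frames = [1.0, 0.9, 1.1, 1.0, 1.0]

peak = peak_nasalance(nasal_frames, oral_frames)
print(f"peak nasalance: {peak:.1f}% -> {predicted_label(peak)}")
```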

7.

The Effects of Remote Signal Transmission and Recording on Acoustical Measures of Simulated Essential Vocal Tremor: Considerations for Remote Treatment Research and Telepractice.

Lester-Smith, Rosemary A; Jebaily, Charles G; Story, Brad H.

J Voice ; 2021 Oct 23.

Article in English | MEDLINE | ID: mdl-34702610

ABSTRACT

PURPOSE: Studies on medical and behavioral interventions for essential vocal tremor (EVT) have shown inconsistent effects on acoustical and perceptual outcome measures across studies and across participants. Remote acoustical and perceptual assessments might facilitate studies with larger samples of participants and repeated measures that could clarify treatment effects and identify optimal treatment candidates. Furthermore, remote acoustical and perceptual assessment might allow clinicians to monitor clients' treatment responses and optimize treatment approaches during telepractice. Thus, the purpose of this study was to evaluate the accuracy of remote signal transmission and recording for acoustical and perceptual assessment of EVT. METHOD: Simulations of EVT were produced using a computational model and were recorded using local and remote procedures to represent client- and clinician-end recordings respectively. Acoustical analyses measured the extent and rate of fundamental frequency (fo) and intensity modulation to represent vocal tremor severity and the cepstral peak prominence (CPPS) to represent voice quality. The data were analyzed using repeated measures analysis of variance (ANOVA) with recording as the within-subjects factor and sex of the computational model as the between-subjects factor. RESULTS: There was a significant main effect of recording on the rate of fo modulation and significant interactions of recording and sex for the extent of intensity modulation, rate of intensity modulation, and CPPS. Posthoc pairwise comparisons and analysis of effect size indicated that recording procedures had the largest effect on the extent of intensity modulation for male simulations, the rate of intensity modulation for male and female simulations, and the CPPS for male and female simulations. 
Despite having disabled all known software and computer audio enhancing options and having stable ethernet connections, there was inconsistent attenuation of signal amplitude in remote recordings that was most problematic for samples with a breathy voice quality but also affected samples with typical and pressed voice qualities. CONCLUSIONS: Acoustical measures that correlate to perception of vocal tremor and voice quality were altered by remote signal transmission and recording. In particular, signal transmission and recording in Zoom altered time-based estimates of intensity modulation and CPPS with male and female simulations of EVT and magnitude-based estimates of intensity modulation with male simulations of EVT. In contrast, signal transmission and recording in Zoom minimally altered time- and magnitude-based estimates of fo modulation with male and female simulations of EVT. Therefore, acoustical and perceptual assessments of EVT should be performed using audio recordings that are collected locally on the participant- or client-end, particularly when measuring modulation of intensity and CPP or estimating vocal tremor severity and voice quality. Development of procedures for collecting local audio recordings in remote settings may expand data collection for treatment research and enhance telepractice.

8.

Identification of voiced stop consonants produced by acoustically driven vocal tract modulations.

Story, Brad H; Bunton, Kate.

JASA Express Lett ; 1(8): 085203, 2021 08.

Article in English | MEDLINE | ID: mdl-36154248

ABSTRACT

A recently developed speech production model, in which speech segments are specified by relative acoustic events called resonance deflection patterns, was used to generate speech signals that were presented to listeners in a perceptual test. The purpose was to determine the effect of variations of the magnitude and polarity of the third resonance deflection on identification of the consonant in a V1CV2 disyllable while the deflections of the first and second resonances were held constant. Results showed that listeners' identification changed from /d/ to /ɡ/ when the polarity of the third resonance deflection switched from positive to negative.

Subjects

Phonetics; Voice; Acoustics; Speech Acoustics

9.

Examining speech motor planning difficulties in apraxia of speech and aphasia via the sequential production of phonetically similar words.

Mailend, Marja-Liisa; Maas, Edwin; Beeson, Pélagie M; Story, Brad H; Forster, Kenneth I.

Cogn Neuropsychol ; 38(1): 72-87, 2021 02.

Article in English | MEDLINE | ID: mdl-33249997

ABSTRACT

This study investigated the underlying nature of apraxia of speech (AOS) by testing two competing hypotheses. The Reduced Buffer Capacity Hypothesis argues that people with AOS can plan speech only one syllable at a time (Rogers and Storkel, 1999. Planning speech one syllable at a time: The reduced buffer capacity hypothesis in apraxia of speech. Aphasiology, 13(9-11), 793-805. https://doi.org/10.1080/026870399401885). The Program Retrieval Deficit Hypothesis states that selecting a motor programme is difficult in the face of competition from other simultaneously activated programmes (Mailend and Maas, 2013. Speech motor programming in apraxia of speech: Evidence from a delayed picture-word interference task. American Journal of Speech-Language Pathology, 22(2), S380-S396. https://doi.org/10.1044/1058-0360(2013/12-0101)). Speakers with AOS and aphasia, aphasia without AOS, and unimpaired controls were asked to prepare and hold a two-word utterance until a go-signal prompted a spoken response. Phonetic similarity between target words was manipulated. Speakers with AOS had longer reaction times in conditions with two similar words compared to two identical words. The Control and Aphasia groups did not show this effect. These results suggest that speakers with AOS need additional processing time to retrieve target words when multiple motor programmes are simultaneously activated.

Subjects

Aphasia/physiopathology; Apraxias/physiopathology; Phonetics; Speech Disorders/physiopathology; Speech; Adult; Aged; Female; Humans; Male; Middle Aged; Reaction Time; Speech Production Measurement/methods

10.

Effects of sampling rate and type of anti-aliasing filter on linear-predictive estimates of formant frequencies in men, women, and children.

Milenkovic, Paul H; Wagner, Madison; Kent, Raymond D; Story, Brad H; Vorperian, Houri K.

J Acoust Soc Am ; 147(3): EL221, 2020 03.

Article in English | MEDLINE | ID: mdl-32237805

ABSTRACT

The purpose of this study was to assess the effect of downsampling the acoustic signal on the accuracy of linear-predictive (LPC) formant estimation. Based on speech produced by men, women, and children, the first four formant frequencies were estimated at sampling rates of 48, 16, and 10 kHz using different anti-alias filtering. With proper selection of number of LPC coefficients, anti-alias filter and between-frame averaging, results suggest that accuracy is not improved by rates substantially below 48 kHz. Any downsampling should not go below 16 kHz with a filter cut-off centered at 8 kHz.

Subjects

Acoustics; Speech; Child; Female; Humans; Male; Speech Acoustics

11.

A model of speech production based on the acoustic relativity of the vocal tract.

Story, Brad H; Bunton, Kate.

J Acoust Soc Am ; 146(4): 2522, 2019 10.

Article in English | MEDLINE | ID: mdl-31671993

ABSTRACT

A model is described in which the effects of articulatory movements to produce speech are generated by specifying relative acoustic events along a time axis. These events consist of directional changes of the vocal tract resonance frequencies that, when associated with a temporal event function, are transformed, via acoustic sensitivity functions, into time-varying modulations of the vocal tract shape. Because the events may overlap considerably in time, coarticulatory effects are automatically generated. Production of sentence-level speech with the model is demonstrated with audio samples and vocal tract animations.

Subjects

Models, Biological; Speech Production Measurement; Speech/physiology; Acoustics; Humans; Jaw/physiology; Larynx/physiology; Lip/physiology; Male; Tongue/physiology

12.

Speech motor planning in the context of phonetically similar words: Evidence from apraxia of speech and aphasia.

Mailend, Marja-Liisa; Maas, Edwin; Beeson, Pélagie M; Story, Brad H; Forster, Kenneth I.

Neuropsychologia ; 127: 171-184, 2019 04.

Article in English | MEDLINE | ID: mdl-30817912

ABSTRACT

The purpose of this study was to test two competing hypotheses about the nature of the impairment in apraxia of speech (AOS). The Reduced Buffer Capacity Hypothesis argues that people with AOS can hold only one syllable at a time in the speech motor planning buffer. The Program Retrieval Deficit Hypothesis states that people with AOS have difficulty accessing the intended motor program in a context where several motor programs are activated simultaneously. The participants included eight speakers with AOS, most of whom also had aphasia, nine speakers with aphasia without AOS, and 25 age-matched control speakers. The experimental paradigm prompted single word production following three types of primes. In most trials, prime and target were the same (e.g., bill-bill). On some trials, the initial consonant differed in one phonetic feature (e.g., bill-dill; Similar) or in all phonetic features (fill-bill; Different). The dependent measures were accuracy and reaction time. The results revealed a switch cost - longer reaction times in trials where the prime and target differed compared to trials where they were the same words - in all groups; however, the switch cost was significantly larger in the AOS group compared to the other two groups. These findings are in line with the prediction of the Program Retrieval Deficit Hypothesis and suggest that speakers with AOS have difficulty with selecting one program over another when several programs compete for selection.

Subjects

Anticipation, Psychological; Aphasia/psychology; Phonetics; Speech Disorders/psychology; Speech; Adult; Aged; Apraxias; Female; Humans; Individuality; Male; Middle Aged; Psychomotor Performance; Reaction Time
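
The switch cost defined in this abstract is the difference between reaction times on trials where prime and target differ and trials where they are the same. A minimal sketch of that arithmetic is shown below. The reaction times and the group layout are invented for illustration, and `switch_cost` is a hypothetical helper; the study's own statistics are not reproduced here.

```python
from statistics import mean


def switch_cost(rt_same_ms, rt_switch_ms):
    """Mean RT on prime-differs trials minus mean RT on prime-same trials (ms)."""
    return mean(rt_switch_ms) - mean(rt_same_ms)


# Hypothetical reaction times in milliseconds, for illustration only.
rts = {
    "AOS": {"same": [620, 640, 660], "switch": [760, 800, 780]},
    "Control": {"same": [480, 500, 520], "switch": [520, 540, 560]},
}

costs = {group: switch_cost(d["same"], d["switch"]) for group, d in rts.items()}
for group, cost in costs.items():
    print(f"{group}: switch cost = {cost} ms")
```

A positive cost in every group with a larger value in one group is the pattern the abstract reports for the AOS group; whether a difference is reliable would still require the kind of inferential test used in the study.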

13.

An age-dependent vocal tract model for males and females based on anatomic measurements.

Story, Brad H; Vorperian, Houri K; Bunton, Kate; Durtschi, Reid B.

J Acoust Soc Am ; 143(5): 3079, 2018 05.

Article in English | MEDLINE | ID: mdl-29857736

ABSTRACT

The purpose of this study was to take a first step toward constructing a developmental and sex-specific version of a parametric vocal tract area function model representative of male and female vocal tracts ranging in age from infancy to 12 yrs, as well as adults. Anatomic measurements collected from a large imaging database of male and female children and adults provided the dataset from which length warping and cross-dimension scaling functions were derived, and applied to the adult-based vocal tract model to project it backward along an age continuum. The resulting model was assessed qualitatively by projecting hypothetical vocal tract shapes onto midsagittal images from the cohort of children, and quantitatively by comparison of formant frequencies produced by the model to those reported in the literature. An additional validation of modeled vocal tract shapes was made possible by comparison to cross-sectional area measurements obtained for children and adults using acoustic pharyngometry. This initial attempt to generate a sex-specific developmental vocal tract model paves a path to study the relation of vocal tract dimensions to documented prepubertal acoustic differences.

Subjects

Child Development/physiology; Sex Characteristics; Speech/physiology; Vocal Cords/anatomy & histology; Vocal Cords/physiology; Adult; Age Factors; Child; Child, Preschool; Female; Humans; Infant; Infant, Newborn; Male; Sex Factors; Vocal Cords/diagnostic imaging

14.

Vowel space density as an indicator of speech performance.

Story, Brad H; Bunton, Kate.

J Acoust Soc Am ; 141(5): EL458, 2017 05.

Article in English | MEDLINE | ID: mdl-28599542

ABSTRACT

The purpose of this study was to develop a method for visualizing and assessing the characteristics of vowel production by measuring the local density of normalized F1 and F2 formant frequencies. The result is a three-dimensional plot, called the vowel space density (VSD), that indicates the regions in the vowel space most heavily used by a talker during speech production. The area of a convex hull enclosing the vowel space at specific threshold density values was proposed as a means of quantifying the VSD.

Subjects

Acoustics; Phonetics; Speech Acoustics; Speech Production Measurement/methods; Voice Quality; Humans; Signal Processing, Computer-Assisted; Sound Spectrography
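
The two steps named in this abstract, a local density over normalized (F1, F2) points and the area of a convex hull at a density threshold, can be sketched in plain Python. Everything specific below is an assumption made for illustration: the density estimate (counting points within a fixed radius of each grid node, scaled so the maximum is 1), the grid, the data points, and all function names. The published method may differ in each of these respects.

```python
import math


def local_density(points, grid, radius):
    """Count of points within `radius` of each grid node, scaled to a maximum of 1."""
    counts = [sum(1 for p in points if math.dist(p, g) <= radius) for g in grid]
    peak = max(counts)
    return [c / peak if peak else 0.0 for c in counts]


def convex_hull(pts):
    """Andrew's monotone chain; hull vertices in counter-clockwise order."""
    pts = sorted(set(pts))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    def half(seq):
        chain = []
        for p in seq:
            while len(chain) >= 2 and cross(chain[-2], chain[-1], p) <= 0:
                chain.pop()
            chain.append(p)
        return chain[:-1]

    return half(pts) + half(reversed(pts))


def polygon_area(vertices):
    """Shoelace formula for a simple polygon."""
    n = len(vertices)
    twice_area = sum(
        vertices[i][0] * vertices[(i + 1) % n][1]
        - vertices[(i + 1) % n][0] * vertices[i][1]
        for i in range(n)
    )
    return abs(twice_area) / 2.0


def vsd_hull_area(points, grid, radius, threshold):
    """Area of the convex hull around grid nodes whose density meets the threshold."""
    density = local_density(points, grid, radius)
    kept = [g for g, d in zip(grid, density) if d >= threshold]
    return polygon_area(convex_hull(kept)) if len(kept) >= 3 else 0.0


# Hypothetical normalized (F1, F2) samples: four heavily used locations and one rare one.
points = ([(0.0, 0.0)] * 3 + [(0.5, 0.0)] * 3 + [(0.0, 0.5)] * 3
          + [(0.5, 0.5)] * 3 + [(1.0, 1.0)])
grid = [(x / 2, y / 2) for x in range(-2, 3) for y in range(-2, 3)]

area_low = vsd_hull_area(points, grid, radius=0.1, threshold=0.25)
area_high = vsd_hull_area(points, grid, radius=0.1, threshold=0.5)
print(area_low, area_high)
```

Raising the threshold drops the rarely used location from the hull, so the enclosed area shrinks; that is the sense in which hull area at a given threshold quantifies how much of the vowel space is heavily used.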

15.

Influence of Left-Right Asymmetries on Voice Quality in Simulated Paramedian Vocal Fold Paralysis.

Samlan, Robin A; Story, Brad H.

J Speech Lang Hear Res ; 60(2): 306-321, 2017 02 01.

Article in English | MEDLINE | ID: mdl-28199505

ABSTRACT

Purpose: The purpose of this study was to determine the vocal fold structural and vibratory symmetries that are important to vocal function and voice quality in a simulated paramedian vocal fold paralysis. Method: A computational kinematic speech production model was used to simulate an exemplar "voice" on the basis of asymmetric settings of parameters controlling glottal configuration. These parameters were then altered individually to determine their effect on maximum flow declination rate, spectral slope, cepstral peak prominence, harmonics-to-noise ratio, and perceived voice quality. Results: Asymmetry of each of the 5 vocal fold parameters influenced vocal function and voice quality; measured change was greatest for adduction and bulging. Increasing the symmetry of all parameters improved voice, and the best voice occurred with overcorrection of adduction, followed by bulging, nodal point ratio, starting phase, and amplitude of vibration. Conclusions: Although vocal process adduction and edge bulging asymmetries are most influential in voice quality for simulated vocal fold motion impairment, amplitude of vibration and starting phase asymmetries are also perceptually important. These findings are consistent with the current surgical approach to vocal fold motion impairment, where goals include medializing the vocal process and straightening concave edges. The results also explain many of the residual postoperative voice limitations.

Subjects

Computer Simulation; Models, Biological; Vocal Cord Paralysis/physiopathology; Voice Quality; Biomechanical Phenomena; Humans; Vibration; Vocal Cords/physiopathology; Voice Quality/physiology

16.

An acoustically-driven vocal tract model for stop consonant production.

Story, Brad H; Bunton, Kate.

Speech Commun ; 87: 1-17, 2017 Mar.

Article in English | MEDLINE | ID: mdl-28093574

ABSTRACT

The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these "resonance deflection patterns" are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configurations of constrictions and expansions generated by this process were shown to be physiologically realistic and to produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.

17.

The effects of physiological adjustments on the perceptual and acoustical characteristics of vibrato as a model of vocal tremor.

Lester-Smith, Rosemary A; Story, Brad H.

J Acoust Soc Am ; 140(5): 3827, 2016 Nov.

Article in English | MEDLINE | ID: mdl-27908094

ABSTRACT

The purpose of this study was to investigate the effects of physiological adjustments on listeners' perception of the magnitude of modulation of voice and to determine the characteristics of the acoustical modulations that explained listeners' judgments. This research was carried out using singers producing vibrato as a model of vocal tremor. Twenty healthy adults participated in a perceptual study involving pair-comparisons of the magnitude of "shakiness" with singers' samples, which differed by fundamental frequency, vocal quality, and vowel. Results revealed that listeners perceived a higher magnitude of voice modulation when female samples had a pressed vocal quality. Acoustical analyses were performed with voice samples to determine the features that predicted listeners' judgments. Based on regression analyses, listeners' judgments were predicted to some extent by modulation information in frequency bands across the spectrum.

Subjects

Tremor; Adult; Female; Humans; Judgment; Male; Singing; Voice; Voice Quality; Young Adult

18.

A Modeling Study of the Effects of Vocal Tract Movement Duration and Magnitude on the F2 Trajectory in CV Words.

Neely, Kimberly D; Bunton, Kate; Story, Brad H.

J Speech Lang Hear Res ; 59(6): 1327-1334, 2016 12 01.

Article in English | MEDLINE | ID: mdl-27768174

ABSTRACT

Purpose: This study used a computational vocal tract model to investigate the relationship of diphthong duration and vocal tract movement magnitude to measures of the F2 trajectory in CV words. Method: Three words (bough, boy, and buy) were simulated on the basis of an adult female vocal tract model, in which the model parameters were estimated from audio recordings of a female talker. Model parameters were then modified to generate 35 simulations of each word corresponding to 7 different durations and 5 movement magnitude settings. In addition, these simulations were repeated with vocal tract lengths representative of an adult male and an approximately 6-year-old child. Results: On the basis of univariate analysis, measures of frequency predicted changes in magnitude, and temporal measures predicted changes in speaking rate consistent with the hypothesis. The combined effects of duration and magnitude showed that F2 was more sensitive to changes in magnitude at shorter word durations compared with longer word durations. This finding held across words and vocal tract length. Conclusions: Results suggest that there is an interaction between duration and magnitude that affects the slope of the F2 trajectory. The next step is to relate kinematics to F2 trajectory output using real speakers.

Subjects

Computer Simulation; Models, Biological; Movement; Speech/physiology; Vocal Cords/physiology; Adult; Biomechanical Phenomena; Child; Female; Humans; Male; Movement/physiology; Phonetics
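
The design in this abstract crosses 7 durations with 5 movement magnitudes to give 35 simulations per word, and reports that F2 was more sensitive to magnitude at shorter durations. The sketch below shows why a simple slope measure behaves that way. It assumes a straight-line F2 transition and treats movement magnitude as an F2 excursion in Hz; both are simplifications, the numeric values are invented, and the function names are hypothetical.

```python
from itertools import product

durations_ms = [100, 150, 200, 250, 300, 350, 400]  # 7 hypothetical durations
f2_extents_hz = [200, 400, 600, 800, 1000]          # 5 hypothetical F2 excursions


def f2_slope(extent_hz, duration_ms):
    """Average F2 slope (Hz/ms) for a straight-line transition."""
    return extent_hz / duration_ms


conditions = list(product(durations_ms, f2_extents_hz))
slopes = {(d, e): f2_slope(e, d) for d, e in conditions}
print(len(conditions))  # 7 durations x 5 magnitudes


def slope_gain(duration_ms, small_hz=200, large_hz=1000):
    """Change in slope between the smallest and largest excursion at one duration."""
    return f2_slope(large_hz, duration_ms) - f2_slope(small_hz, duration_ms)


print(slope_gain(100), slope_gain(400))
```

Because the same change in excursion is divided by a smaller duration, the slope changes more at short durations, which is consistent with the interaction the abstract describes.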

19.

Arizona Child Acoustic Database Repository.

Bunton, Kate; Story, Brad H.

Folia Phoniatr Logop ; 68(3): 107-111, 2016.

Article in English | MEDLINE | ID: mdl-27784009

ABSTRACT

OBJECTIVE: The goal of the Arizona Child Acoustic Database project was to obtain a large set of acoustic recordings, primarily vowels, collected from a cohort of children over a critical period of growth and development. METHOD: Data were recorded longitudinally from 63 children between the ages of 2;0 and 7;0 at 3-month intervals. The protocol included individual American English vowels and diphthongs, nonsense multi-vowel transitions, word-level multi-vowel sequences (e.g., Hawaii), single-syllable words targeting each American English vowel, short sentences, and conversation. RESULTS: Acoustic files are available for download through the University of Arizona Library Repository for use in future research projects. CONCLUSION: Longitudinal recordings may be of interest because they allow tracking of acoustic characteristics produced by an individual child during a period of rapid growth and speech development.

Subjects

Databases, Factual; Speech Acoustics; Acoustics; Arizona; Child; Child, Preschool; Communication; Female; Humans; Language; Male; Phonetics; Speech Perception

20.

The effects of physiological adjustments on the perceptual and acoustical characteristics of simulated laryngeal vocal tremor.

Lester, Rosemary A; Story, Brad H.

J Acoust Soc Am ; 138(2): 953-63, 2015 Aug.

Article in English | MEDLINE | ID: mdl-26328711

ABSTRACT

The purpose of this study was to determine if adjustments to the voice source [i.e., fundamental frequency (F0), degree of vocal fold adduction] or vocal tract filter (i.e., vocal tract shape for vowels) reduce the perception of simulated laryngeal vocal tremor and to determine if listener perception could be explained by characteristics of the acoustical modulations. This research was carried out using a computational model of speech production that allowed for precise control and manipulation of the glottal and vocal tract configurations. Forty-two healthy adults participated in a perceptual study involving pair-comparisons of the magnitude of "shakiness" with simulated samples of laryngeal vocal tremor. Results revealed that listeners perceived a higher magnitude of voice modulation when simulated samples had a higher mean F0, greater degree of vocal fold adduction, and vocal tract shape for /i/ vs /É/. However, the effect of F0 was significant only when glottal noise was not present in the acoustic signal. Acoustical analyses were performed with the simulated samples to determine the features that affected listeners' judgments. Based on regression analyses, listeners' judgments were predicted to some extent by modulation information present in both low and high frequency bands.

Subjects

Speech Disorders/physiopathology; Speech Perception/physiology; Tremor/physiopathology; Voice Quality/physiology; Acoustic Stimulation; Adolescent; Adult; Biomechanical Phenomena; Computer Simulation; Female; Glottis/physiopathology; Humans; Judgment; Laryngeal Muscles/physiopathology; Male; Middle Aged; Observer Variation; Phonetics; Psychoacoustics; Speech Acoustics; Vocal Cords/physiopathology; Young Adult
