Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 7.366
Filter
1.
J Acoust Soc Am ; 155(5): 3071-3089, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38717213

ABSTRACT

This study investigated how 40 Chinese learners of English as a foreign language (EFL learners) differed from 40 native English speakers in the production of four English tense-lax contrasts, /i-ɪ/, /u-ʊ/, /ɑ-ʌ/, and /æ-ε/, by examining the acoustic measurements of duration, the first three formant frequencies, and the slope of the first formant movement (F1 slope). The dynamic formant trajectory was modeled using discrete cosine transform coefficients to demonstrate the time-varying properties of formant trajectories. A discriminant analysis was employed to illustrate the extent to which Chinese EFL learners relied on different acoustic parameters. This study found that: (1) Chinese EFL learners overemphasized durational differences and weakened spectral differences for the /i-ɪ/, /u-ʊ/, and /ɑ-ʌ/ pairs, although they maintained sufficient spectral differences for /æ-ε/. In contrast, native English speakers predominantly used spectral differences across all four pairs; (2) in non-low tense-lax contrasts, unlike native English speakers, Chinese EFL learners failed to exhibit different F1 slope values, indicating a non-nativelike tongue-root placement during the articulatory process. The findings underscore the contribution of dynamic spectral patterns to the differentiation between English tense and lax vowels, and reveal the influence of precise articulatory gestures on the realization of the tense-lax contrast.


Subject(s)
Multilingualism , Phonetics , Speech Acoustics , Humans , Male , Female , Young Adult , Speech Production Measurement , Adult , Language , Acoustics , Learning , Voice Quality , Sound Spectrography , East Asian People
2.
Codas ; 36(4): e20230047, 2024.
Article in Portuguese, English | MEDLINE | ID: mdl-38808777

ABSTRACT

PURPOSE: To compare the acoustic measurements of Cepstral Peak Prominence Smoothed (CPPS) and Acoustic Voice Quality Index (AVQI) of children with normal and altered voices, to relationship with auditory-perceptual judgment (APJ) and to establish cut-off points. METHODS: Vocal recordings of the sustained vowel and number counting tasks of 185 children were selected from a database and submitted to acoustic analysis with extraction of CPPS and AVQI measurements, and to APJ. The APJ was performed individually for each task, classified as normal or altered, and for the tasks together defining whether the child would pass or fail in a situation of vocal screening. RESULTS: Children with altered APJ and who failed the screening had lower CPPS values and higher AVQI values, than those with normal APJ and who passed the screening. The APJ of the sustained vowel task was related to CPPS and AVQI, and APJ of the number counting task was related only to AVQI and CPPS numbers. The cut-off points that differentiate children with and without vocal deviation are 14.07 for the vowel CPPS, 7.62 for the CPPS numbers and 2.01 for the AVQI. CONCLUSION: Children with altered voices, have higher AVQI values and lower CPPS values, when detected in children with voices within the normal range. The acoustic measurements were related to the auditory perceptual judgment of vocal quality in the sustained vowel task, however, the number counting task was related only to the AVQI and CPPS. The cut-off points that differentiate children with and without vocal deviation are 14.07 for the CPPS vowel, 7.62 for the CPPS numbers and 2.01 for the AVQI. The three measures were similar in identifying voices without deviation and dysphonic voices.


OBJETIVO: Comparar as medidas acústicas de Cepstral Peak Prominence Smoothed (CPPS) e Acoustic Voice Quality Index (AVQI) de crianças com vozes normais e alteradas, relacionar com o julgamento perceptivo-auditivo (JPA) da voz e estabelecer pontos de corte. MÉTODO: Gravações vocais das tarefas de vogal sustentada e contagem de números de 185 crianças foram selecionadas em um banco de dados e submetidas a análise acústica com extração das medidas de CPPS e AVQI, e ao JPA. O JPA foi realizado individualmente para cada tarefa e as amostras foram classificadas posteriormente como normal ou alterada, e para as tarefas em conjunto definindo-se se a criança passaria ou falharia em uma situação de triagem vocal. RESULTADOS: Crianças com JPA alterado e que falharam na triagem apresentaram valores menores de CPPS e maiores de AVQI, do que as com JPA normal e que passaram na triagem. O JPA da tarefa de vogal sustentada se relacionou ao CPPS e AVQI, e da tarefa de contagem de números relacionou-se apenas ao AVQI e CPPS números. Os pontos de corte que diferenciam crianças com e sem desvio vocal são 14,07 para o CPPS vogal, 7,62 para o CPPS números e 2,01 para o AVQI. CONCLUSÃO: Crianças com JPA alterado apresentaram maiores valores de AVQI e menores valores de CPPs. O JPA da tarefa de vogal previu todas as medidas acústicas, porém, de contagem previu apenas as medidas extraídas dela. As três medidas foram semelhantes na identificação de vozes sem desvio e vozes disfônicas.


Subject(s)
Speech Acoustics , Voice Quality , Humans , Voice Quality/physiology , Child , Female , Male , Auditory Perception/physiology , Voice Disorders/diagnosis , Voice Disorders/physiopathology , Adolescent , Case-Control Studies , Speech Production Measurement , Judgment
3.
J Acoust Soc Am ; 155(5): 3521-3536, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38809098

ABSTRACT

This electromagnetic articulography study explores the kinematic profile of Intonational Phrase boundaries in Seoul Korean. Recent findings suggest that the scope of phrase-final lengthening is conditioned by word- and/or phrase-level prominence. However, evidence comes mainly from head-prominence languages, which conflate positions of word prosody with positions of phrasal prominence. Here, we examine phrase-final lengthening in Seoul Korean, an edge-prominence language with no word prosody, with respect to focus location as an index of phrase-level prominence and Accentual Phrase (AP) length as an index of word demarcation. Results show that phrase-final lengthening extends over the phrase-final syllable. The effect is greater the further away that focus occurs. It also interacts with the domains of AP and prosodic word: lengthening is greater in smaller APs, whereas shortening is observed in the initial gesture of the phrase-final word. Additional analyses of kinematic displacement and peak velocity revealed that Korean phrase-final gestures bear the kinematic profile of IP boundaries concurrently to what is typically considered prominence marking. Based on these results, a gestural coordination account is proposed, in which boundary-related events interact systematically with phrase-level prominence as well as lower prosodic levels, and how this proposal relates to the findings in head-prominence languages is discussed.


Subject(s)
Phonetics , Speech Acoustics , Humans , Male , Female , Young Adult , Biomechanical Phenomena , Adult , Language , Gestures , Speech Production Measurement , Republic of Korea , Voice Quality , Time Factors
4.
J Commun Disord ; 109: 106428, 2024.
Article in English | MEDLINE | ID: mdl-38744198

ABSTRACT

PURPOSE: This study examines whether there are differences in the speech of speakers with dysarthria, speakers with apraxia and healthy speakers in spectral acoustic measures during production of the central-peninsular Spanish alveolar sibilant fricative /s/. METHOD: To this end, production of the sibilant was analyzed in 20 subjects with dysarthria, 8 with apraxia of speech and 28 healthy speakers. Participants produced 12 sV(C) words. The variables compared across groups were the fricative's spectral amplitude difference (AmpD) and spectral moments in the temporal midpoint of fricative execution. RESULTS: The results indicate that individuals with dysarthria can be distinguished from healthy speakers in terms of the spectral characteristics AmpD, standard deviation (SD), center of gravity (CoG) and skewness, the last two in context with unrounded vowel, while no differences in kurtosis were detected. Participants with AoS group differ significantly from healthy speaker group in AmpD, SD and CoG and Kurtosis, the first one followed unrounded vowel and the latter two followed by rounded vowels. In addition, speakers with apraxia of speech group returned significant differences with respect to speakers with dysarthria group in AmpD, CoG and skewness. CONCLUSIONS: The differences found between the groups in the measures studied as a function of the type of vowel context could provide insights into the distinctive manifestations of motor speech disorders, contributing to the differential diagnosis between apraxia and dysarthria in motor control processes.


Subject(s)
Apraxias , Dysarthria , Speech Acoustics , Humans , Dysarthria/physiopathology , Dysarthria/etiology , Apraxias/physiopathology , Male , Female , Middle Aged , Adult , Aged , Phonetics , Speech Production Measurement
5.
J Speech Lang Hear Res ; 67(6): 1712-1730, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38749007

ABSTRACT

PURPOSE: The goal of this study was to assess various recording methods, including combinations of high- versus low-cost microphones, recording interfaces, and smartphones in terms of their ability to produce commonly used time- and spectral-based voice measurements. METHOD: Twenty-four vowel samples representing a diversity of voice quality deviations and severities from a wide age range of male and female speakers were played via a head-and-thorax model and recorded using a high-cost, research standard GRAS 40AF (GRAS Sound & Vibration) microphone and amplification system. Additional recordings were made using various combinations of headset microphones (AKG C555 L [AKG Acoustics GmbH], Shure SM35-XLR [Shure Incorporated], AVID AE-36 [AVID Products, Inc.]) and audio interfaces (Focusrite Scarlett 2i2 [Focusrite Audio Engineering Ltd.] and PC, Focusrite and smartphone, smartphone via a TRRS adapter), as well as smartphones direct (Apple iPhone 13 Pro, Google Pixel 6) using their built-in microphones. The effect of background noise from four different room conditions was also evaluated. Vowel samples were analyzed for measures of fundamental frequency, perturbation, cepstral peak prominence, and spectral tilt (low vs. high spectral ratio). RESULTS: Results show that a wide variety of recording methods, including smartphones with and without a low-cost headset microphone, can effectively track the wide range of acoustic characteristics in a diverse set of typical and disordered voice samples. Although significant differences in acoustic measures of voice may be observed, the presence of extremely strong correlations (rs > .90) with the recording standard implies a strong linear relationship between the results of different methods that may be used to predict and adjust any observed differences in measurement results. CONCLUSION: Because handheld smartphone distance and positioning may be highly variable when used in actual clinical recording situations, smartphone + a low-cost headset microphone is recommended as an affordable recording method that controls mouth-to-microphone distance and positioning and allows both hands to be available for manipulation of the smartphone device.


Subject(s)
Smartphone , Speech Acoustics , Humans , Female , Male , Adult , Young Adult , Speech Production Measurement/instrumentation , Speech Production Measurement/methods , Reproducibility of Results , Voice Quality , Middle Aged , Adolescent
6.
J Speech Lang Hear Res ; 67(6): 1660-1681, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38758676

ABSTRACT

PURPOSE: Literature suggests a dependency of the acoustic metrics, smoothed cepstral peak prominence (CPPS) and harmonics-to-noise ratio (HNR), on human voice loudness and fundamental frequency (F0). Even though this has been explained with different oscillatory patterns of the vocal folds, so far, it has not been specifically investigated. In the present work, the influence of three elicitation levels, calibrated sound pressure level (SPL), F0 and vowel on the electroglottographic (EGG) and time-differentiated EGG (dEGG) metrics hybrid open quotient (OQ), dEGG OQ and peak dEGG, as well as on the acoustic metrics CPPS and HNR, was examined, and their suitability for voice assessment was evaluated. METHOD: In a retrospective study, 29 women with a mean age of 25 years (± 8.9, range: 18-53) diagnosed with structural vocal fold pathologies were examined before and after voice therapy or phonosurgery. Both acoustic and EGG signals were recorded simultaneously during the phonation of the sustained vowels /ɑ/, /i/, and /u/ at three elicited levels of loudness (soft/comfortable/loud) and unconstrained F0 conditions. RESULTS: A linear mixed-model analysis showed a significant effect of elicitation effort levels on peak dEGG, HNR, and CPPS (all p < .01). Calibrated SPL significantly influenced HNR and CPPS (both p < .01). Furthermore, F0 had a significant effect on peak dEGG and CPPS (p < .0001). All metrics showed significant changes with regard to vowel (all p < .05). However, the treatment had no effect on the examined metrics, regardless of the treatment type (surgery vs. voice therapy). CONCLUSIONS: The value of the investigated metrics for voice assessment purposes when sampled without sufficient control of SPL and F0 is limited, in that they are significantly influenced by the phonatory context, be it speech or elicited sustained vowels. Future studies should explore the diagnostic value of new data collation approaches such as voice mapping, which take SPL and F0 effects into account.


Subject(s)
Dysphonia , Speech Acoustics , Humans , Female , Adult , Dysphonia/physiopathology , Dysphonia/therapy , Retrospective Studies , Young Adult , Middle Aged , Adolescent , Voice Quality/physiology , Electrodiagnosis/methods , Glottis/physiopathology , Phonation/physiology , Vocal Cords/physiopathology , Voice Training , Speech Production Measurement/methods
7.
J Acoust Soc Am ; 155(5): 3206-3212, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38738937

ABSTRACT

Modern humans and chimpanzees share a common ancestor on the phylogenetic tree, yet chimpanzees do not spontaneously produce speech or speech sounds. The lab exercise presented in this paper was developed for undergraduate students in a course entitled "What's Special About Human Speech?" The exercise is based on acoustic analyses of the words "cup" and "papa" as spoken by Viki, a home-raised, speech-trained chimpanzee, as well as the words spoken by a human. The analyses allow students to relate differences in articulation and vocal abilities between Viki and humans to the known anatomical differences in their vocal systems. Anatomical and articulation differences between humans and Viki include (1) potential tongue movements, (2) presence or absence of laryngeal air sacs, (3) presence or absence of vocal membranes, and (4) exhalation vs inhalation during production.


Subject(s)
Pan troglodytes , Speech Acoustics , Speech , Humans , Animals , Pan troglodytes/physiology , Speech/physiology , Tongue/physiology , Tongue/anatomy & histology , Vocalization, Animal/physiology , Species Specificity , Speech Production Measurement , Larynx/physiology , Larynx/anatomy & histology , Phonetics
8.
J Acoust Soc Am ; 155(4): 2836-2848, 2024 Apr 01.
Article in English | MEDLINE | ID: mdl-38682915

ABSTRACT

This paper evaluates an innovative framework for spoken dialect density prediction on children's and adults' African American English. A speaker's dialect density is defined as the frequency with which dialect-specific language characteristics occur in their speech. Rather than treating the presence or absence of a target dialect in a user's speech as a binary decision, instead, a classifier is trained to predict the level of dialect density to provide a higher degree of specificity in downstream tasks. For this, self-supervised learning representations from HuBERT, handcrafted grammar-based features extracted from ASR transcripts, prosodic features, and other feature sets are experimented with as the input to an XGBoost classifier. Then, the classifier is trained to assign dialect density labels to short recorded utterances. High dialect density level classification accuracy is achieved for child and adult speech and demonstrated robust performance across age and regional varieties of dialect. Additionally, this work is used as a basis for analyzing which acoustic and grammatical cues affect machine perception of dialect.


Subject(s)
Black or African American , Speech Acoustics , Humans , Adult , Child , Male , Female , Speech Production Measurement/methods , Language , Child, Preschool , Young Adult , Speech Perception , Adolescent , Phonetics , Child Language
9.
Codas ; 36(3): e20230175, 2024.
Article in English | MEDLINE | ID: mdl-38629682

ABSTRACT

PURPOSE: To assess the influence of the listener experience, measurement scales and the type of speech task on the auditory-perceptual evaluation of the overall severity (OS) of voice deviation and the predominant type of voice (rough, breathy or strain). METHODS: 22 listeners, divided into four groups participated in the study: speech-language pathologist specialized in voice (SLP-V), SLP non specialized in voice (SLP-NV), graduate students with auditory-perceptual analysis training (GS-T), and graduate students without auditory-perceptual analysis training (GS-U). The subjects rated the OS of voice deviation and the predominant type of voice of 44 voices by visual analog scale (VAS) and the numerical scale (score "G" from GRBAS), corresponding to six speech tasks such as sustained vowel /a/ and /ɛ/, sentences, number counting, running speech, and all five previous tasks together. RESULTS: Sentences obtained the best interrater reliability in each group, using both VAS and GRBAS. SLP-NV group demonstrated the best interrater reliability in OS judgment in different speech tasks using VAS or GRBAS. Sustained vowel (/a/ and /ɛ/) and running speech obtained the best interrater reliability among the groups of listeners in judging the predominant vocal quality. GS-T group got the best result of interrater reliability in judging the predominant vocal quality. CONCLUSION: The time of experience in the auditory-perceptual judgment of the voice, the type of training to which they were submitted, and the type of speech task influence the reliability of the auditory-perceptual evaluation of vocal quality.


Subject(s)
Dysphonia , Speech Perception , Humans , Speech , Reproducibility of Results , Speech Production Measurement , Observer Variation , Voice Quality , Speech Acoustics
10.
J Speech Lang Hear Res ; 67(5): 1370-1384, 2024 May 07.
Article in English | MEDLINE | ID: mdl-38619435

ABSTRACT

OBJECTIVES: The study aimed to investigate the predictive potential of language environment and vocal development status measures obtained through integrated analysis of Language ENvironment Analysis (LENA) recordings during the prelinguistic stage for subsequent speech and language development in Korean-acquiring children. Specifically, this study explored whether measures from both LENA-automated analysis and human coding at 6-8 months and 12-14 months of age predict vocabulary and phonological development at 18-20 months. METHOD: One-day home recordings from 20 children were collected using a LENA recorder at 6-8 months, 12-14 months, and 18-20 months. Both LENA-automated measures and measures from human coding were obtained from recordings at 6-8 months and 12-14 months. The number of different words, consonant inventory, and utterance structure inventory were identified from recordings of 18-20 months. Correlation and multiple regression analyses were performed to investigate whether measures related to early language environment and child vocalization at 6-8 months and 12-14 months were predictive of vocabulary and phonological measures at 18-20 months. RESULTS: The results showed that the two main LENA-automated measures, conversational turn count (CTC) and child vocalization count, were positively correlated with all vocabulary and phonological measures at 18-20 months. Multiple regression analysis revealed that CTC during the prelinguistic stages was the most significant predictor of a number of different words, consonant inventory, and utterance structure inventory at 18-20 months. Also, adult word count in LENA-automated measures, child-directed speech ratio, and canonical babbling ratio measured by human coding significantly predicted some vocabulary and phonological measures at 18-20 months. CONCLUSION: This study highlights the multifaceted nature of language acquisition and collectively emphasizes the value of considering both quantitative and qualitative aspects of language input to understand early language development in children.


Subject(s)
Child Language , Language Development , Speech , Vocabulary , Humans , Male , Female , Infant , Speech/physiology , Phonetics , Speech Production Measurement/methods
11.
J Speech Lang Hear Res ; 67(6): 1643-1659, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38683058

ABSTRACT

PURPOSE: The aim of this study was to determine (a) diagnostic accuracy of acoustic measures of glottal stop production (GSP; intensity differences, slopes, complete voicing cessation) to distinguish between unilateral vocal fold paresis/paralysis (UVFP) patients and controls; (b) if acoustic measures of GSP significantly correlated with an acoustic measure of voice disorder severity, acoustic voice quality index (AVQI); and (c) if acoustic measures from another type of voicing cessation, voiceless consonant production, also significantly differed between groups. METHOD: Ninety-seven patients with unilateral paresis/paralysis and 35 controls with normal laryngostroboscopic signs produced two sets of five repeated [i] and four repeated [isi]. Tokens were randomized by type between groups and analyzed blinded using a customized Praat program that computed intensity differences and slopes between vowel maxima and glottal stop minima for inter-[i] tokens and vowel maxima and voiceless consonant minima for intra-[isi] tokens. The number of voicing cessations for inter-[i] tokens was obtained. RESULTS: Onset and offset intensity differences and number of voicing cessations from inter-[i] tokens had the greatest areas under the curve (.854, .856, and .835, respectively). Correlation coefficients were significant (p < .01) between AVQI and all GSP acoustic measures with weak/medium effect sizes. No significant differences were found between controls and participants with UVFP for acoustic measures from intra-[isi]. CONCLUSIONS: Acoustic GSP measures demonstrated good diagnostic accuracy and some relationship to severity of voice disorder. No significant differences in acoustic measures for medial voiceless fricative consonants between controls and participants with UVFP suggested that voicing cessation for voiceless fricatives differs from voicing cessation for GSP.


Subject(s)
Glottis , Speech Acoustics , Vocal Cord Paralysis , Voice Quality , Humans , Vocal Cord Paralysis/physiopathology , Vocal Cord Paralysis/diagnosis , Male , Female , Middle Aged , Adult , Retrospective Studies , Glottis/physiopathology , Voice Quality/physiology , Aged , Speech Production Measurement/methods , Young Adult , Severity of Illness Index , Voice Disorders/diagnosis , Voice Disorders/physiopathology
12.
J Speech Lang Hear Res ; 67(6): 1682-1711, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38662942

ABSTRACT

PURPOSE: Pitch variations (tone productions) have been reported as a measure to differentiate Cantonese-speaking children with and without childhood apraxia of speech (CAS). This study aims to examine fundamental frequency (F0) changes within syllables and the effects of syllable structure, lexical status, and syllable positions on F0 in Cantonese-speaking preschool children with and without CAS. METHOD: Six children with CAS, six children with non-CAS speech sound disorder plus language disorder (S&LD), 22 children with speech sound disorder only (SSD), and 63 children with typical speech-language development (TD) performed the tone sequencing task (TST). Growth curve analysis was employed to analyze and compare the F0 values within syllables with three Cantonese tones (high level, high rising, and low falling). The analysis considered the effects of syllable structure (vowel and consonant-vowel), lexical status (word and nonword), and syllable position (initial, medial, and final) on F0, as well as comparisons within and between groups. RESULTS: Within each group, the effects of syllable structure and position on F0 values were found with different patterns. Between-group comparisons showed that the CAS group had reduced F0 contrasts. The CAS group could be differentiated from the control groups based on interactions of F0 with syllable structure and position, but not lexical status. The dissimilarity of F0 values detected between the CAS and SSD/TD groups was more prominent than that observed between the CAS and S&LD groups. CONCLUSIONS: This study demonstrated that Cantonese-speaking children with CAS had difficulty in varying F0 within syllables as compared to those without CAS, suggesting pitch variation difficulty and language-specific impairment profiles in CAS. Future investigations of objective measures for identifying Cantonese speakers with CAS and cross-linguistic investigations using growth curve analysis and the TST are suggested.


Subject(s)
Apraxias , Phonetics , Humans , Child, Preschool , Apraxias/diagnosis , Male , Female , Speech Acoustics , Speech Sound Disorder/diagnosis , Speech Production Measurement/methods , Language , Speech/physiology
13.
Codas ; 36(2): e20230065, 2024.
Article in Portuguese, English | MEDLINE | ID: mdl-38537026

ABSTRACT

PURPOSE: To seek evidence of validity and reliability for the Compressed Speech Test with Figures. METHODS: The study was subdivided into three stages: construct validation, criteria and reliability. All participants were aged between 6:00 and 8:11. For the construct, Compressed Speech with Figures and the gold standard Adapted Compressed Speech test were applied to children with typical phonological development. For criterion analysis, Compressed Speech with Figures was applied in two groups, with typical (G1) and atypical (G2) phonological development. Finally, the application protocols underwent analysis by two Speech Therapists, with experience in the area of Central Auditory Processing, seeking to obtain an inter-evaluator reliability analysis. RESULTS: The correlation test indicated an almost perfect construct (correlation 0.843 for the right ear and 0.823 for the left ear). In the criterion analysis, it was noticed that both groups presented satisfactory results (G1 = 99.6 to 100%; G2 = 96 to 96.5%). The reliability analysis demonstrated that the protocol is easy to analyze, as both professionals presented unanimous responses. CONCLUSION: It was possible to obtain evidence of validity and reliability for the Compressed Speech with Figures instrument. The construct analysis showed that the instrument measures the same variable as the gold standard test, with an almost perfect correlation. In the criterion analysis, both groups presented similar performance, demonstrating that the instrument does not seem to differentiate populations with and without mild phonological disorder. The inter-evaluator reliability analysis demonstrated that the protocol is easy to analyze and score.


OBJETIVO: Buscar evidências de validade e fidedignidade para o Teste de Fala Comprimida com Figuras. MÉTODO: O estudo foi subdividido em três etapas: validação de construto, critério e fidedignidade. Todos os participantes tinham idade entre 6:00 e 8:11. Para o construto, aplicou-se o Fala Comprimida com Figuras e o teste padrão ouro Fala Comprimida Adaptado em crianças com desenvolvimento fonológico típico. Para análise de critério, aplicou-se o Fala Comprimida com Figuras em dois grupos, com desenvolvimento fonológico típico (G1) e atípico (G2). Por fim, os protocolos de aplicação passaram pela análise de duas Fonoaudiólogas, com experiência na área do Processamento Auditivo Central, buscando obter uma análise de fidedignidade interavaliadores. RESULTADOS: O teste de correlação indicou um construto quase perfeito (Rho=0,843 para orelha direita e Rho=0,823 para orelha esquerda). Na análise de critério, percebeu-se que ambos os grupos apresentaram resultados satisfatórios (G1 = 99,6 a 100%; G2 = 96 a 96,5%). Já a análise de fidedignidade demonstrou que o protocolo é de fácil análise, pois ambos os profissionais apresentaram respostas unânimes. CONCLUSÃO: Foi possível obter evidências de validade e fidedignidade para o instrumento de Fala Comprimida com Figuras. A análise de construto evidenciou que o instrumento mede a mesma variável que o teste padrão outro, com correlação quase perfeita. Na análise de critério, ambos os grupos apresentaram desempenho semelhante, demonstrando que o instrumento não parece diferenciar populações com e sem transtorno fonológico leve. A análise de fidedignidade interavaliador demonstrou que o protocolo é de fácil análise e pontuação.


Subject(s)
Speech Sound Disorder , Speech , Child , Humans , Speech/physiology , Reproducibility of Results , Speech Production Measurement , Phonetics
14.
Am J Speech Lang Pathol ; 33(3): 1420-1431, 2024 May.
Article in English | MEDLINE | ID: mdl-38451741

ABSTRACT

PURPOSE: Differences in inhibitory control and cognitive flexibility between children who stutter (CWS) and children who do not stutter (CWNS) have been previously demonstrated. The aim of the current study was to investigate whether the previously reported inhibitory control- and cognitive flexibility-related performance costs for CWS are associated with the number of speech disfluencies that they produce. METHOD: Participants were 19 CWS (Mage = 7.58 years, range: 6.08-9.17) and 19 CWNS matched on age and gender (Mage = 7.58 years, range: 6.08-9.33). Gamma regression models were used to investigate possible associations between performance costs in speed and accuracy measured during a computer task evaluating inhibitory control and cognitive flexibility and the number of speech disfluencies during video-recorded speech samples (story retelling and casual conversation). RESULTS: Two significant interactions were observed. For both inhibitory control and cognitive flexibility, we identified a significant group and inhibitory control/cognitive flexibility performance-cost interaction in stuttering-like disfluencies (SLDs), indicating that the performance-cost effects on SLD production were significantly higher in the CWS group, compared to the CWNS group. CONCLUSIONS: CWS with reduced inhibitory control or cognitive flexibility produce more SLDs, but not other disfluencies. These results are partly in line with some previous findings in nonstuttering and stuttering populations linking inhibitory control and cognitive flexibility weaknesses to the production of speech disfluencies.


Subject(s)
Cognition , Inhibition, Psychological , Stuttering , Humans , Stuttering/psychology , Stuttering/physiopathology , Stuttering/diagnosis , Male , Child , Female , Speech Production Measurement , Child Behavior , Case-Control Studies
15.
Am J Speech Lang Pathol ; 33(3): 1283-1300, 2024 May.
Article in English | MEDLINE | ID: mdl-38483199

ABSTRACT

PURPOSE: This study examined whether the "Three Bears Passage" (TB), a standard Mandarin reading passage, could elicit significant vocal range variations in individuals with voice disorders. Relative sensitivity of TB versus another existing standard reading passage, "Passage in Mandarin" (PM), for differentiating between individuals with and without voice disorders was also evaluated. METHOD: Forty-two individuals with normal voice and 30 individuals with voice disorders participated in the study. Maximum fundamental frequency (f0), minimum f0, mean f0, f0 range, maximum vocal intensity, minimum intensity, mean intensity, and intensity range of all participants reading aloud the two passages were measured with Praat to construct speech range profiles (SRPs). RESULTS: Significantly larger vocal range was found for TB than for PM in individuals with voice disorders, including significantly higher maximum f0, mean f0, maximum intensity, mean intensity, and significantly larger f0 range and intensity range. Significantly more limited vocal range was observed in individuals with voice disorders than those without, with more obviously restricted SRPs while reading aloud TB compared to PM. Receiver operating characteristic analysis suggested that TB was more sensitive than PM in distinguishing between individuals with and without voice disorders. CONCLUSIONS: Our findings supported the potential of TB as a standard clinical assessment tool for evaluating pathological changes in vocal range. Future studies should explore if therapeutic approaches based on the passage or variations of it could be developed for overcoming functional limitations and restrictions in vocal range for specific voice disorders.


Subject(s)
Reading , Speech Acoustics , Voice Disorders , Voice Quality , Humans , Male , Female , Adult , Voice Disorders/diagnosis , Voice Disorders/physiopathology , Young Adult , Speech Production Measurement , Middle Aged , ROC Curve , Language , Case-Control Studies , Adolescent
16.
Am J Speech Lang Pathol ; 33(3): 1113-1126, 2024 May.
Article in English | MEDLINE | ID: mdl-38501906

ABSTRACT

PURPOSE: The study of gender and speech has historically excluded studies of transmasculine individuals. Consequently, generalizations about speech and gender are based on cisgender individuals. This lack of representation hinders clinical training and clinical service delivery, particularly by speech-language pathologists providing gender-affirming communication services. This letter describes a new corpus of the speech of American English-speaking transmasculine men, transmasculine nonbinary people, and cisgender men that is open and available to clinicians and researchers. METHOD: Twenty masculine-presenting native English speakers from the Upper Midwestern United States (including cisgender men, transmasculine men, and transmasculine nonbinary people) were recorded, producing three sets of speech materials: Consensus Auditory-Perceptual Evaluation of Voice sentences, the Rainbow Passage, and a novel set of sentences developed for this project. Acoustic measures vowels (overall formant frequency scaling, vowel-space dispersion, fundamental frequency, breathiness), consonants (voice onset time of word-initial voiceless stops, spectral moments of word-initial /s/), and the entire sentence (rate of speech) that were made. RESULTS: The acoustic measures reveal a wide range for all dependent measures and low correlations among the measures. Results show that many of the voices depart considerably from the norms for men's speech in published studies. CONCLUSION: This new corpus can be used to illustrate different ways of sounding masculine by speech-language pathologists performing gender-affirming communication services and by higher education teachers as examples of diverse ways of sounding masculine.


Subject(s)
Speech Acoustics , Speech Production Measurement , Transgender Persons , Voice Quality , Humans , Male , Transgender Persons/psychology , Adult , Young Adult , Speech-Language Pathology/methods , Female , Middle Aged , Phonetics
17.
Am J Speech Lang Pathol ; 33(3): 1485-1503, 2024 May.
Article in English | MEDLINE | ID: mdl-38512040

ABSTRACT

PURPOSE: Motor deficits are widely documented among autistic individuals, and speech characteristics consistent with a motor speech disorder have been reported in prior literature. We conducted an auditory-perceptual analysis of speech production skills in low and minimally verbal autistic individuals as a step toward clarifying the nature of speech production impairments in this population and the potential link between oromotor functioning and language development. METHOD: Fifty-four low or minimally verbal autistic individuals aged 4-18 years were video-recorded performing nonspeech oromotor tasks and producing phonemes, syllables, and words in imitation. Three trained speech-language pathologists provided auditory perceptual ratings of 11 speech features reflecting speech subsystem performance and overall speech production ability. The presence, attributes, and severity of signs of oromotor dysfunction were analyzed, as were relative performance on nonspeech and speech tasks and correlations between perceptual speech features and language skills. RESULTS AND CONCLUSIONS: Our findings provide evidence of a motor speech disorder in this population, characterized by perceptual speech features including reduced intelligibility, decreased consonant and vowel precision, and impairments of speech coordination and consistency. Speech deficits were more associated with articulation than with other speech subsystems. Speech production was more impaired than nonspeech oromotor abilities in a subgroup of the sample. Oromotor deficits were significantly associated with expressive and receptive language skills. Findings are interpreted in the context of known characteristics of the pediatric motor speech disorders childhood apraxia of speech and childhood dysarthria. These results, if replicated in future studies, have significant potential to improve the early detection of language impairments, inform the development of speech and language interventions, and aid in the identification of neurobiological mechanisms influencing communication development.


Subject(s)
Speech Intelligibility , Humans , Child , Child, Preschool , Male , Adolescent , Female , Speech Perception , Speech Production Measurement , Autistic Disorder/psychology , Autistic Disorder/complications , Autistic Disorder/diagnosis , Video Recording , Speech Disorders/diagnosis , Speech Disorders/physiopathology , Speech-Language Pathology/methods , Articulation Disorders/diagnosis
18.
Am J Speech Lang Pathol ; 33(3): 1390-1405, 2024 May.
Article in English | MEDLINE | ID: mdl-38530396

ABSTRACT

PURPOSE: Changes in voice and speech are characteristic symptoms of Huntington's disease (HD). Objective methods for quantifying speech impairment that can be used across languages could facilitate assessment of disease progression and intervention strategies. The aim of this study was to analyze acoustic features to identify language-independent features that could be used to quantify speech dysfunction in English-, Spanish-, and Polish-speaking participants with HD. METHOD: Ninety participants with HD and 83 control participants performed sustained vowel, syllable repetition, and reading passage tasks recorded with previously validated methods using mobile devices. Language-independent features that differed between HD and controls were identified. Principal component analysis (PCA) and unsupervised clustering were applied to the language-independent features of the HD data set to identify subgroups within the HD data. RESULTS: Forty-six language-independent acoustic features that were significantly different between control participants and participants with HD were identified. Following dimensionality reduction using PCA, four speech clusters were identified in the HD data set. Unified Huntington's Disease Rating Scale (UHDRS) total motor score, total functional capacity, and composite UHDRS were significantly different for pairwise comparisons of subgroups. The percentage of HD participants with higher dysarthria score and disease stage also increased across clusters. CONCLUSION: The results support the application of acoustic features to objectively quantify speech impairment and disease severity in HD in multilanguage studies. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.25447171.


Subject(s)
Huntington Disease , Speech Acoustics , Speech Production Measurement , Humans , Huntington Disease/diagnosis , Huntington Disease/complications , Male , Female , Middle Aged , Adult , Case-Control Studies , Aged , Dysarthria/diagnosis , Dysarthria/etiology , Dysarthria/physiopathology , Principal Component Analysis , Voice Quality , Speech Disorders/diagnosis , Speech Disorders/etiology , Predictive Value of Tests
19.
Phonetica ; 81(3): 321-349, 2024 Jun 25.
Article in English | MEDLINE | ID: mdl-38522003

ABSTRACT

This study investigates the variation in phrase-final f0 movements found in dyadic unscripted conversations in Papuan Malay, an Eastern Indonesian language. This is done by a novel combination of exploratory and confirmatory classification techniques. In particular, this study investigates the linguistic factors that potentially drive f0 contour variation in phrase-final words produced in a naturalistic interactive dialogue task. To this end, a cluster analysis, manual labelling and random forest analysis are carried out to reveal the main sources of contour variation. These are: taking conversational interaction into account; turn transition, topic continuation, information structure (givenness and contrast), and context-independent properties of words such as word class, syllable structure, voicing and intrinsic f0. Results indicate that contour variation in Papuan Malay, in particular f0 direction and target level, is best explained by turn transitions between speakers, corroborating similar findings for related languages. The applied methods provide opportunities to further lower the threshold of incorporating intonation and prosody in the early stages of language documentation.


Subject(s)
Language , Phonetics , Humans , Female , Male , Indonesia , Speech Acoustics , Adult , Linguistics , Speech Production Measurement
20.
Eur Arch Otorhinolaryngol ; 281(5): 2707-2716, 2024 May.
Article in English | MEDLINE | ID: mdl-38319369

ABSTRACT

PURPOSE: This cross-sectional study aimed to investigate the potential of voice analysis as a prescreening tool for type II diabetes mellitus (T2DM) by examining the differences in voice recordings between non-diabetic and T2DM participants. METHODS: 60 participants diagnosed as non-diabetic (n = 30) or T2DM (n = 30) were recruited on the basis of specific inclusion and exclusion criteria in Iran between February 2020 and September 2023. Participants were matched according to their year of birth and then placed into six age categories. Using the WhatsApp application, participants recorded the translated versions of speech elicitation tasks. Seven acoustic features [fundamental frequency, jitter, shimmer, harmonic-to-noise ratio (HNR), cepstral peak prominence (CPP), voice onset time (VOT), and formant (F1-F2)] were extracted from each recording and analyzed using Praat software. Data was analyzed with Kolmogorov-Smirnov, two-way ANOVA, post hoc Tukey, binary logistic regression, and student t tests. RESULTS: The comparison between groups showed significant differences in fundamental frequency, jitter, shimmer, CPP, and HNR (p < 0.05), while there were no significant differences in formant and VOT (p > 0.05). Binary logistic regression showed that shimmer was the most significant predictor of the disease group. There was also a significant difference between diabetes status and age, in the case of CPP. CONCLUSIONS: Participants with type II diabetes exhibited significant vocal variations compared to non-diabetic controls.


Subject(s)
Diabetes Mellitus, Type 2 , Voice , Humans , Voice Quality , Speech Acoustics , Diabetes Mellitus, Type 2/complications , Cross-Sectional Studies , Speech Production Measurement , Acoustics
SELECTION OF CITATIONS
SEARCH DETAIL
...