Results 1 - 20 of 28
1.
Augment Altern Commun ; : 1-12, 2024 Jul 11.
Article in English | MEDLINE | ID: mdl-38990060

ABSTRACT

It is well-known that children with expressive communication difficulties have the right to communicate, but they should also have the right to do so in whichever language they choose, with a voice that closely matches their age, gender, and dialect. This study aimed to develop naturalistic synthetic child speech, matching the vocal identity of three children with expressive communication difficulties, using Tacotron 2, for three under-resourced South African languages, namely South African English (SAE), Afrikaans, and isiXhosa. Due to the scarcity of child speech corpora, 2 hours of child speech data per child was collected from three 11- to 12-year-old children. Two adult models were used to "warm start" the child speech synthesis. To determine the naturalness of the synthetic voices, 124 listeners participated in a mean opinion score survey (Likert Score) and optionally gave qualitative feedback. Despite limited training data used in this study, we successfully developed a synthesized child voice of adequate quality in each language. This study highlights that with recent technological advancements, it is possible to develop synthetic child speech that matches the vocal identity of a child with expressive communication difficulties in different under-resourced languages.

2.
J Acoust Soc Am ; 155(1): 294-305, 2024 01 01.
Article in English | MEDLINE | ID: mdl-38230970

ABSTRACT

This study constitutes an investigation into the acoustic variability of intervocalic alveolar taps in a corpus of spontaneous speech from Madrid, Spain. Substantial variability was documented in this segment, with highly reduced variants constituting roughly half of all tokens during spectrographic inspection. In addition to qualitative documentation, the intensity difference between the tap and surrounding vowels was measured. Changes in this intensity difference were statistically modeled using Bayesian finite mixture models containing lexical and phonetic predictors. Model comparisons indicate predictive performance is improved when we assume two latent categories, interpreted as two pronunciation variants for the Spanish tap. In interpreting the model, predictors were more often related to categorical changes in which pronunciation variant was produced than to gradient intensity changes within each tap type. Variability in tap production was found according to lexical frequency, speech rate, and phonetic environment. These results underscore the importance of evaluating model fit to the data as well as what researchers modeling phonetic variability can gain in moving past linear models when they do not adequately fit the observed data.


Subjects
Speech Acoustics; Speech Perception; Bayes Theorem; Speech; Phonetics; Acoustics
3.
J Acoust Soc Am ; 154(3): 1896-1902, 2023 09 01.
Article in English | MEDLINE | ID: mdl-37756577

ABSTRACT

This study examines the role of frequencies above 8 kHz in the classification of conversational speech fricatives [f, v, θ, ð, s, z, ʃ, ʒ, h] in random forest modeling. Prior research has mostly focused on spectral measures for fricative categorization using frequency information below 8 kHz. The contribution of higher frequencies has received only limited attention, especially for non-laboratory speech. In the present study, we use a corpus of sociolinguistic interview recordings from Western Canadian English sampled at 44.1 and 16 kHz. For both sampling rates, we analyze spectral measures obtained using Fourier analysis and the multitaper method, and we also compare models without and with amplitudinal measures. Results show that while frequency information above 8 kHz does not improve classification accuracy in random forest analyses, inclusion of such frequencies can affect the relative importance of specific measures. This includes a decreased contribution of center of gravity and an increased contribution of spectral standard deviation for the higher sampling rate. We also find no major differences in classification accuracy between Fourier and multitaper measures. The inclusion of power measures improves model accuracy but does not change the overall importance of spectral measures.


Subjects
Communication; Language; Canada; Linguistics; Random Forest
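
The two spectral measures named in the abstract above, center of gravity and spectral standard deviation, can be illustrated with a minimal sketch. It assumes a single Hamming-windowed frame and a plain Fourier power spectrum; the function name `spectral_moments`, the `max_freq` parameter, and the synthetic noise input are hypothetical and are not taken from the study. Truncating the spectrum at 8 kHz only approximates what a 16 kHz recording would contain.

```python
import numpy as np

def spectral_moments(frame, sample_rate, max_freq=None):
    """Return (center of gravity, spectral standard deviation) in Hz.

    Illustrative only: one Hamming-windowed frame, power-weighted moments.
    """
    windowed = np.asarray(frame, dtype=float) * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    if max_freq is not None:
        # Assumption: discard bins above max_freq to mimic a lower bandwidth.
        keep = freqs <= max_freq
        power, freqs = power[keep], freqs[keep]
    weights = power / power.sum()
    cog = float(np.sum(freqs * weights))
    sd = float(np.sqrt(np.sum(weights * (freqs - cog) ** 2)))
    return cog, sd

# Synthetic white noise stands in for a fricative frame (not real speech data).
rng = np.random.default_rng(0)
noise = rng.normal(size=4410)
full_cog, full_sd = spectral_moments(noise, 44100)
low_cog, low_sd = spectral_moments(noise, 44100, max_freq=8000)
print(f"full band: COG={full_cog:.0f} Hz, SD={full_sd:.0f} Hz")
print(f"below 8 kHz: COG={low_cog:.0f} Hz, SD={low_sd:.0f} Hz")
```

Restricting the band changes both moments for the same frame, which is one way to see why the relative importance of these measures could differ between sampling rates.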
4.
J Acoust Soc Am ; 153(3): 1623, 2023 03.
Article in English | MEDLINE | ID: mdl-37002094

ABSTRACT

The papers in this special issue provide a critical look at some historical ideas that have had an influence on research and teaching in the field of speech communication. They also address widely used methodologies or address long-standing methodological challenges in the areas of speech perception and speech production. The goal is to reconsider and evaluate the need for caution or replacement of historical ideas with more modern results and methods. The contributions provide respectful historical context to the classic ideas, as well as new original research or discussion that clarifies the limitations of the original ideas.


Subjects
Speech Perception; Speech; Communication
5.
Lang Speech ; 66(3): 564-605, 2023 Sep.
Article in English | MEDLINE | ID: mdl-36000386

ABSTRACT

We present an implementation of DIANA, a computational model of spoken word recognition, to model responses collected in the Massive Auditory Lexical Decision (MALD) project. DIANA is an end-to-end model, including an activation and decision component that takes the acoustic signal as input, activates internal word representations, and outputs lexicality judgments and estimated response latencies. Simulation 1 presents the process of creating acoustic models required by DIANA to analyze novel speech input. Simulation 2 investigates DIANA's performance in determining whether the input signal is a word present in the lexicon or a pseudoword. In Simulation 3, we generate estimates of response latency and correlate them with general tendencies in participant responses in MALD data. We find that DIANA performs fairly well in free word recognition and lexical decision. However, the current approach for estimating response latency provides estimates opposite to those found in behavioral data. We discuss these findings and offer suggestions as to what a contemporary model of spoken word recognition should be able to do.


Subjects
Speech Perception; Speech; Humans; Reaction Time; Computer Simulation; Speech Perception/physiology; Acoustics
6.
Lang Speech ; 66(2): 474-499, 2023 Jun.
Article in English | MEDLINE | ID: mdl-35971942

ABSTRACT

Recent evidence indicates that a word's paradigmatic neighbors affect production. However, these findings have mostly been obtained in careful laboratory settings using words in isolation, and thus ignoring potential effects that may arise from the syntagmatic context, which is typically present in spontaneous speech. The current corpus analysis investigates paradigmatic and syntagmatic effects in Estonian spontaneous speech. Following work on English, we focus on the duration of inflected and uninflected word-final /-s/ in content words, while simultaneously investigating whole words. Our analyses reveal three points. First, we find an effect of realized inflectional paradigm size, such that smaller paradigms actively used by the speakers lead to longer durations. Second, higher conditional probability is associated with shorter word forms and shorter segments. Finally, we do not directly replicate previous work on effects of inflectional status as in English word-final /-s/. Instead, we find that inflectional status interacts with conditional probability. We discuss the results in light of models of speech production and how they account for morphologically complex words and their paradigmatic neighbors.


Subjects
Language; Speech; Humans; Estonia; Probability; Time Factors
7.
Article in English | MEDLINE | ID: mdl-36521156

ABSTRACT

While known to influence visual lexical processing, the semantic information we associate with words has recently been found to influence auditory lexical processing as well. The present work explored the influence of semantic richness in auditory lexical decision. Study 1 recreated an experiment investigating semantic richness effects in concrete nouns (Goh et al., 2016). In Study 2, we expanded the stimulus set from 442 to 8,626 items, exploring the robustness of effects observed in Study 1 against a larger data set with increased diversity in both word class and other characteristics of interest. We also utilized generalized additive mixed models to investigate potential nonlinear effects. Results indicate that semantic richness effects become more nuanced and detectable when a wider set of items belonging to different parts of speech is examined. Findings are discussed in the context of models of spoken word recognition. (PsycInfo Database Record (c) 2022 APA, all rights reserved).

8.
J Acoust Soc Am ; 152(4): 2073, 2022 10.
Article in English | MEDLINE | ID: mdl-36319243

ABSTRACT

The present study compares the production of fricatives in conversational versus read speech in American English. The goal is to examine which parameters contribute to the identification of fricatives across the two speech styles. The study surveys over 162 000 fricative tokens from the Buckeye Corpus [Pitt, Johnson, Hume, Kiesling, and Raymond (2005). Speech Commun. 45, 89-95] and the TIMIT Corpus [Zue and Seneff (1996). Recent Research towards Advanced Man-Machine Interface through Spoken Language (Elsevier, Amsterdam, the Netherlands), pp. 515-525]. A total of 18 different temporal and spectral measures are tested, including segment duration, preceding and following phone duration, spectral moments (at onset, midpoint, and/or offset), spectral peak frequency, etc. Results show that segment duration and midpoint spectral moments make the most prominent contribution to the categorization of fricatives for both speech styles. Spectral measures are more important for conversational speech, whereas duration plays a greater role for read speech. At the same time, the magnitude of the differences across speech styles is often low and many of the observed effects may be attributable to methodological differences across the corpora. Results may indicate that reduction of fricatives in conversational speech is more limited compared to the reduction of other types of speech sounds, such as plosives.


Subjects
Language; Speech Perception; Humans; United States; Speech Acoustics; Phonetics; Speech
9.
Brain Sci ; 12(7)2022 Jul 15.
Article in English | MEDLINE | ID: mdl-35884736

ABSTRACT

In conversational speech, phones and entire syllables are often missing. This can make "he's" and "he was" homophonous, realized for example as [ɨz]. Similarly, "you're" and "you were" can both be realized as [jɚ], [ɨ], etc. We investigated what types of information native listeners use to perceive such verb tenses. Possible types included acoustic cues in the phrase (e.g., in "he was"), the rate of the surrounding speech, and syntactic and semantic information in the utterance, such as the presence of time adverbs such as "yesterday" or other tensed verbs. We extracted utterances such as "So they're gonna have like a random roommate" and "And he was like, 'What's wrong?!'" from recordings of spontaneous conversations. We presented parts of these utterances to listeners, in either a written or auditory modality, to determine which types of information facilitated listeners' comprehension. Listeners rely primarily on acoustic cues in or near the target words rather than meaning and syntactic information in the context. While that information also improves comprehension in some conditions, the acoustic cues in the target itself are strong enough to reverse the percept that listeners gain from all other information together. Acoustic cues override other information in comprehending reduced productions in conversational speech.

10.
J Acoust Soc Am ; 151(2): 1367, 2022 02.
Article in English | MEDLINE | ID: mdl-35232063

ABSTRACT

Using phonological neighborhood density has been a common method to quantify lexical competition. It is useful and convenient but has shortcomings that are worth reconsidering. The present study quantifies the effects of lexical competition during spoken word recognition using acoustic distance and acoustic absement rather than phonological neighborhood density. The indication of a word's lexical competition is given by what is termed to be its acoustic distinctiveness, which is taken as its average acoustic absement to all words in the lexicon. A variety of acoustic representations for items in the lexicon are analyzed. Statistical modeling shows that acoustic distinctiveness has a similar effect trend as that of phonological neighborhood density. Additionally, acoustic distinctiveness consistently increases model fitness more than phonological neighborhood density regardless of which kind of acoustic representation is used. However, acoustic distinctiveness does not seem to explain all of the same things as phonological neighborhood density. The different areas that these two predictors explain are discussed in addition to the potential theoretical implications of the usefulness of acoustic distinctiveness in the models. The present paper concludes with some reasons why a researcher may want to use acoustic distinctiveness over phonological neighborhood density in future experiments.


Subjects
Acoustics; Linguistics
11.
Wiley Interdiscip Rev Cogn Sci ; 12(5): e1557, 2021 Sep.
Article in English | MEDLINE | ID: mdl-33651922

ABSTRACT

In this overview we describe literature on how speech production and speech perception change in healthy or normal aging across the adult lifespan. In the production section we review acoustic characteristics that have been investigated as potentially distinguishing younger and older adults. In the speech perception section studies concerning speaker age estimation and those investigating older listeners' perception are addressed. Our discussion focuses on major themes and other fruitful areas for future research. This article is categorized under: Linguistics > Language in Mind and Brain; Linguistics > Linguistic Theory; Psychology > Development and Aging.


Subjects
Speech Perception; Speech; Aged; Aging; Humans; Language; Linguistics
12.
Morphology (Dordr) ; 31(2): 171-199, 2021.
Article in English | MEDLINE | ID: mdl-33747253

ABSTRACT

Many theories of word structure in linguistics and morphological processing in cognitive psychology are grounded in a compositional perspective on the (mental) lexicon in which complex words are built up during speech production from sublexical elements such as morphemes, stems, and exponents. When combined with the hypothesis that storage in the lexicon is restricted to the irregular, the prediction follows that properties specific to regular inflected words cannot co-determine the phonetic realization of these inflected words. This study shows that the stem vowels of regular English inflected verb forms that are more frequent in their paradigm are produced with more enhanced articulatory gestures in the midsagittal plane, challenging compositional models of lexical processing. The effect of paradigmatic probability dovetails well with the Paradigmatic Enhancement Hypothesis and is consistent with a growing body of research indicating that the whole is more than its parts.

13.
JMIR Res Protoc ; 10(3): e22805, 2021 Mar 17.
Article in English | MEDLINE | ID: mdl-33729165

ABSTRACT

BACKGROUND: Major depressive disorder (MDD) is the second highest cause of disability worldwide. Standard treatments for MDD include medicine and talk therapy; however, approximately 1 in 5 Canadians fail to respond to these approaches and must consider alternatives. Transcranial direct current stimulation (tDCS) is a safe, noninvasive method that uses electrical stimulation to change the activation pattern of different brain regions. By targeting those regions known to be affected in MDD, tDCS may be useful in ameliorating treatment-resistant depression. OBJECTIVE: The objective of the Neurostimulation of the Brain in Depression trial is to compare the effectiveness of active versus sham tDCS in treating patients with ultraresistant MDD. The primary outcome will be the improvement in depressive symptoms, as measured by the change on the Montgomery-Åsberg Depression Rating Scale. Secondary outcomes will include changes in the Quick Inventory of Depressive Symptomatology Scale (subjective assessment), the World Health Organization Disability Assessment Schedule 2.0 (functional assessment), and the Screen for Cognitive Impairment in Psychiatry (cognitive assessment). Adverse events will be captured using the Young Mania Rating Scale; tDCS Adverse Events Questionnaire; Frequency, Intensity, and Burden of Side Effects Rating Scale; and Patient-Rated Inventory of Side Effects Scale. A parallel component of the study will involve assaying for baseline language function and the effect of treatment on language using an exploratory acoustic and semantic corpus analysis on recorded interviews. Participant accuracy and response latency on an auditory lexical decision task will also be evaluated. METHODS: We will recruit inpatients and outpatients in the city of Edmonton, Alberta, and will deliver the study interventions at the Grey Nuns and University of Alberta Hospitals. Written informed consent will be obtained from all participants before enrollment. Eligible participants will be randomly assigned, in a double-blinded fashion, to receive active or sham tDCS, and they will continue receiving their usual pharmacotherapy and psychotherapy throughout the trial. In both groups, participants will receive 30 weekday stimulation sessions, each session being 30 minutes in length, with the anode over the left dorsolateral prefrontal cortex and the cathode over the right. Participants in the active group will be stimulated at 2 mA throughout, whereas the sham group will receive only a brief period of stimulation to mimic skin sensations felt in the active group. Measurements will be conducted at regular points throughout the trial and 30 days after trial completion. RESULTS: The trial has been approved by the University of Alberta Research Ethics Board and is scheduled to commence in June 2021. The target sample size is 60 participants. CONCLUSIONS: This is a protocol for a multicenter, double-blinded, randomized controlled superiority trial comparing active versus sham tDCS in patients with treatment-resistant MDD. TRIAL REGISTRATION: ClinicalTrials.gov NCT04159012; http://clinicaltrials.gov/ct2/show/NCT04159012. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): PRR1-10.2196/22805.

14.
JASA Express Lett ; 1(8): 085201, 2021 08.
Article in English | MEDLINE | ID: mdl-36154242

ABSTRACT

The present study investigates the informativity of anticipatory coarticulatory acoustic detail about inflectional suffixes in English verbs, performing two experiments in which listeners classified inflectional functions of verbs. Listener response latencies were slower when acoustic detail resulting from anticipatory coarticulation mismatched with the inflectional suffix. The results indicate that listeners actively use coarticulatory phonetic detail to predict the verbs' inflectional function.


Subjects
Speech Perception; Acoustics; Phonetics; Reaction Time; Speech Acoustics; Speech Perception/physiology
15.
Neurocase ; 27(1): 30-38, 2021 02.
Article in English | MEDLINE | ID: mdl-33347384

ABSTRACT

Children with cerebral palsy (CP) are characterized as difficult to understand because of poor articulation and breathy voice quality. This case series describes the subsystems of the speech mechanism (i.e., respiratory, laryngeal, oroarticulatory) in four children with CP and four matched typically developing children (TDC) during the modulation of vocal loudness. TDC used biomechanically efficient strategies among speech subsystems to increase vocal loudness. Children with CP made fewer breathing adjustments but recruited greater chest wall muscle activity and neuromuscular drive for louder productions. These results inform future clinical research and identify speech treatment targets for children with motor speech disorders.


Subjects
Cerebral Palsy; Dysarthria; Cerebral Palsy/complications; Child; Dysarthria/etiology; Humans; Speech
16.
Lang Speech ; 64(3): 654-680, 2021 Sep.
Article in English | MEDLINE | ID: mdl-32811294

ABSTRACT

Repeating the movements associated with activities such as drawing or sports typically leads to improvements in kinematic behavior: these movements become faster, smoother, and exhibit less variation. Likewise, practice has also been shown to lead to faster and smoother movement trajectories in speech articulation. However, little is known about its effect on articulatory variability. To address this, we investigate the extent to which repetition and predictability influence the articulation of the frequent German word "sie" [zi] (they). We find that articulatory variability is proportional to speaking rate and the duration of [zi], and that overall variability decreases as [zi] is repeated during the experiment. Lower variability is also observed as the conditional probability of [zi] increases, and the greatest reduction in variability occurs during the execution of the vocalic target of [i]. These results indicate that practice can produce observable differences in the articulation of even the most common gestures used in speech.


Subjects
Gestures; Speech; Biomechanical Phenomena; Humans; Movement; Speech Production Measurement
17.
J Commun Disord ; 86: 106003, 2020.
Article in English | MEDLINE | ID: mdl-32505858

ABSTRACT

Producing speech that is clear, audible, and intelligible to others is a challenge for many children with cerebral palsy (CP) and children with Down syndrome (DS). Previous studies have demonstrated the effectiveness of using the Lee Silverman Voice Treatment (LSVT LOUD®) to increase vocal loudness and improve speech intelligibility in individuals with dysarthria secondary to Parkinson's disease (PD), and some research suggests that it also may be effective for individuals with dysarthria secondary to other conditions, including CP and DS. Although LSVT LOUD targets healthy vocal loudness, there is some evidence of spreading effects to the articulatory system. Acoustic data from two groups of children with secondary motor speech disorders [one with CP (n = 17) and one with DS (n = 9)] who received a full dose of LSVT LOUD and for whom post-treatment intelligibility gains have been previously reported, were analyzed for treatment effects on: 1) vowel duration, 2) acoustic vowel space and 3) the ratio of F2/i/ to F2/u/. Statistically significant changes in vowel duration and acoustic vowel space occurred pre-treatment to 12 weeks post-treatment in the CP group, and increased acoustic vowel space was observed in 5 of the DS participants. The present study provides preliminary evidence of intensive voice treatment spreading effects to the articulatory system in some children with CP and children with DS consistent with previous findings in other populations.


Subjects
Cerebral Palsy; Parkinson Disease; Speech Intelligibility; Speech Therapy; Cerebral Palsy/therapy; Child; Dysarthria; Humans; Parkinson Disease/therapy; Speech Acoustics
18.
J Acoust Soc Am ; 147(4): 2741, 2020 04.
Article in English | MEDLINE | ID: mdl-32359328

ABSTRACT

As scientists, it is important to sample as broadly as possible; however, there is a bias in the research performed on the speech acoustics of the world's languages toward work on languages of convenience (e.g., English). This special issue seeks to initiate increased publication of acoustic research on the sounds of the world's languages. The special issue contains a sample of 25 under-documented languages. While large relative to previous work (particularly in the Journal of the Acoustical Society of America), the 23 articles in this issue just scratch the surface. To have a better understanding of the fundamentals of speech communication, it is imperative, as a research community, to make a concerted effort to learn more about how speech sounds are perceived and produced in a wide variety of languages.


Subjects
Phonetics; Speech Perception; Language; Speech; Speech Acoustics
19.
J Acoust Soc Am ; 147(1): 137, 2020 01.
Article in English | MEDLINE | ID: mdl-32007016

ABSTRACT

Multiple measures of vowel overlap have been proposed that use F1, F2, and duration to calculate the degree of overlap between vowel categories. The present study assesses four of these measures: the spectral overlap assessment metric [SOAM; Wassink (2006). J. Acoust. Soc. Am. 119(4), 2334-2350], the a posteriori probability (APP)-based metric [Morrison (2008). J. Acoust. Soc. Am. 123(1), 37-40], the vowel overlap analysis with convex hulls method [VOACH; Haynes and Taylor, (2014). J. Acoust. Soc. Am. 136(2), 883-891], and the Pillai score as first used for vowel overlap by Hay, Warren, and Drager [(2006). J. Phonetics 34(4), 458-484]. Summaries of the measures are presented, and theoretical critiques of them are performed, concluding that the APP-based metric and Pillai score are theoretically preferable to SOAM and VOACH. The measures are empirically assessed using accuracy and precision criteria with Monte Carlo simulations. The Pillai score demonstrates the best overall performance in these tests. The potential applications of vowel overlap measures to research scenarios are discussed, including comparisons of vowel productions between different social groups, as well as acoustic investigations into vowel formant trajectories.
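
As a rough illustration of the Pillai score mentioned above, the sketch below computes Pillai's trace for two vowel categories as trace(H (H + E)^-1), where H and E are the between-group and within-group sums-of-squares-and-cross-products matrices. It assumes exactly two groups and uses only F1 and F2; the function name `pillai_score` and the synthetic formant values are hypothetical and do not come from the study.

```python
import numpy as np

def pillai_score(a, b):
    """Pillai's trace for two groups; rows are tokens, columns are measures.

    Values near 0 suggest heavy overlap, values near 1 suggest separation.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    grand_mean = np.vstack([a, b]).mean(axis=0)
    n_measures = a.shape[1]
    between = np.zeros((n_measures, n_measures))  # H: between-group SSCP
    within = np.zeros((n_measures, n_measures))   # E: within-group SSCP
    for group in (a, b):
        mean = group.mean(axis=0)
        diff = (mean - grand_mean)[:, None]
        between += len(group) * (diff @ diff.T)
        centered = group - mean
        within += centered.T @ centered
    return float(np.trace(between @ np.linalg.inv(between + within)))

# Synthetic (F1, F2) tokens in Hz; invented for illustration only.
rng = np.random.default_rng(1)
vowel_a = rng.normal([500, 1500], 50, size=(200, 2))
vowel_same = rng.normal([500, 1500], 50, size=(200, 2))
vowel_far = rng.normal([800, 2200], 50, size=(200, 2))
print("overlapping categories:", round(pillai_score(vowel_a, vowel_same), 3))
print("distinct categories:", round(pillai_score(vowel_a, vowel_far), 3))
```

Duration could be added as a third column without changing the function, since the computation works for any number of measures.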

20.
Behav Res Methods ; 51(3): 1187-1204, 2019 06.
Article in English | MEDLINE | ID: mdl-29916041

ABSTRACT

The Massive Auditory Lexical Decision (MALD) database is an end-to-end, freely available auditory and production data set for speech and psycholinguistic research, providing time-aligned stimulus recordings for 26,793 words and 9592 pseudowords, and response data for 227,179 auditory lexical decisions from 231 unique monolingual English listeners. In addition to the experimental data, we provide many precompiled listener- and item-level descriptor variables. This data set makes it easy to explore responses, build and test theories, and compare a wide range of models. We present summary statistics and analyses.


Subjects
Decision Making; Adolescent; Adult; Data Collection; Databases, Factual; Female; Humans; Language; Male; Psycholinguistics; Speech; Young Adult