Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 21
Filter
Add more filters










Publication year range
1.
J Speech Lang Hear Res ; 67(5): 1400-1412, 2024 May 07.
Article in English | MEDLINE | ID: mdl-38573836

ABSTRACT

PURPOSE: We compare two signal smoothing and differentiation approaches: a frequently used approach in the speech community of digital filtering with approximation of derivatives by finite differences and a spline smoothing approach widely used in other fields of human movement science. METHOD: In particular, we compare the values of a classic set of kinematic parameters estimated by the two smoothing approaches and assess, via regressions, how well these reconstructed values conform to known laws about relations between the parameters. RESULTS: Substantially smaller regression errors were observed for the spline smoothing than for the filtering approach. CONCLUSION: This result is in broad agreement with reports from other fields of movement science and underpins the superiority of splines also in the domain of speech.


Subject(s)
Speech , Humans , Biomechanical Phenomena , Speech/physiology , Regression Analysis , Signal Processing, Computer-Assisted
2.
JASA Express Lett ; 3(9)2023 09 01.
Article in English | MEDLINE | ID: mdl-37671987

ABSTRACT

Evaluating any model underlying the control of speech requires segmenting the continuous flow of speech effectors into sequences of movements. A virtually universal practice in this segmentation is to use a velocity-based threshold which identifies a movement onset or offset as the time at which the velocity of the relevant effector breaches some threshold percentage of the maximal velocity. Depending on the threshold choice, more or less of the movement's trajectory is left in for model regression. This paper makes explicit how the choice of this threshold modulates the regression performance of a dynamical model hypothesized to govern speech movements.


Subject(s)
Lewis Blood Group Antigens , Movement , Linear Models , Speech
3.
Phonetica ; 78(5-6): 385-433, 2021 10 11.
Article in English | MEDLINE | ID: mdl-34624186

ABSTRACT

This paper is concerned with the relation between syllabic organization and intersegmental spatiotemporal coordination using Electromagnetic Articulometry recordings from seven speakers of American English (henceforth, English). Whereas previous work on English has focused on word-initial clusters (preceding a vowel whose identity was not systematically varied), the present work examined word-medial clusters /pl, kl, sp, sk/ in the context of three different vowel heights (high, mid, low). Our results provide evidence for a global organization for the segments involved in these cluster-vowel combinations. This is reflected in a number of ways: compression of the prevocalic consonant and reduction of CV timing in the word-medial cluster case compared to its singleton paired word in both stop-lateral and s-stop clusters, early vowel initiation (as permitted by the clusters' phonetic properties), and presence of compensatory relations between phonetic properties of different segments or intersegmental transitions within each cluster. In other words, we find that the global organization presiding over the segments partaking in these word-medial tautosyllabic CCVs is pleiotropic, that is, simultaneously expressed in multiple phonetic exponents rather than via a privileged metric such as c-center stability or any other such given single measure employed in previous works.


Subject(s)
Language , Phonetics , Humans
4.
JASA Express Lett ; 1(2): 024401, 2021 Feb.
Article in English | MEDLINE | ID: mdl-36154034

ABSTRACT

Fourteen-month-olds' ability to distinguish a just learned word, /buːk/, from its minimally different word, /duːk/, was assessed under two pre-exposure conditions: one where /b, d/-initial forms occurred in a varying vowel context and another where the vowel was fixed but the final consonant varied. Infants in the experiments benefited from the variable vowel but not from the variable final consonant context, suggesting that vowel variability but not all kinds of variability are beneficial. These results are discussed in the context of time-honored observations on the vowel-dependent nature of place of articulation cues for consonants.

5.
JASA Express Lett ; 1(1): 015204, 2021 Jan.
Article in English | MEDLINE | ID: mdl-36154090

ABSTRACT

In a cue-distractor task, speakers' response times (RTs) were found to speed up when they perceived a distractor syllable whose vowel was identical to the vowel in the syllable they were preparing to utter. At a more fine-grained level, subphonemic congruency between response and distractor-defined by higher number of shared phonological features or higher acoustic proximity-was also found to be predictive of RT modulations. Furthermore, the findings indicate that perception of vowel stimuli embedded in syllables gives rise to robust and more consistent perceptuomotor compatibility effects (compared to isolated vowels) across different response-distractor vowel pairs.

6.
Phonetica ; 78(1): 3-27, 2021 02 24.
Article in English | MEDLINE | ID: mdl-31574509

ABSTRACT

Fitts' law, perhaps the most celebrated law of human motor control, expresses a relation between the kinematic property of speed and the non-kinematic, task-specific property of accuracy. We aimed to assess whether speech movements obey this law using a metronome-driven speech elicitation paradigm with a systematic speech rate control. Specifically, using the paradigm of repetitive speech, we recorded via electromagnetic articulometry speech movement data in sequences of the form /CV…/ from 6 adult speakers. These sequences were spoken at 8 distinct rates ranging from extremely slow to extremely fast. Our results demonstrate, first, that the present paradigm of extensive metronome-driven manipulations satisfies the crucial prerequisites for evaluating Fitts' law in a subset of our elicited rates. Second, we uncover for the first time in speech evidence for Fitts' law at the faster rates and specifically beyond a participant-specific critical rate. We find no evidence for Fitts' law at the slowest metronome rates. Finally, we discuss implications of these results for models of speech.


Subject(s)
Psychomotor Performance , Speech , Adult , Biomechanical Phenomena , Humans , Movement , Tongue
7.
8.
Atten Percept Psychophys ; 82(5): 2751-2764, 2020 Jul.
Article in English | MEDLINE | ID: mdl-32236835

ABSTRACT

Perceptuomotor compatibility between phonemically identical spoken and perceived syllables has been found to speed up response times (RTs) in speech production tasks. However, research on compatibility effects between perceived and produced stimuli at the subphonemic level is limited. Using a cue-distractor task, we investigated the effects of phonemic and subphonemic congruency in pairs of vowels. On each trial, a visual cue prompted individuals to produce a response vowel, and after the visual cue appeared a distractor vowel was auditorily presented while speakers were planning to produce the response vowel. The results revealed effects on RTs due to phonemic congruency (same vs. different vowels) between the response and distractor vowels, which resemble effects previously seen for consonants. Beyond phonemic congruency, we assessed how RTs are modulated as a function of the degree of subphonemic similarity between the response and distractor vowels. Higher similarity between the response and distractor in terms of phonological distance-defined by number of mismatching phonological features-resulted in faster RTs. However, the exact patterns of RTs varied across response-distractor vowel pairs. We discuss how different assumptions about phonological feature representations may account for the different patterns observed in RTs across response-distractor pairs. Our findings on the effects of perceived stimuli on produced speech at a more detailed level of representation than phonemic identity necessitate a more direct and specific formulation of the perception-production link. Additionally, these results extend previously reported perceptuomotor interactions mainly involving consonants to vowels.


Subject(s)
Speech Perception , Humans , Phonetics , Reaction Time , Speech
9.
Dev Sci ; 23(5): e12950, 2020 09.
Article in English | MEDLINE | ID: mdl-32052548

ABSTRACT

Seminal work by Werker and colleagues (Stager & Werker [1997] Nature, 388, 381-382) has found that 14-month-old infants do not show evidence for learning minimal pairs in the habituation-switch paradigm. However, when multiple speakers produce the minimal pair in acoustically variable ways, infants' performance improves in comparison to a single speaker condition (Rost & McMurray [2009] Developmental Science, 12, 339-349). The current study further extends these results and assesses how different kinds of input variability affect 14-month-olds' minimal pair learning in the habituation-switch paradigm testing German learning infants. The first two experiments investigated word learning when the labels were spoken by a single speaker versus when the labels were spoken by multiple speakers. In the third experiment we studied whether non-acoustic variability, implemented by visual variability of the objects presented together with the labels, would also affect minimal pair learning. We found enhanced learning in the multiple speakers compared to the single speaker condition, confirming previous findings with English-learning infants. In contrast, visual variability of the presented objects did not support learning. These findings both confirm and better delimit the beneficial role of speech-specific variability in minimal pair learning. Finally, we review different proposals on the mechanisms via which variability confers benefits to learning and outline what may be likely principles that underlie this benefit. We highlight among these the multiplicity of acoustic cues signalling phonemic contrasts and the presence of relations among these cues. It is in these relations where we trace part of the source for the apparent paradoxical benefit of variability in learning.


Subject(s)
Language Development , Phonetics , Speech Perception , Verbal Learning , Cues , Female , Germany , Humans , Infant , Language , Male , Speech/physiology , Speech Disorders
10.
Phonetica ; 76(6): 448-478, 2019.
Article in English | MEDLINE | ID: mdl-31454798

ABSTRACT

We examined gestural coordination in C1C2 (C1 stop, C2 lateral or tap) word initial clusters using articulatory (electromagnetic articulometry) and acoustic data from six speakers of Standard Peninsular Spanish. We report on patterns of voice onset time (VOT), gestural plateau duration of C1, C2, and their overlap. For VOT, as expected, place of articulation is a major factor, with velars exhibiting longer VOTs than labials. Regarding C1 plateau duration, voice and place effects were found such that voiced consonants are significantly shorter than voiceless consonants, and velars show longer duration than labials. For C2 plateau duration, lateral duration was found to vary as a function of onset complexity (C vs. CC). As for overlap, unlike in French, where articulatory data for clusters have also been examined, clusters where both C1 and C2 are voiced show more overlap than where voicing differs. Further, overlap was affected by the C2 such that clusters where C2 is a tap show less overlap than clusters where C2 is a lateral. We discuss these results in the context of work aiming to uncover phonetic (e.g., articulatory or perceptual) and phonological forces (e.g., syllabic organization) on timing.


Subject(s)
Language , Phonetics , Speech Perception , Voice/physiology , Female , Humans , Male , Sampling Studies , Spain
11.
PLoS One ; 14(3): e0213851, 2019.
Article in English | MEDLINE | ID: mdl-30883586

ABSTRACT

The speed-curvature power law is a celebrated law of motor control expressing a relation between the kinematic property of speed and the geometric property of curvature. We aimed to assess whether speech movements obey this law just as movements from other domains do. We describe a metronome-driven speech elicitation paradigm designed to cover a wide range of speeds. We recorded via electromagnetic articulometry speech movements in sequences of the form /CV…/ from nine speakers (five German, four English) speaking at eight distinct rates. First, we demonstrate that the paradigm of metronome-driven manipulations results in speech movement data consistent with earlier reports on the kinematics of speech production. Second, analysis of our data in their full three-dimensions and using advanced numerical differentiation methods offers stronger evidence for the law than that reported in previous studies devoted to its assessment. Finally, we demonstrate the presence of a clear rate dependency of the power law's parameters. The robustness of the speed-curvature relation in our datasets lends further support to the hypothesis that the power law is a general feature of human movement. We place our results in the context of other work in movement control and consider implications for models of speech production.


Subject(s)
Speech/physiology , Tongue/physiology , Adolescent , Adult , Biomechanical Phenomena , Electronic Data Processing , Female , Humans , Language , Male , Principal Component Analysis , Young Adult
12.
J Child Lang ; 46(1): 1-23, 2019 01.
Article in English | MEDLINE | ID: mdl-30176956

ABSTRACT

In a preferential looking paradigm, we studied how children's looking behavior and pupillary response were modulated by the degree of phonological mismatch between the correct label of a target referent and its manipulated form. We manipulated degree of mismatch by introducing one or more featural changes to the target label. Both looking behavior and pupillary response were sensitive to degree of mismatch, corroborating previous studies that found differential responses in one or the other measure. Using time-course analyses, we present for the first time results demonstrating full separability among conditions (detecting difference not only between one vs. more, but also between two and three featural changes). Furthermore, the correct labels and small featural changes were associated with stable target preference, while large featural changes were associated with oscillating looking behavior, suggesting significant shifts in looking preference over time. These findings further support and extend the notion that early words are represented in great detail, containing subphonemic information.


Subject(s)
Fixation, Ocular , Language Development , Pupil/physiology , Child, Preschool , Female , Humans , Linguistics , Male , Phonetics
13.
J Acoust Soc Am ; 144(3): EL215, 2018 09.
Article in English | MEDLINE | ID: mdl-30424675

ABSTRACT

Australian English /iː/, /ɪ/, and /ɪə/ exhibit almost identical average first (F1) and second (F2) formant frequencies and differ in duration and vowel inherent spectral change (VISC). The cues of duration, F1 × F2 trajectory direction (TD) and trajectory length (TL) were assessed in listeners' categorization of /iː/ and /ɪə/ compared to /ɪ/. Duration was important for distinguishing both /iː/ and /ɪə/ from /ɪ/. TD and TL were important for categorizing /iː/ versus /ɪ/, whereas only TL was important for /ɪə/ versus /ɪ/. Finally, listeners' use of duration and VISC was not mutually affected for either vowel compared to /ɪ/.


Subject(s)
Acoustic Stimulation/methods , Cues , Phonetics , Speech Acoustics , Speech Perception/physiology , Adolescent , Adult , Australia , Female , Humans , Male , Young Adult
14.
J Acoust Soc Am ; 144(6): EL528, 2018 12.
Article in English | MEDLINE | ID: mdl-30599650

ABSTRACT

During a cue-distractor task, participants repeatedly produce syllables prompted by visual cues. Distractor syllables are presented to participants via headphones 150 ms after the visual cue (before any response). The task has been used to demonstrate perceptuomotor integration effects (perception effects on production): response times (RTs) speed up as the distractor shares more phonetic properties with the response. Here it is demonstrated that perceptuomotor integration is not limited to RTs. Voice Onset Times (VOTs) of the distractor syllables were systematically varied and their impact on responses was measured. Results demonstrate trial-specific convergence of response syllables to VOT values of distractor syllables.

15.
J Exp Child Psychol ; 153: 140-148, 2017 01.
Article in English | MEDLINE | ID: mdl-27692796

ABSTRACT

This study introduces a method ideally suited for investigating toddlers' ability to detect mispronunciations in lexical representations: pupillometry. Previous research has established that the magnitude of pupil dilation reflects differing levels of cognitive effort. Building on those findings, we use pupil dilation to study the level of detail encoded in lexical representations with 30-month-old children whose lexicons allow for a featurally balanced stimulus set. In each trial, we present a picture followed by a corresponding auditory label. By systematically manipulating the number of feature changes in the onset of the label (e.g., baby∼daby∼faby∼shaby), we tested whether featural distance predicts the degree of pupil dilation. Our findings support the existence of a relationship between featural distance and pupil dilation. First, mispronounced words are associated with a larger degree of dilation than correct forms. Second, words that deviate more from the correct form are related to a larger dilation than words that deviate less. This pattern indicates that toddlers are sensitive to the degree of mispronunciation, and as such it corroborates previous work that found word recognition modulated by sub-segmental detail and by the degree of mismatch. Thus, we establish that pupillometry provides a viable alternative to paradigms that require overt behavioral response in increasing our understanding of the development of lexical representations.


Subject(s)
Child Development , Comprehension , Language , Child, Preschool , Female , Humans , Male
16.
J Mem Lang ; 89: 222-243, 2016 Aug.
Article in English | MEDLINE | ID: mdl-27440947

ABSTRACT

We offer a dynamical model of phonological planning that provides a formal instantiation of how the speech production and perception systems interact during online processing. The model is developed on the basis of evidence from an experimental task that requires concurrent use of both systems, the so-called response-distractor task in which speakers hear distractor syllables while they are preparing to produce required responses. The model formalizes how ongoing response planning is affected by perception and accounts for a range of results reported across previous studies. It does so by explicitly addressing the setting of parameter values in representations. The key unit of the model is that of the dynamic field, a distribution of activation over the range of values associated with each representational parameter. The setting of parameter values takes place by the attainment of a stable distribution of activation over the entire field, stable in the sense that it persists even after the response cue in the above experiments has been removed. This and other properties of representations that have been taken as axiomatic in previous work are derived by the dynamics of the proposed model.

17.
PLoS One ; 10(5): e0124714, 2015.
Article in English | MEDLINE | ID: mdl-25996153

ABSTRACT

Drawing on phonology research within the generative linguistics tradition, stochastic methods, and notions from complex systems, we develop a modelling paradigm linking phonological structure, expressed in terms of syllables, to speech movement data acquired with 3D electromagnetic articulography and X-ray microbeam methods. The essential variable in the models is syllable structure. When mapped to discrete coordination topologies, syllabic organization imposes systematic patterns of variability on the temporal dynamics of speech articulation. We simulated these dynamics under different syllabic parses and evaluated simulations against experimental data from Arabic and English, two languages claimed to parse similar strings of segments into different syllabic structures. Model simulations replicated several key experimental results, including the fallibility of past phonetic heuristics for syllable structure, and exposed the range of conditions under which such heuristics remain valid. More importantly, the modelling approach consistently diagnosed syllable structure proving resilient to multiple sources of variability in experimental data including measurement variability, speaker variability, and contextual variability. Prospects for extensions of our modelling paradigm to acoustic data are also discussed.


Subject(s)
Models, Theoretical , Phonetics
18.
Psychon Bull Rev ; 22(1): 242-50, 2015 Feb.
Article in English | MEDLINE | ID: mdl-24865282

ABSTRACT

Previous studies have found faster response times in a production task when a speaker perceives a distractor syllable that is identical to the syllable they are required to produce. No study has found such effects when a response and a distractor are not identical but share parameters below the level of the phoneme. Results from Experiment 1 show some evidence of a response-time effect of response-distractor voicing congruency. Experiment 2 showed a robust effect of articulator congruency: perceiving a distractor that has the same articulatory organ as that implicated in the planned motor response speeds up response times. These results necessitate a more direct and specific formulation of the perception-production link than warranted by previous experimental evidence. Implications for theories of speech production are also discussed.


Subject(s)
Attention/physiology , Psychomotor Performance/physiology , Reaction Time/physiology , Speech Perception/physiology , Speech/physiology , Adult , Female , Humans , Male , Psycholinguistics
19.
J Phon ; 35(3): 271-300, 2007.
Article in English | MEDLINE | ID: mdl-18389086

ABSTRACT

Using a combination of magnetometry and ultrasound, we examined the articulatory characteristics of the so-called 'transparent' vowels [i], [i], and [e] in Hungarian vowel harmony. Phonologically, transparent vowels are front, but they can be followed by either front or back suffixes. However, a finer look reveals an underlying phonetic coherence in two respects. First, transparent vowels in back harmony contexts show a less advanced (more retracted) tongue body posture than phonemically identical vowels in front harmony contexts: e.g. [i] in buli-val is less advanced than [i] in bili-vel. Second, transparent vowels in monosyllabic stems selecting back suffixes are also less advanced than phonemically identical vowels in stems selecting front suffixes: e.g. [i] in ír, taking back suffixes, compared to [i] of hír, taking front suffixes, is less advanced when these stems are produced in bare form (no suffixes). We thus argue that the phonetic degree of tongue body horizontal position correlates with the phonological alternation in suffixes. A hypothesis that emerges from this work is that a plausible phonetic basis for transparency can be found in quantal characteristics of the relation between articulation and acoustics of transparent vowels. More broadly, the proposal is that the phonology of transparent vowels is better understood when their phonological patterning is studied together with their articulatory and acoustic characteristics.

20.
Cogn Sci ; 30(5): 905-43, 2006 Sep 10.
Article in English | MEDLINE | ID: mdl-21702841

ABSTRACT

A fundamental problem in spoken language is the duality between the continuous aspects of phonetic performance and the discrete aspects of phonological competence. We study 2 instances of this problem from the phenomenon of voicing neutralization and vowel harmony. In each case, we present a model where the experimentally observed continuous distinctions are linked to the discreteness of phonological form using the mathematics of nonlinear dynamics.

SELECTION OF CITATIONS
SEARCH DETAIL
...