Search | VHL Regional Portal

1.

The role of speech planning in the articulation of pauses.

Krivokapic, Jelena; Styler, Will; Byrd, Dani.

J Acoust Soc Am ; 151(1): 402, 2022 01.

Article in English | MEDLINE | ID: mdl-35104998

ABSTRACT

Extensive research has found that the duration of a pause is influenced by the length of an upcoming utterance, suggesting that speakers plan the upcoming utterance during this time. Research has more recently begun to examine articulation during pauses. A specific configuration of the vocal tract during acoustic pauses, termed pause posture (PP), has been identified in Greek and American English. However, the cognitive function giving rise to PPs is not well understood. The present study examines whether PPs are related to speech planning processes, such that they contribute additional planning time for an upcoming utterance. In an articulatory magnetometer study, the hypothesis is tested that an increase in upcoming utterance length leads to more frequent PP occurrence and that PPs are longer in pauses that precede longer phrases. The results indicate that PPs are associated with planning time for longer utterances but that they are associated with a relatively fixed scope of planning for upcoming speech. To further examine the relationship between articulation and speech planning, an additional hypothesis examines whether the first part of the pause predominantly serves to mark prosodic boundaries while the second part serves speech planning purposes. This hypothesis is not supported by the results.

Subject(s)

Language , Speech , Speech Acoustics , Speech Articulation Tests , Speech Production Measurement

2.

A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images.

Lim, Yongwan; Toutios, Asterios; Bliesener, Yannick; Tian, Ye; Lingala, Sajan Goud; Vaz, Colin; Sorensen, Tanner; Oh, Miran; Harper, Sarah; Chen, Weiyi; Lee, Yoonjeong; Töger, Johannes; Monteserin, Mairym Lloréns; Smith, Caitlin; Godinez, Bianca; Goldstein, Louis; Byrd, Dani; Nayak, Krishna S; Narayanan, Shrikanth S.

Sci Data ; 8(1): 187, 2021 07 20.

Article in English | MEDLINE | ID: mdl-34285240

ABSTRACT

Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to-date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically-relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 participants performing linguistically motivated speech tasks, alongside the corresponding public domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each participant.

Subject(s)

Larynx/physiology , Magnetic Resonance Imaging/methods , Speech , Adolescent , Adult , Computer Systems , Female , Humans , Male , Middle Aged , Time Factors , Video Recording , Young Adult

3.

Aliasing artifact reduction in spiral real-time MRI.

Tian, Ye; Lim, Yongwan; Zhao, Ziwei; Byrd, Dani; Narayanan, Shrikanth; Nayak, Krishna S.

Magn Reson Med ; 86(2): 916-925, 2021 08.

Article in English | MEDLINE | ID: mdl-33728700

ABSTRACT

PURPOSE: To mitigate a common artifact in spiral real-time MRI, caused by aliasing of signal outside the desired FOV. This artifact frequently occurs in midsagittal speech real-time MRI. METHODS: Simulations were performed to determine the likely origin of the artifact. Two methods to mitigate the artifact are proposed. The first approach, denoted as "large FOV" (LF), keeps an FOV that is large enough to include the artifact signal source during reconstruction. The second approach, denoted as "estimation-subtraction" (ES), estimates the artifact signal source before subtracting a synthetic signal representing that source in multicoil k-space raw data. Twenty-five midsagittal speech-production real-time MRI data sets were used to evaluate both of the proposed methods. Reconstructions without and with corrections were evaluated by two expert readers using a 5-level Likert scale assessing artifact severity. Reconstruction time was also compared. RESULTS: The origin of the artifact was found to be a combination of gradient nonlinearity and imperfect anti-aliasing in spiral sampling. The LF and ES methods were both able to substantially reduce the artifact, with an averaged qualitative score improvement of 1.25 and 1.35 Likert levels for LF correction and ES correction, respectively. Average reconstruction time without correction, with LF correction, and with ES correction were 160.69 ± 1.56, 526.43 ± 5.17, and 171.47 ± 1.71 ms/frame. CONCLUSION: Both proposed methods were able to reduce the spiral aliasing artifacts, with the ES-reduction method being more effective and more time efficient.

Subject(s)

Artifacts , Image Processing, Computer-Assisted , Magnetic Resonance Imaging , Speech

4.

Improved 3D real-time MRI of speech production.

Zhao, Ziwei; Lim, Yongwan; Byrd, Dani; Narayanan, Shrikanth; Nayak, Krishna S.

Magn Reson Med ; 85(6): 3182-3195, 2021 06.

Article in English | MEDLINE | ID: mdl-33452722

ABSTRACT

PURPOSE: To provide 3D real-time MRI of speech production with improved spatio-temporal sharpness using randomized, variable-density, stack-of-spiral sampling combined with a 3D spatio-temporally constrained reconstruction. METHODS: We evaluated five candidate (k, t) sampling strategies using a previously proposed gradient-echo stack-of-spiral sequence and a 3D constrained reconstruction with spatial and temporal penalties. Regularization parameters were chosen by expert readers based on qualitative assessment. We experimentally determined the effect of spiral angle increment and kz temporal order. The strategy yielding highest image quality was chosen as the proposed method. We evaluated the proposed and original 3D real-time MRI methods in 2 healthy subjects performing speech production tasks that invoke rapid movements of articulators seen in multiple planes, using interleaved 2D real-time MRI as the reference. We quantitatively evaluated tongue boundary sharpness in three locations at two speech rates. RESULTS: The proposed data-sampling scheme uses a golden-angle spiral increment in the kx -ky plane and variable-density, randomized encoding along kz . It provided a statistically significant improvement in tongue boundary sharpness score (P < .001) in the blade, body, and root of the tongue during normal and 1.5-times speeded speech. Qualitative improvements were substantial during natural speech tasks of alternating high, low tongue postures during vowels. The proposed method was also able to capture complex tongue shapes during fast alveolar consonant segments. Furthermore, the proposed scheme allows flexible retrospective selection of temporal resolution. CONCLUSION: We have demonstrated improved 3D real-time MRI of speech production using randomized, variable-density, stack-of-spiral sampling with a 3D spatio-temporally constrained reconstruction.

Subject(s)

Image Processing, Computer-Assisted , Speech , Humans , Imaging, Three-Dimensional , Magnetic Resonance Imaging , Retrospective Studies , Tongue/diagnostic imaging

5.

How an aglossic speaker produces an alveolar-like percept without a functional tongue tip.

Toutios, Asterios; Xu, Melissa; Byrd, Dani; Goldstein, Louis; Narayanan, Shrikanth.

J Acoust Soc Am ; 147(6): EL460, 2020 06.

Article in English | MEDLINE | ID: mdl-32611190

ABSTRACT

It has been previously observed [McMicken, Salles, Berg, Vento-Wilson, Rogers, Toutios, and Narayanan. (2017). J. Commun. Disorders, Deaf Stud. Hear. Aids 5(2), 1-6] using real-time magnetic resonance imaging that a speaker with severe congenital tongue hypoplasia (aglossia) had developed a compensatory articulatory strategy where she, in the absence of a functional tongue tip, produced a plosive consonant perceptually similar to /d/ using a bilabial constriction. The present paper provides an updated account of this strategy. It is suggested that the previously observed compensatory bilabial closing that occurs during this speaker's /d/ production is consistent with vocal tract shaping resulting from hyoid raising created with mylohyoid action, which may also be involved in typical /d/ production. Simulating this strategy in a dynamic articulatory synthesis experiment leads to the generation of /d/-like formant transitions.

Subject(s)

Tongue , Voice , Female , Humans , Phonetics , Speech , Tongue/diagnostic imaging

6.

Improved real-time tagged MRI using REALTAG.

Chen, Weiyi; Lee, Nam Gyun; Byrd, Dani; Narayanan, Shrikanth; Nayak, Krishna S.

Magn Reson Med ; 84(2): 838-846, 2020 08.

Article in English | MEDLINE | ID: mdl-31872918

ABSTRACT

OBJECTIVES: To evaluate a novel method for real-time tagged MRI with increased tag persistence using phase sensitive tagging (REALTAG), demonstrated for speech imaging. METHODS: Tagging is applied as a brief interruption to a continuous real-time spiral acquisition. REALTAG is implemented using a total tagging flip angle of 180° and a novel frame-by-frame phase sensitive reconstruction to remove smooth background phase while preserving the sign of the tag lines. Tag contrast-to-noise ratio of REALTAG and conventional tagging (total flip angle of 90°) is simulated and evaluated in vivo. The ability to extend tag persistence is tested during the production of vowel-to-vowel transitions by American English speakers. RESULTS: REALTAG resulted in a doubling of contrast-to-noise ratio at each time point and increased tag persistence by more than 1.9-fold. The tag persistence was 1150 ms with contrast-to-noise ratio >6 at 1.5T, providing 2 mm in-plane resolution, 179 frames/s, with 72.6 ms temporal window width, and phase sensitive reconstruction. The new imaging window is able to capture internal tongue deformation over word-to-word transitions in natural speech production. CONCLUSION: Tag persistence is substantially increased in intermittently tagged real-time MRI by using the improved REALTAG method. This makes it possible to capture longer motion patterns in the tongue, such as cross-word vowel-to-vowel transitions, and provides a powerful new window to study tongue biomechanics.

Subject(s)

Language , Magnetic Resonance Imaging , Biomechanical Phenomena , Speech , Tongue/diagnostic imaging

7.

Intermittently tagged real-time MRI reveals internal tongue motion during speech production.

Chen, Weiyi; Byrd, Dani; Narayanan, Shrikanth; Nayak, Krishna S.

Magn Reson Med ; 82(2): 600-613, 2019 08.

Article in English | MEDLINE | ID: mdl-30919494

ABSTRACT

PURPOSE: To demonstrate a tagging method compatible with RT-MRI for the study of speech production. METHODS: Tagging is applied as a brief interruption to a continuous real-time spiral acquisition. Tagging can be initiated manually by the operator, cued to the speech stimulus, or be automatically applied with a fixed frequency. We use a standard 2D 1-3-3-1 binomial SPAtial Modulation of Magnetization (SPAMM) sequence with 1 cm spacing in both in-plane directions. Tag persistence in tongue muscle is simulated and validated in vivo. The ability to capture internal tongue deformations is tested during speech production of American English diphthongs in native speakers. RESULTS: We achieved an imaging window of 650-800 ms at 1.5T, with imaging signal to noise ratio ≥ 17 and tag contrast to noise ratio ≥ 5 in human tongue, providing 36 frames/s temporal resolution and 2 mm in-plane spatial resolution with real-time interactive acquisition and view-sharing reconstruction. The proposed method was able to capture tongue motion patterns and their relative timing with adequate spatiotemporal resolution during the production of American English diphthongs and consonants. CONCLUSION: Intermittent tagging during real-time MRI of speech production is able to reveal the internal deformations of the tongue. This capability will allow new investigations of valuable spatiotemporal information on the biomechanics of the lingual subsystems during speech without reliance on binning speech utterance repetition.

Subject(s)

Image Processing, Computer-Assisted/methods , Magnetic Resonance Imaging/methods , Speech Production Measurement/methods , Speech/physiology , Tongue , Adult , Female , Humans , Male , Movement/physiology , Tongue/diagnostic imaging , Tongue/physiology

8.

Syllable-internal corrective focus in Korean.

Oh, Miran; Byrd, Dani.

J Phon ; 772019 Nov.

Article in English | MEDLINE | ID: mdl-32863471

ABSTRACT

In producing linguistic prominence, certain linguistic elements are highlighted relative to others in a given domain; focus is an instance of prominence in which speakers highlight new or important information. This study investigates prominence modulation at the sub-syllable level using a corrective focus task, examining acoustic duration and pitch with particular attention to the gestural composition of Korean tense and lax consonants. The results indicate that focus effects are manifested with systematic variations depending on the gestural structures, i.e. consonants, active during the domain of a focus gesture, that the patterns of focus modulation do not differ as a function of elicited focus positions within the syllable. The findings generally support the premise that the scope of the focus gesture is not (much) smaller than the interval of (CVC) syllable. Lastly, there is also some support for an interaction among prosodic gestures-focus gestures and pitch accentual gestures-at the phrase level. Overall, the current findings support the hypothesis that focus, implemented as a prosodic prominence gesture, modulates temporal characteristics of gestures, as well as possibly other prosodic gestures that are co-active in its the domain.

9.

3D dynamic MRI of the vocal tract during natural speech.

Lim, Yongwan; Zhu, Yinghua; Lingala, Sajan Goud; Byrd, Dani; Narayanan, Shrikanth; Nayak, Krishna Shrinivas.

Magn Reson Med ; 81(3): 1511-1520, 2019 03.

Article in English | MEDLINE | ID: mdl-30390319

ABSTRACT

PURPOSE: To develop and evaluate a technique for 3D dynamic MRI of the full vocal tract at high temporal resolution during natural speech. METHODS: We demonstrate 2.4 × 2.4 × 5.8 mm3 spatial resolution, 61-ms temporal resolution, and a 200 × 200 × 70 mm3 FOV. The proposed method uses 3D gradient-echo imaging with a custom upper-airway coil, a minimum-phase slab excitation, stack-of-spirals readout, pseudo golden-angle view order in kx -ky , linear Cartesian order along kz , and spatiotemporal finite difference constrained reconstruction, with 13-fold acceleration. This technique is evaluated using in vivo vocal tract airway data from 2 healthy subjects acquired at 1.5T scanner, 1 with synchronized audio, with 2 tasks during production of natural speech, and via comparison with interleaved multislice 2D dynamic MRI. RESULTS: This technique captured known dynamics of vocal tract articulators during natural speech tasks including tongue gestures during the production of consonants "s" and "l" and of consonant-vowel syllables, and was additionally consistent with 2D dynamic MRI. Coordination of lingual (tongue) movements for consonants is demonstrated via volume-of-interest analysis. Vocal tract area function dynamics revealed critical lingual constriction events along the length of the vocal tract for consonants and vowels. CONCLUSION: We demonstrate feasibility of 3D dynamic MRI of the full vocal tract, with spatiotemporal resolution adequate to visualize lingual movements for consonants and vocal tact shaping during natural productions of consonant-vowel syllables, without requiring multiple repetitions.

Subject(s)

Imaging, Three-Dimensional/methods , Larynx/diagnostic imaging , Magnetic Resonance Imaging , Signal Processing, Computer-Assisted , Speech Production Measurement/methods , Speech/physiology , Adult , Female , Humans , Image Processing, Computer-Assisted , Language , Male , Movement , Reproducibility of Results , Tongue , Video Recording

10.

Simultaneous electromagnetic articulography and electroglottography data acquisition of natural speech.

Harper, Sarah; Lee, Sungbok; Goldstein, Louis; Byrd, Dani.

J Acoust Soc Am ; 144(5): EL380, 2018 11.

Article in English | MEDLINE | ID: mdl-30522297

ABSTRACT

This paper reports on the concurrent use of electroglottography (EGG) and electromagnetic articulography (EMA) in the acquisition of EMA trajectory data for running speech. Static and dynamic intersensor distances, standard deviations, and coefficients of variation associated with inter-sample distances were compared in two conditions: with and without EGG present. Results indicate that measurement discrepancies between the two conditions are within the EMA system's measurement uncertainty. Therefore, potential electromagnetic interference from EGG does not seem to cause differences of practical importance on EMA trajectory behaviors, suggesting that simultaneous EMA and EGG data acquisition is a viable laboratory procedure for speech research.

Subject(s)

Electromagnetic Phenomena , Glottis/physiology , Speech Production Measurement/instrumentation , Speech/physiology , Female , Glottis/anatomy & histology , Humans , Larynx/anatomy & histology , Larynx/physiology , Male , Mouth/anatomy & histology , Mouth/physiology

11.

Articulatory, acoustic, and prosodic accommodation in a cooperative maze navigation task.

Lee, Yoonjeong; Gordon Danner, Samantha; Parrell, Benjamin; Lee, Sungbok; Goldstein, Louis; Byrd, Dani.

PLoS One ; 13(8): e0201444, 2018.

Article in English | MEDLINE | ID: mdl-30086554

ABSTRACT

This study uses a maze navigation task in conjunction with a quasi-scripted, prosodically controlled speech task to examine acoustic and articulatory accommodation in pairs of interacting speakers. The experiment uses a dual electromagnetic articulography set-up to collect synchronized acoustic and articulatory kinematic data from two facing speakers simultaneously. We measure the members of a dyad individually before they interact, while they are interacting in a cooperative task, and again individually after they interact. The design is ideally suited to measure speech convergence, divergence, and persistence effects during and after speaker interaction. This study specifically examines how convergence and divergence effects during a dyadic interaction may be related to prosodically salient positions, such as preceding a phrase boundary. The findings of accommodation in fine-grained prosodic measures illuminate our understanding of how the realization of linguistic phrasal structure is coordinated across interacting speakers. Our findings on individual speaker variability and the time course of accommodation provide novel evidence for accommodation at the level of cognitively specified motor control of individual articulatory gestures. Taken together, these results have implications for understanding the cognitive control of interactional behavior in spoken language communication.

Subject(s)

Cognition/physiology , Cooperative Behavior , Interpersonal Relations , Speech/physiology , Adult , Electromagnetic Phenomena , Female , Humans , Male , Speech Production Measurement/instrumentation , Speech Production Measurement/methods , Young Adult

12.

Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC).

Narayanan, Shrikanth; Toutios, Asterios; Ramanarayanan, Vikram; Lammert, Adam; Kim, Jangwon; Lee, Sungbok; Nayak, Krishna; Kim, Yoon-Chul; Zhu, Yinghua; Goldstein, Louis; Byrd, Dani; Bresch, Erik; Ghosh, Prasanta; Katsamanis, Athanasios; Proctor, Michael.

J Acoust Soc Am ; 136(3): 1307, 2014 Sep.

Article in English | MEDLINE | ID: mdl-25190403

ABSTRACT

USC-TIMIT is an extensive database of multimodal speech production data, developed to complement existing resources available to the speech research community and with the intention of being continuously refined and augmented. The database currently includes real-time magnetic resonance imaging data from five male and five female speakers of American English. Electromagnetic articulography data have also been presently collected from four of these speakers. The two modalities were recorded in two independent sessions while the subjects produced the same 460 sentence corpus used previously in the MOCHA-TIMIT database. In both cases the audio signal was recorded and synchronized with the articulatory data. The database and companion software are freely available to the research community.

Subject(s)

Acoustics , Biomedical Research , Databases, Factual , Electromagnetic Phenomena , Magnetic Resonance Imaging , Pharynx/physiology , Speech Acoustics , Speech Production Measurement , Voice Quality , Acoustics/instrumentation , Adult , Biomechanical Phenomena , Female , Humans , Male , Middle Aged , Pharynx/anatomy & histology , Signal Processing, Computer-Assisted , Software , Speech Production Measurement/instrumentation , Time Factors , Transducers

13.

Spatiotemporal coupling between speech and manual motor actions.

Parrell, Benjamin; Goldstein, Louis; Lee, Sungbok; Byrd, Dani.

J Phon ; 42: 1-11, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24465063

ABSTRACT

Much evidence has been found for pervasive links between the manual and speech motor systems, including evidence from infant development, deictic pointing, and repetitive tapping and speaking tasks. We expand on the last of these paradigms to look at intra- and cross-modal effects of emphatic stress, as well as the effects of coordination in the absence of explicit rhythm. In this study, subjects repeatedly tapped their finger and synchronously repeated a single spoken syllable. On each trial, subjects placed an emphatic stress on one finger tap or one spoken syllable. Results show that both movement duration and magnitude are affected by emphatic stress regardless of whether that stress is in the same domain (e.g., effects on the oral articulators when a spoken repetition is stressed) or across domains (e.g., effects on the oral articulators when a tap is stressed). Though the size of the effects differs between intra-and cross-domain emphases, the implementation of stress affects both motor domains, indicating a tight connection. This close coupling is seen even in the absence of stress, though it is highlighted under stress. The results of this study support the idea that implementation of prosody is not domain-specific but relies on general aspects of the motor system.

14.

Evaluation of prosodic juncture strength using functional data analysis.

Parrell, Benjamin; Lee, Sungbok; Byrd, Dani.

J Phon ; 41(8)2013 Nov.

Article in English | MEDLINE | ID: mdl-24244056

ABSTRACT

Prosodic structure has large effects on the temporal realization of speech via the shaping of articulatory events. It is important for speech scientists to be able to systematically quantify these prosodic effects on articulation in a way that is capable both of differentiating between the degree of prosodic lengthening associated with varying linguistic contexts and that is generalizable across speakers. The current paper presents a novel method to automatically quantify boundary strength from articulatory speech data based on functional data analysis (FDA). In particular, a new derived variable-the Deformation Index-is proposed, which is the area under FDA time-deformation functions. First using synthetic speech produced with the TaDA task dynamics computational model, the Deformation Index is shown to be able to capture a priori known differences in boundary strengths instantiated in the π-gesture framework. Additionally, this method accurately distinguishes between types of boundaries in non-synthetic speech produced by four speakers.

15.

An investigation of articulatory setting using real-time magnetic resonance imaging.

Ramanarayanan, Vikram; Goldstein, Louis; Byrd, Dani; Narayanan, Shrikanth S.

J Acoust Soc Am ; 134(1): 510-9, 2013 Jul.

Article in English | MEDLINE | ID: mdl-23862826

ABSTRACT

This paper presents an automatic procedure to analyze articulatory setting in speech production using real-time magnetic resonance imaging of the moving human vocal tract. The procedure extracts frames corresponding to inter-speech pauses, speech-ready intervals and absolute rest intervals from magnetic resonance imaging sequences of read and spontaneous speech elicited from five healthy speakers of American English and uses automatically extracted image features to quantify vocal tract posture during these intervals. Statistical analyses show significant differences between vocal tract postures adopted during inter-speech pauses and those at absolute rest before speech; the latter also exhibits a greater variability in the adopted postures. In addition, the articulatory settings adopted during inter-speech pauses in read and spontaneous speech are distinct. The results suggest that adopted vocal tract postures differ on average during rest positions, ready positions and inter-speech pauses, and might, in that order, involve an increasing degree of active control by the cognitive speech planning mechanism.

Subject(s)

Epiglottis/physiology , Glottis/physiology , Image Interpretation, Computer-Assisted/methods , Lip/physiology , Magnetic Resonance Imaging/methods , Palate, Soft/physiology , Pharynx/physiology , Phonation/physiology , Phonetics , Speech/physiology , Tongue/physiology , Algorithms , Female , Humans , Muscle Contraction/physiology , Pulmonary Ventilation/physiology , Supine Position/physiology

16.

Paralinguistic mechanisms of production in human "beatboxing": a real-time magnetic resonance imaging study.

Proctor, Michael; Bresch, Erik; Byrd, Dani; Nayak, Krishna; Narayanan, Shrikanth.

J Acoust Soc Am ; 133(2): 1043-54, 2013 Feb.

Article in English | MEDLINE | ID: mdl-23363120

ABSTRACT

Real-time magnetic resonance imaging (rtMRI) was used to examine mechanisms of sound production by an American male beatbox artist. rtMRI was found to be a useful modality with which to study this form of sound production, providing a global dynamic view of the midsagittal vocal tract at frame rates sufficient to observe the movement and coordination of critical articulators. The subject's repertoire included percussion elements generated using a wide range of articulatory and airstream mechanisms. Many of the same mechanisms observed in human speech production were exploited for musical effect, including patterns of articulation that do not occur in the phonologies of the artist's native languages: ejectives and clicks. The data offer insights into the paralinguistic use of phonetic primitives and the ways in which they are coordinated in this style of musical performance. A unified formalism for describing both musical and phonetic dimensions of human vocal percussion performance is proposed. Audio and video data illustrating production and orchestration of beatboxing sound effects are provided in a companion annotated corpus.

Subject(s)

Larynx/physiology , Magnetic Resonance Imaging , Music , Phonation , Phonetics , Voice Quality , Adult , Humans , Image Processing, Computer-Assisted , Male , Pattern Recognition, Automated , Speech Acoustics , Time Factors , Video Recording , Vocal Cords/physiology

17.

Prosodic boundary strength: An articulatory and perceptual study.

Krivokapic, Jelena; Byrd, Dani.

J Phon ; 40(3): 430-442, 2012 May 01.

Article in English | MEDLINE | ID: mdl-23441103

ABSTRACT

This study examines the production and perception of Intonational Phrase (IP) boundaries. In particular, it investigates (1) whether the articulatory events that occur at IP boundaries can exhibit temporal distinctions that would indicate a difference in degree of disjuncture, and (2) to what extent listeners are sensitive to the effects of such differences among IP boundaries. Two experiments investigate these questions. An articulatory kinematic experiment examines the effects of structural differences between IP boundaries on the production of those boundaries. In a perception experiment listeners then evaluate the strength of the junctures occurring in the utterances produced in the production study. The results of the studies provide support for the existence of prosodic strength differences among IP boundaries and also demonstrate a close link between the production and perception of prosodic boundaries. The results are discussed in the context of possible linguistic structural explanations, with implications for cognitive accounts for the creation, implementation, and processing of prosody.

18.

Analysis of pausing behavior in spontaneous speech using real-time magnetic resonance imaging of articulation.

Ramanarayanan, Vikram; Bresch, Erik; Byrd, Dani; Goldstein, Louis; Narayanan, Shrikanth S.

J Acoust Soc Am ; 126(5): EL160-5, 2009 Nov.

Article in English | MEDLINE | ID: mdl-19894792

ABSTRACT

It is hypothesized that pauses at major syntactic boundaries (i.e., grammatical pauses), but not ungrammatical (e.g., word search) pauses, are planned by a high-level cognitive mechanism that also controls the rate of articulation around these junctures. Real-time magnetic resonance imaging is used to analyze articulation at and around grammatical and ungrammatical pauses in spontaneous speech. Measures quantifying the speed of articulators were developed and applied during these pauses as well as during their immediate neighborhoods. Grammatical pauses were found to have an appreciable drop in speed at the pause itself as compared to ungrammatical pauses, which is consistent with our hypothesis that grammatical pauses are indeed choreographed by a central cognitive planner.

Subject(s)

Auditory Cortex/physiology , Magnetic Resonance Imaging/methods , Speech Articulation Tests , Speech/physiology , Vocal Cords/physiology , Humans , Phonetics

19.

Timing effects of syllable structure and stress on nasals: a real-time MRI examination.

Byrd, Dani; Tobin, Stephen; Bresch, Erik; Narayanan, Shrikanth.

J Phon ; 37(1): 97-110, 2009 Jan 01.

Article in English | MEDLINE | ID: mdl-20046892

ABSTRACT

The coordination of velum and oral gestures for English [n] is studied using real-time MRI movies to reconstruct vocal tract aperture functions. This technique allows for the examination of parts of the vocal tract otherwise inaccessible to dynamic imaging or movement tracking. The present experiment considers syllable onset, coda, and juncture geminate nasals and also addresses the effects of a variety of word stress patterns on segment internal coordination. We find a bimodal timing pattern in which near-synchrony of velum lowering and tongue tip raising characterizes the timing for onsets and temporal lag between the gestures is characteristic for codas, supporting and extending the findings of Krakow (1989), 1993) for [m]. Intervocalic word-internal nasals are found to have timing patterns that are sensitive to the local stress context, which suggests the presence of an underlying timing specification that can yield flexibly. We consider these findings in light of the gestural coupling structures described by Goldstein and colleagues (Goldstein, Byrd, & Saltzman 2006; Nam, Goldstein, and Saltzman in press; Goldstein, Nam, Saltzman, & Chitoran 2008).

20.

Phrase boundary effects on the temporal kinematics of sequential tongue tip consonants.

Byrd, Dani; Lee, Sungbok; Campos-Astorkiza, Rebeka.

J Acoust Soc Am ; 123(6): 4456-65, 2008 Jun.

Article in English | MEDLINE | ID: mdl-18537396

ABSTRACT

This study evaluates the effects of phrase boundaries on the intra- and intergestural kinematic characteristics of blended gestures, i.e., overlapping gestures produced with a single articulator. The sequences examined are the juncture geminate [d(#)d], the sequence [d(#)z], and, for comparison, the singleton tongue tip gesture in [d(#)b]. This allows the investigation of the process of gestural aggregation [Munhall, K. G., and Lofqvist, A. (1992). "Gestural aggregation in speech: laryngeal gestures," J. Phonetics 20, 93-110] and the manner in which it is affected by prosodic structure. Juncture geminates are predicted to be affected by prosodic boundaries in the same way as other gestures; that is, they should display prosodic lengthening and lesser overlap across a boundary. Articulatory prosodic lengthening is also investigated using a signal alignment method of the functional data analysis framework [Ramsay, J. O., and Silverman, B. W. (2005). Functional Data Analysis, 2nd ed. (Springer-Verlag, New York)]. This provides the ability to examine a time warping function that characterizes relative timing difference (i.e., lagging or advancing) of a test signal with respect to a given reference, thus offering a way of illuminating local nonlinear deformations at work in prosodic lengthening. These findings are discussed in light of the pi-gesture framework of Byrd and Saltzman [(2003) "The elastic phrase: Modeling the dynamics of boundary-adjacent lengthening," J. Phonetics 31, 149-180].

Subject(s)

Biomechanical Phenomena , Phonation , Speech Articulation Tests , Speech/physiology , Tongue/physiology , Gestures , Humans , Language , Motor Activity , Movement , Speech Production Measurement

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL