Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
Add more filters










Publication year range
1.
Magn Reson Med ; 91(1): 61-74, 2024 01.
Article in English | MEDLINE | ID: mdl-37677043

ABSTRACT

PURPOSE: To improve the spatiotemporal qualities of images and dynamics of speech MRI through an improved data sampling and image reconstruction approach. METHODS: For data acquisition, we used a Poisson-disc random under sampling scheme that reduced the undersampling coherence. For image reconstruction, we proposed a novel locally higher-rank partial separability model. This reconstruction model represented the oral and static regions using separate low-rank subspaces, therefore, preserving their distinct temporal signal characteristics. Regional optimized temporal basis was determined from the regional-optimized virtual coil approach. Overall, we achieved a better spatiotemporal image reconstruction quality with the potential of reducing total acquisition time by 50%. RESULTS: The proposed method was demonstrated through several 2-mm isotropic, 64 mm total thickness, dynamic acquisitions with 40 frames per second and compared to the previous approach using a global subspace model along with other k-space sampling patterns. Individual timeframe images and temporal profiles of speech samples were shown to illustrate the ability of the Poisson-disc under sampling pattern in reducing total acquisition time. Temporal information of sagittal and coronal directions was also shown to illustrate the effectiveness of the locally higher-rank operator and regional optimized temporal basis. To compare the reconstruction qualities of different regions, voxel-wise temporal SNR analysis were performed. CONCLUSION: Poisson-disc sampling combined with a locally higher-rank model and a regional-optimized temporal basis can drastically improve the spatiotemporal image quality and provide a 50% reduction in overall acquisition time.


Subject(s)
Magnetic Resonance Imaging , Speech , Magnetic Resonance Imaging/methods , Image Processing, Computer-Assisted/methods , Algorithms
2.
Cleft Palate Craniofac J ; : 10556656231183385, 2023 Jun 19.
Article in English | MEDLINE | ID: mdl-37335134

ABSTRACT

OBJECTIVE: To introduce a highly innovative imaging method to study the complex velopharyngeal (VP) system and introduce the potential future clinical applications of a VP atlas in cleft care. DESIGN: Four healthy adults participated in a 20-min dynamic magnetic resonance imaging scan that included a high-resolution T2-weighted turbo-spin-echo 3D structural scan and five custom dynamic speech imaging scans. Subjects repeated a variety of phrases when in the scanner as real-time audio was captured. SETTING: Multisite institution and clinical setting. PARTICIPANTS: Four adult subjects with normal anatomy were recruited for this study. MAIN OUTCOME: Establishment of 4-D atlas constructed from dynamic VP MRI data. RESULTS: Three-dimensional dynamic magnetic resonance imaging was successfully used to obtain high quality dynamic speech scans in an adult population. Scans were able to be re-sliced in various imaging planes. Subject-specific MR data were then reconstructed and time-aligned to create a velopharyngeal atlas representing the averaged physiological movements across the four subjects. CONCLUSIONS: The current preliminary study examined the feasibility of developing a VP atlas for potential clinical applications in cleft care. Our results indicate excellent potential for the development and use of a VP atlas for assessing VP physiology during speech.

3.
Magn Reson Med ; 89(2): 652-664, 2023 02.
Article in English | MEDLINE | ID: mdl-36289572

ABSTRACT

PURPOSE: To enable a more comprehensive view of articulations during speech through near-isotropic 3D dynamic MRI with high spatiotemporal resolution and large vocal-tract coverage. METHODS: Using partial separability model-based low-rank reconstruction coupled with a sparse acquisition of both spatial and temporal models, we are able to achieve near-isotropic resolution 3D imaging with a high frame rate. The total acquisition time of the speech acquisition is shortened by introducing a sparse temporal sampling that interleaves one temporal navigator with four randomized phase and slice-encoded imaging samples. Memory and computation time are improved through compressing coils based on the region of interest for low-rank constrained reconstruction with an edge-preserving spatial penalty. RESULTS: The proposed method has been evaluated through experiments on several speech samples, including a standard reading passage. A near-isotropic 1.875 × 1.875 × 2 mm3 spatial resolution, 64-mm through-plane coverage, and a 35.6-fps temporal resolution are achieved. Investigations and analysis on specific speech samples support novel insights into nonsymmetric tongue movement, velum raising, and coarticulation events with adequate visualization of rapid articulatory movements. CONCLUSION: Three-dimensional dynamic images of the vocal tract structures during speech with high spatiotemporal resolution and axial coverage is capable of enhancing linguistic research, enabling visualization of soft tissue motions that are not possible with other modalities.


Subject(s)
Magnetic Resonance Imaging , Speech , Magnetic Resonance Imaging/methods , Imaging, Three-Dimensional/methods , Language , Linguistics
4.
Sci Rep ; 12(1): 1239, 2022 01 24.
Article in English | MEDLINE | ID: mdl-35075188

ABSTRACT

Travertine crystal growth ripples are used to reconstruct the early hydraulic history of the Anio Novus aqueduct of ancient Rome. These crystalline morphologies deposited within the aqueduct channel record the hydraulic history of gravity-driven turbulent flow at the time of Roman operation. The wavelength, amplitude, and steepness of these travertine crystal growth ripples indicate that large-scale sustained aqueduct flows scaled directly with the thickness of the aqueous viscous sublayer. Resulting critical shear Reynolds numbers are comparable with those reconstructed from heat/mass transfer crystalline ripples formed in other natural and engineered environments. This includes sediment transport in rivers, lakes, and oceans, chemical precipitation and dissolution in caves, and melting and freezing in ice. Where flow depth and perimeter could be reconstructed from the distribution and stratigraphy of the travertine within the Anio Novus aqueduct, flow velocity and rate have been quantified by deriving roughness-flow relationships that are independent of water temperature. More generally, under conditions of near-constant water temperature and kinematic viscosity within the Anio Novus aqueduct channel, the travertine crystal growth ripple wavelengths increased with decreasing flow velocity, indicating that systematic changes took place in flow rate during travertine deposition. This study establishes that travertine crystal growth ripples such as those preserved in the Anio Novus provide a sensitive record of past hydraulic conditions, which can be similarly reconstructed from travertine deposited in other ancient water conveyance and storage systems around the world.

5.
J Acoust Soc Am ; 150(5): 3500, 2021 11.
Article in English | MEDLINE | ID: mdl-34852570

ABSTRACT

Magnetic resonance (MR) imaging is becoming an established tool in capturing articulatory and physiological motion of the structures and muscles throughout the vocal tract and enabling visual and quantitative assessment of real-time speech activities. Although motion capture speed has been regularly improved by the continual developments in high-speed MR technology, quantitative analysis of multi-subject group data remains challenging due to variations in speaking rate and imaging time among different subjects. In this paper, a workflow of post-processing methods that matches different MR image datasets within a study group is proposed. Each subject's recorded audio waveform during speech is used to extract temporal domain information and generate temporal alignment mappings from their matching pattern. The corresponding image data are resampled by deformable registration and interpolation of the deformation fields, achieving inter-subject temporal alignment between image sequences. A four-dimensional dynamic MR speech atlas is constructed using aligned volumes from four human subjects. Similarity tests between subject and target domains using the squared error, cross correlation, and mutual information measures all show an overall score increase after spatiotemporal alignment. The amount of image variability in atlas construction is reduced, indicating a quality increase in the multi-subject data for groupwise quantitative analysis.


Subject(s)
Algorithms , Speech , Humans , Magnetic Resonance Imaging , Motion , Movement
6.
Magn Reson Med ; 77(4): 1619-1629, 2017 04.
Article in English | MEDLINE | ID: mdl-27099178

ABSTRACT

PURPOSE: To achieve high temporal frame rate, high spatial resolution and full-vocal-tract coverage for three-dimensional dynamic speech MRI by using low-rank modeling and sparse sampling. METHODS: Three-dimensional dynamic speech MRI is enabled by integrating a novel data acquisition strategy and an image reconstruction method with the partial separability model: (a) a self-navigated sparse sampling strategy that accelerates data acquisition by collecting high-nominal-frame-rate cone navigator sand imaging data within a single repetition time, and (b) are construction method that recovers high-quality speech dynamics from sparse (k,t)-space data by enforcing joint low-rank and spatiotemporal total variation constraints. RESULTS: The proposed method has been evaluated through in vivo experiments. A nominal temporal frame rate of 166 frames per second (defined based on a repetition time of 5.99 ms) was achieved for an imaging volume covering the entire vocal tract with a spatial resolution of 2.2 × 2.2 × 5.0 mm3 . Practical utility of the proposed method was demonstrated via both validation experiments and a phonetics investigation. CONCLUSION: Three-dimensional dynamic speech imaging is possible with full-vocal-tract coverage, high spatial resolution and high nominal frame rate to provide dynamic speech data useful for phonetic studies. Magn Reson Med 77:1619-1629, 2017. © 2016 International Society for Magnetic Resonance in Medicine.


Subject(s)
Image Interpretation, Computer-Assisted/methods , Imaging, Three-Dimensional/methods , Magnetic Resonance Imaging, Cine/methods , Speech Production Measurement/methods , Speech/physiology , Tongue/diagnostic imaging , Adult , Algorithms , Female , Humans , Image Enhancement/methods , Larynx/anatomy & histology , Larynx/diagnostic imaging , Larynx/physiology , Male , Reproducibility of Results , Sensitivity and Specificity , Signal Processing, Computer-Assisted , Tongue/anatomy & histology , Tongue/physiology , Young Adult
7.
J Acoust Soc Am ; 140(3): 2145, 2016 09.
Article in English | MEDLINE | ID: mdl-27914422

ABSTRACT

Hypernasality is one of the most detrimental speech disturbances that lead to declines of speech intelligibility. Velopharyngeal inadequacy, which is associated with anatomic defects such as cleft palate or neuromuscular disorders that affect velopharygneal function, is the primary cause of hypernasality. A simulation study by Rong and Kuehn [J. Speech Lang. Hear. Res. 55(5), 1438-1448 (2012)] demonstrated that properly adjusted oropharyngeal articulation can reduce nasality for vowels synthesized with an articulatory model [Mermelstein, J. Acoust. Soc. Am. 53(4), 1070-1082 (1973)]. In this study, a speaker-adaptive articulatory model was developed to simulate speaker-customized oropharyngeal articulatory adaptation to compensate for the acoustic effects of nasalization on /a/, /i/, and /u/. The results demonstrated that (1) the oropharyngeal articulatory adaptation effectively counteracted the effects of nasalization on the second lowest formant frequency (F2) and partially compensated for the effects of nasalization on vowel space (e.g., shifting and constriction of vowel space) and (2) the articulatory adaptation strategies generated by the speaker-adaptive model might be more efficacious for counteracting the acoustic effects of nasalization compared to the adaptation strategies generated by the standard articulatory model in Rong and Kuehn. The findings of this study indicated the potential of using oropharyngeal articulatory adaptation as a means to correct maladaptive articulatory behaviors and to reduce nasality.


Subject(s)
Acoustics , Dysphonia , Phonetics , Speech Acoustics , Speech Intelligibility , Speech Production Measurement
8.
Annu Int Conf IEEE Eng Med Biol Soc ; 2015: 1568-71, 2015 Aug.
Article in English | MEDLINE | ID: mdl-26736572

ABSTRACT

Dynamic speech magnetic resonance imaging (DSMRI) is a promising technique for visualizing articulatory motion in real time. However, many existing applications of DSMRI have been limited by slow imaging speed and the lack of quantitative motion analysis. In this paper, we present a novel DS-MRI technique to simultaneously estimate dynamic image sequence of speech and the associated deformation field. Extending on our previous Partial Separability (PS) model-based methods, the proposed technique visualizes both speech motion and deformation with a spatial resolution of 2.2 × 2.2 mm(2) and a nominal frame rate of 100 fps. Also, the technique enables direct analysis of articulatory motion through the deformation fields. Effectiveness of the method is systematically examined via in vivo experiments. Utilizing the obtained high-resolution images and deformation fields, we also performed a phonetics study on Brazilian Portuguese to show the method's practical utility.


Subject(s)
Speech , Brazil , Humans , Magnetic Resonance Imaging , Motion , Phonetics
9.
Magn Reson Med ; 73(5): 1820-32, 2015 May.
Article in English | MEDLINE | ID: mdl-24912452

ABSTRACT

PURPOSE: To enable dynamic speech imaging with high spatiotemporal resolution and full-vocal-tract spatial coverage, leveraging recent advances in sparse sampling. METHODS: An imaging method is developed to enable high-speed dynamic speech imaging exploiting low-rank and sparsity of the dynamic images of articulatory motion during speech. The proposed method includes: (a) a novel data acquisition strategy that collects spiral navigators with high temporal frame rate and (b) an image reconstruction method that derives temporal subspaces from navigators and reconstructs high-resolution images from sparsely sampled data with joint low-rank and sparsity constraints. RESULTS: The proposed method has been systematically evaluated and validated through several dynamic speech experiments. A nominal imaging speed of 102 frames per second (fps) was achieved for a single-slice imaging protocol with a spatial resolution of 2.2 × 2.2 × 6.5 mm(3) . An eight-slice imaging protocol covering the entire vocal tract achieved a nominal imaging speed of 12.8 fps with the identical spatial resolution. The effectiveness of the proposed method and its practical utility was also demonstrated in a phonetic investigation. CONCLUSION: High spatiotemporal resolution with full-vocal-tract spatial coverage can be achieved for dynamic speech imaging experiments with low-rank and sparsity constraints.


Subject(s)
Image Processing, Computer-Assisted/methods , Magnetic Resonance Imaging, Cine/methods , Mouth/physiology , Nasal Cavity/physiology , Palate, Soft/physiology , Pharynx/physiology , Phonation/physiology , Speech Articulation Tests/methods , Speech/physiology , Tongue/physiology , Computer Simulation , Female , Humans , Image Processing, Computer-Assisted/instrumentation , Phonetics , Reference Values , Speech Acoustics , Speech Articulation Tests/instrumentation
10.
J Acoust Soc Am ; 131(1): 455-65, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22280607

ABSTRACT

There is increasing evidence that fine articulatory adjustments are made by speakers to reinforce and sometimes counteract the acoustic consequences of nasality. However, it is difficult to attribute the acoustic changes in nasal vowel spectra to either oral cavity configuration or to velopharyngeal opening (VPO). This paper takes the position that it is possible to disambiguate the effects of VPO and oropharyngeal configuration on the acoustic output of the vocal tract by studying the position and movement of the tongue and lips during the production of oral and nasal vowels. This paper uses simultaneously collected articulatory, acoustic, and nasal airflow data during the production of all oral and phonemically nasal vowels in Hindi (four speakers) to understand the consequences of the movements of oral articulators on the spectra of nasal vowels. For Hindi nasal vowels, the tongue body is generally lowered for back vowels, fronted for low vowels, and raised for front vowels (with respect to their oral congeners). These movements are generally supported by accompanying changes in the vowel spectra. In Hindi, the lowering of back nasal vowels may have originally served to enhance the acoustic salience of nasality, but has since engendered a nasal vowel chain shift.


Subject(s)
Lip/physiology , Phonetics , Speech Acoustics , Tongue/physiology , Female , Humans , Male , Movement , Nasal Cavity/physiology , Phonation/physiology , Velopharyngeal Sphincter/physiology
11.
Lang Speech ; 55(Pt 4): 477-502, 2012 Dec.
Article in English | MEDLINE | ID: mdl-23420979

ABSTRACT

Are palatal consonants articulated by multiple tongue gestures (coronal and dorsal) or by a single gesture that brings the tongue into contact with the palate at several places of articulation? The lenition of palatal consonants (resulting in approximants) has been presented as evidence that palatals are simple, not complex: When reduced, they do not lose their coronal gesture and become dorsals; instead, they manifest reduced linguopalatal contact while retaining their anterior place of articulation. The frequently-reported deocclusivization of the Brazilian Portuguese (BP) palatal nasal may support this claim. However, the linguopalatal configuration of this sound has not been studied directly. Electropalatographic evidence from three speakers of BP (compared with data from three speakers of Peninsular Spanish) demonstrates that the palatal nasal is frequently realized as an approximant. There is no evidence of anterior occlusion in BP's post-palatal, lenited nasal. Under conditions of focus/hyperarticulation, there is no evidence of stronger/more anterior occlusion. We argue that the articulatory target of the BP palatal nasal is neither occluded nor anterior.


Subject(s)
Models, Biological , Palate/physiology , Phonetics , Speech/physiology , Tongue/physiology , Brazil , Electrophysiology , Female , Humans , Language , Male , Nasal Cavity/physiology , Sound Spectrography , Spain
12.
Phonetica ; 67(1-2): 47-62, 2010.
Article in English | MEDLINE | ID: mdl-20798569

ABSTRACT

In the typical speech of any language, voicing onset and offset are effortlessly coordinated with articulation as part of the intrinsic coordination of sound production. In this paper, we argue that voicing-articulatory coordination patterns could be shaped by sensory feedback during early speech learning and these patterns persist in mature syllable productions. Our experimental results show that voicing onset is closely associated with the peak velocity and peak amplitude of jaw and upper lip movements for VC syllables in adults. This robust coordination in the onset position may function to increase the salience of VC syllables and provide a phonetically natural explanation for vowels to undergo phonological lengthening and to avoid phonological reduction in word-initial onset position.


Subject(s)
Cues , Feedback, Sensory/physiology , Phonation/physiology , Phonetics , Speech Articulation Tests , Speech Perception/physiology , Adult , Biomechanical Phenomena/physiology , Glottis/physiology , Humans , Jaw/physiology , Language Development , Male , Sound Spectrography , Verbal Behavior/physiology
SELECTION OF CITATIONS
SEARCH DETAIL
...