Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
Laryngoscope ; 133(10): 2687-2694, 2023 10.
Article in English | MEDLINE | ID: mdl-36715109

ABSTRACT

OBJECTIVE: The purpose of this study was to determine whether automated estimates of vocal creak would differentiate speakers with adductor laryngeal dystonia (AdLD) from speakers with muscle tension dysphonia (MTD) and speakers without voice disorders. METHODS: Sixteen speakers with AdLD, sixteen speakers with MTD, and sixteen speakers without voice disorders were recorded in a quiet environment reading aloud a standard paragraph. An open-source creak detector was used to calculate the percentage of creak (% creak) in each of the speaker's six recorded sentences. RESULTS: A Kruskal-Wallis one-way analysis of variance revealed a statistically significant effect of group on the % creak with a large effect size. Pairwise Wilcoxon tests revealed a statistically significant difference in % creak between speakers with AdLD and controls as well as between speakers with AdLD and MTD. Receiver operating characteristic curve analyses indicated that % creak differentiated AdLD from both controls and speakers with MTD with high sensitivity and specificity (area under the curve statistics of 0.94 and 0.86, respectively). CONCLUSION: Percentage of creak as calculated by an automated creak detector may be useful as a quantitative indicator of AdLD, demonstrating the potential for use as a screening tool or to aid in a differential diagnosis. LEVEL OF EVIDENCE: 3 Laryngoscope, 133:2687-2694, 2023.


Subject(s)
Dysphonia , Dystonia , Voice , Humans , Dysphonia/diagnosis , Muscle Tonus/physiology , Voice Quality , Voice/physiology , Laryngeal Muscles
2.
J Voice ; 37(4): 553-560, 2023 Jul.
Article in English | MEDLINE | ID: mdl-33992477

ABSTRACT

PURPOSE: Society has become increasingly dependent on telecommunication, which has been shown to negatively impact vocal function. This study explores the use of sidetone regulation during audio-visual communication as one potential technique to alleviate the effects of telecommunication on the voice. METHOD: The speech acoustics of 18 participants with typical voices were measured during conversational tasks during three conditions of sidetone amplification: baseline (no sidetone amplification), low sidetone amplification, and high sidetone amplification. Vocal intensity, vocal quality (estimated using acoustic measures of the low-high ratio and the smoothed cepstral peak prominence), and self-perceived vocal effort were used to measure the impacts of sidetone amplification on vocal function. RESULTS: Compared to baseline, there were statistically significant decreases in vocal intensity and increases in low-high ratio in the high level of sidetone amplification condition. Changes in these measures were not significantly correlated. When asked to rank conditions based on their perceived vocal effort, participants most often ranked the high level of sidetone amplification as least effortful; however, the visual-analog ratings of vocal effort were not significantly different between conditions. The smoothed cepstral peak prominence did not change with varying levels of sidetone amplification. CONCLUSIONS: Vocal intensity decreased with high levels of sidetone amplification. High levels of sidetone amplification also resulted in increases in the low-high ratio, which were shown to be more than just a byproduct of decreased vocal intensity. The impact of sidetone amplification on vocal effort was less clear, but results suggested that participants generally decreased their vocal effort with increased levels of sidetone amplification. This was a preliminary study and future work is warranted in a population of participants with voice complaints and in a more noisy, realistic environments.


Subject(s)
Telecommunications , Voice , Humans , Speech , Speech Acoustics , Voice Quality
3.
J Speech Lang Hear Res ; 65(11): 4085-4095, 2022 11 17.
Article in English | MEDLINE | ID: mdl-36198059

ABSTRACT

OBJECTIVE: Currently, no clinically feasible objective measures exist that are specific to the signs of adductor laryngeal dystonia (LD), deterring effective diagnosis and treatment. This project sought to establish concurrent validity of a new automated acoustic outcome measure, designed to be specific to adductor laryngeal dystonia (AdLD): the spectral aggregate of the high-passed fundamental frequency contour (SAHf o). METHOD: Twenty speakers with AdLD read voiced phoneme-loaded (more symptomatic) and voiceless phoneme-loaded (less symptomatic) sentences. LD discontinuities (defined as phonatory breaks, frequency shifts, and creak), the acoustic ramifications of laryngeal spasms, were manually identified. The frequency content of the f o contour was examined as a function of time, and content above 1000 Hz was summed to automatically calculate SAHf o. Multiple linear regression analysis was applied to SAHf o based on LD discontinuities and sentence type (voiced or voiceless phoneme-loaded). RESULTS: The regression model accounted for 41.1% of the variance in SAHf o. Both the LD discontinuities and sentence type were statistically related to SAHf o. CONCLUSION: Results of this study provide evidence of concurrent validity. SAHf o is an automatic outcome measure specific to acoustic signs of AdLD that may be useful to track treatment progress.


Subject(s)
Dysphonia , Dystonia , Humans , Dystonia/diagnosis , Speech Acoustics , Acoustics , Phonation
4.
J Speech Lang Hear Res ; 65(4): 1349-1369, 2022 04 04.
Article in English | MEDLINE | ID: mdl-35263546

ABSTRACT

PURPOSE: This study examined the discriminative ability of acoustic indices of vocal hyperfunction combining smoothed cepstral peak prominence (CPPS) and relative fundamental frequency (RFF). METHOD: Demographic, CPPS, and RFF parameters were entered into logistic regression models trained on two 1:1 case-control groups: individuals with and without nonphonotraumatic vocal hyperfunction (NPVH; n = 360) and phonotraumatic vocal hyperfunction (PVH; n = 240). Equations from the final models were used to predict group membership in two independent test sets (n = 100 each). RESULTS: Both CPPS and RFF parameters significantly improved model fits for NPVH and PVH after accounting for demographics. CPPS explained unique variance beyond RFF in both models. RFF explained unique variance beyond CPPS in the PVH model. Final models included CPPS and RFF offset parameters for both NPVH and PVH; RFF onset parameters were significant only in the PVH model. Area under the receiver operating characteristic curve analysis for the independent test sets revealed acceptable classification for NPVH (72%) and good classification for PVH (86%). CONCLUSIONS: A combination of CPPS and RFF parameters showed better discriminative ability than either measure alone for PVH. Clinical cutoff scores for acoustic indices of vocal hyperfunction are proposed for assessment and screening purposes.


Subject(s)
Voice Disorders , Voice , Acoustics , Humans , ROC Curve , Speech Acoustics , Voice Disorders/diagnosis
5.
Am J Speech Lang Pathol ; 30(6): 2446-2455, 2021 11 04.
Article in English | MEDLINE | ID: mdl-34473568

ABSTRACT

Purpose This study assessed the reliability and accuracy of auditory-perceptual voice evaluations by experienced clinicians via telepractice platforms. Method Voice samples from 20 individuals were recorded after transmission via telepractice platforms. Twenty experienced clinicians (10 speech-language pathologists, 10 laryngologists) evaluated the samples for dysphonia percepts (overall severity, roughness, breathiness, and strain) using a modified Consensus Auditory-Perceptual Evaluation of Voice. Reliability was calculated as the mean of squared differences between repeated ratings (intrarater agreement), and between individual and group mean ratings (interrater agreement). Repeated measures analyses of variance were constructed to measure effects of transmission condition (e.g., original recording, WebEx, Zoom), dysphonia percept, and their interaction on intrarater agreement, interrater agreement, and average ratings. Significant effects were evaluated with post hoc Tukey's tests. Results There were significant effects of transmission condition, percept, and their interaction on average ratings, and a significant effect of percept on interrater agreement. Post hoc testing revealed statistically, but not clinically, significant differences in average roughness ratings across transmission conditions, and significant differences in interrater agreement for several percepts. Overall severity had the highest agreement and strain had the lowest. Conclusion Telepractice transmission does not substantially reduce reliability or accuracy of auditory-perceptual voice evaluations by experienced clinicians.


Subject(s)
Dysphonia , Voice , Dysphonia/diagnosis , Humans , Observer Variation , Reproducibility of Results , Voice Quality
6.
J Acoust Soc Am ; 149(4): 2189, 2021 04.
Article in English | MEDLINE | ID: mdl-33940922

ABSTRACT

The acoustic measure, relative fundamental frequency (RFF), has been proposed as an objective metric for assessing vocal hyperfunction; however, its underlying physiological mechanisms have not yet been fully characterized. This study aimed to characterize the relationship between RFF and vocal fold kinematics. Simultaneous acoustic and high-speed videoendoscopic (HSV) recordings were collected as younger and older speakers repeated the utterances /ifi/ and /iti/. RFF values at voicing offsets and onsets surrounding the obstruents were estimated from acoustic recordings, whereas glottal angles, durations of voicing offset and onset, and a kinematic estimate of laryngeal stiffness (KS) were obtained from HSV images. No differences were found between younger and older speakers for any measure. RFF did not differ between the two obstruents at voicing offset; however, fricatives necessitated larger glottal angles and longer durations to devoice. RFF values were lower and glottal angles were greater for stops relative to fricatives at voicing onset. KS values were greater in stops relative to fricatives. The less adducted vocal folds with greater KS and lower RFF at voicing onset for stops relative to fricatives in this study were in accordance with prior speculations that decreased vocal fold contact area and increased laryngeal stiffness may decrease RFF.


Subject(s)
Larynx , Vocal Cords , Acoustics , Biomechanical Phenomena , Phonation , Speech Acoustics
7.
Appl Sci (Basel) ; 11(9)2021 May.
Article in English | MEDLINE | ID: mdl-36188437

ABSTRACT

Methods for automating relative fundamental frequency (RFF)-an acoustic estimate of laryngeal tension-rely on manual identification of voiced/unvoiced boundaries from acoustic signals. This study determined the effect of incorporating features derived from vocal fold vibratory transitions for acoustic boundary detection. Simultaneous microphone and flexible nasendoscope recordings were collected from adults with typical voices (N=69) and with voices characterized by excessive laryngeal tension (N=53) producing voiced-unvoiced-voiced utterances. Acoustic features that coincided with vocal fold vibratory transitions were identified and incorporated into an automated RFF algorithm ("aRFF-APH"). Voiced/unvoiced boundary detection accuracy was compared between the aRFF-APH algorithm, a recently published version of the automated RFF algorithm ("aRFF-AP"), and gold-standard, manual RFF estimation. Chi-square tests were performed to characterize differences in boundary cycle identification accuracy among the three RFF estimation methods. Voiced/unvoiced boundary detection accuracy significantly differed by RFF estimation method for voicing offsets and onsets. Of 7721 productions, 76.0% of boundaries were accurately identified via the aRFF-APH algorithm, compared to 70.3% with the aRFF-AP algorithm and 20.4% with manual estimation. Incorporating acoustic features that corresponded with voiced/unvoiced boundaries led to improvements in boundary detection accuracy that surpassed the gold-standard method for calculating RFF.

8.
J Speech Lang Hear Res ; 62(6): 1685-1706, 2019 06 19.
Article in English | MEDLINE | ID: mdl-31181175

ABSTRACT

Objective Prephonatory vocal fold angle trajectories may supply useful information about the laryngeal system but were examined in previous studies using sigmoidal curves fit to data collected at 30 frames per second (fps). Here, high-speed videoendoscopy (HSV) was used to investigate the impacts of video frame rate and sigmoidal fitting strategy on vocal fold adductory patterns for voicing onsets. Method Twenty-five participants with healthy voices performed /ifi/ sequences under flexible nasendoscopy at 1,000 fps. Glottic angles were extracted during adduction for voicing onset; resulting vocal fold trajectories (i.e., changes in glottic angle over time) were down-sampled to simulate different frame rate conditions (30-1,000 fps). Vocal fold adduction data were fit with asymmetric sigmoids using 5 fitting strategies with varying parameter restrictions. Adduction trajectories and maximum adduction velocities were compared between the fits and the actual HSV data. Adduction trajectory errors between HSV data and fits were evaluated using root-mean-square error and maximum angular velocity error. Results Simulated data were generally well fit by sigmoid models; however, when compared to the actual 1,000-fps data, sigmoid fits were found to overestimate maximum angle velocities. Errors decreased as frame rate increased, reaching a plateau by 120 fps. Conclusion In healthy adults, vocal fold kinematic behavior during adduction is generally sigmoidal, although such fits can produce substantial errors when data are acquired at frame rates lower than 120 fps.


Subject(s)
Laryngoscopy/methods , Phonation/physiology , Video Recording/methods , Vocal Cords/diagnostic imaging , Voice/physiology , Adolescent , Adult , Biomechanical Phenomena , Computer Simulation , Female , Glottis/diagnostic imaging , Glottis/physiology , Humans , Larynx/diagnostic imaging , Larynx/physiology , Male , Vocal Cords/physiology , Young Adult
9.
J Speech Lang Hear Res ; 62(4): 815-834, 2019 04 15.
Article in English | MEDLINE | ID: mdl-30969902

ABSTRACT

Purpose This study aimed to examine the relationship between a large set of hypothesized physiological measures of vocal effort and self-ratings of vocal effort. Method Twenty-six healthy adults modulated speech rate and vocal effort during repetitions of the utterance /ifi/, followed by self-perceptual ratings of vocal effort on a visual analog scale. Physiological measures included (a) intrinsic laryngeal tension via kinematic stiffness ratios determined from high-speed laryngoscopy, (b) extrinsic suprahyoid and infrahyoid laryngeal tension via normalized percent activations and durations derived from surface electromyography, (c) supraglottal compression via expert visual-perceptual ratings, and (d) subglottal pressure via magnitude of neck surface vibrations from an accelerometer signal. Results Individual statistical models revealed that all of the physiological predictors, except for kinematic stiffness ratios, were significantly predictive of self-ratings of vocal effort. However, a combined regression model analysis yielded only 3 significant predictors: subglottal pressure, mediolateral supraglottal compression, and the normalized percent activation of the suprahyoid muscles (adjusted R 2 = .60). Conclusions Vocal effort manifests as increases in specific laryngeal physiological measures. Further work is needed to examine these measures in combination with other contributing factors, as well as in speakers with dysphonia.


Subject(s)
Larynx/physiology , Physical Exertion/physiology , Self Concept , Voice/physiology , Accelerometry , Adult , Biomechanical Phenomena , Electromyography , Female , Healthy Volunteers , Humans , Laryngoscopy , Male , Vibration
10.
Appl Sci (Basel) ; 9(11)2019 Jun 01.
Article in English | MEDLINE | ID: mdl-34267956

ABSTRACT

The development of trauma-induced lesions of the vocal folds (VFs) has been linked to a high collision pressure on the VF surface. However, there are no direct methods for the clinical assessment of VF collision, thus limiting the objective assessment of these disorders. In this study, we develop a video processing technique to directly quantify the mechanical impact of the VFs using solely laryngeal kinematic data. The technique is based on an edge tracking framework that estimates the kinematic sequence of each VF edge with a Kalman filter approach and a Hertzian impact model to predict the contact force during the collision. The proposed formulation overcomes several limitations of prior efforts since it uses a more relevant VF contact geometry, it does not require calibrated physical dimensions, it is normalized by the tissue properties, and it applies a correction factor for using a superior view only. The proposed approach is validated against numerical models, silicone vocal fold models, and prior studies. A case study with high-speed videoendoscopy recordings provides initial insights between the sound pressure level and contact pressure. Thus, the proposed method has a high potential in clinical practice and could also be adapted to operate with laryngeal stroboscopic systems.

SELECTION OF CITATIONS
SEARCH DETAIL
...