Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 45
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Folia Phoniatr Logop ; 2023 Dec 29.
Artigo em Inglês | MEDLINE | ID: mdl-38160667

RESUMO

PURPOSE: This study examined the immediate acoustic, auditory-perceptual, and self-perceptual effects of two semi-occluded vocal tract exercises (SOVTs): straw phonation and straw phonation into a cup of water, delivered in a remote setting. METHOD: 36 participants (19 females and 17 males) completed a baseline battery of acoustic recordings, followed by one of two SOVTs, and an identical post-task battery. The procedure repeated itself to include the other SOVT. Participants were also asked to rate their self-perceived vocal effort and quality following each condition. Recordings were presented to three expert listeners for completion of auditory-perceptual analysis. RESULTS: Acoustically, a significant decrease in shimmer was noted following straw phonation. Auditory-perceptual analysis revealed a significant increase in the perception of strain following straw phonation into a cup of water. While no significant differences were found between SOVT tasks in self-perception of vocal effort, a significant increase in self-perception of vocal loudness was reported following straw phonation into a cup of water. CONCLUSIONS: SOVTs have a varied, yet significant short-term impact across acoustic, auditory-perceptual, and self-perceptual measures of voice production. Straw phonation provided consistently significant acoustic results, with nearly every variable improving to some degree. Results also support the notion that shimmer is an acoustic measure that is particularly susceptible to change following modest manipulation. These results, in addition to the auditory-perceptual and self-perceptual findings, have a direct impact on how SOVTs are being used clinically and may generalize to inform the way voice metrics are collected and analyzed.

2.
J Voice ; 2023 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-37977969

RESUMO

OBJECTIVE: Diagnosis of adductor laryngeal dystonia (AdLD) is challenging as it mimics voice features of other voice disorders. This could lead to misdiagnosis (or delayed diagnosis) and ineffective treatments of AdLD. This paper develops automated measurements of glottal attack time (GAT) and glottal offset time (GOT) from high-speed videoendoscopy (HSV) in connected speech as objective measures that can potentially facilitate the diagnosis of this disorder in the future. METHODS: HSV data were recorded from vocally normal adults and patients with AdLD during the reading of the "Rainbow Passage" and six CAPE-V (Consensus Auditory-Perceptual Evaluation of Voice) sentences. A deep learning framework was designed and trained to segment the glottal area and detect the vocal fold edges in the HSV dataset. This automated framework allowed us to automatically measure and quantify the GATs and GOTs for the participants. Accordingly, a comparison was held between the obtained measurements among vocally normal speakers and those with AdLD. RESULTS: The automated framework was successfully developed and able to accurately segment the glottal area/edges. The precise automated measurements of GAT and GOT revealed minor, nonsignificant differences compared to the results of manual analysis-showing a strong correlation between the measures by the automated and manual methods. The results showed significant differences in the GAT values between the vocally normal subjects and AdLD patients, with larger variability in both the GAT and GOT measures in the AdLD group. CONCLUSIONS: The developed automated approach for GAT and GOT measurement can be valuable in clinical practice. These quantitative measurements can be used as meaningful biomarkers of the impaired vocal function in AdLD and help its differential diagnosis in the future.

3.
Appl Sci (Basel) ; 13(5)2023 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-37034315

RESUMO

Adductor spasmodic dysphonia (AdSD) disrupts laryngeal muscle control during speech and, therefore, affects the onset and offset of phonation. In this study, the goal is to use laryngeal high-speed videoendoscopy (HSV) to measure the glottal attack time (GAT) and glottal offset time (GOT) during connected speech for normophonic (vocally normal) and AdSD voices. A monochrome HSV system was used to record readings of six CAPE-V sentences and part of the "Rainbow Passage" from the participants. Three raters visually analyzed the HSV data using a playback software to measure the GAT and GOT. The results show that the GAT was greater in the AdSD group than in the normophonic group; however, the clinical significance of the amount of this difference needs to be studied further. More variability was observed in both GATs and GOTs of the disorder group. Additionally, the GAT and GOT time series were found to be nonstationary for the AdSD group while they were stationary for the normophonic voices. This study shows that the GAT and GOT measures can be potentially used as objective markers to characterize AdSD. The findings will potentially help in the development of standardized measures for voice evaluation and the accurate diagnosis of AdSD.

4.
J Voice ; 37(1): 26-36, 2023 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-33257208

RESUMO

OBJECTIVE: This study proposes a new computational framework for automated spatial segmentation of the vocal fold edges in high-speed videoendoscopy (HSV) data during connected speech. This spatio-temporal analytic representation of the vocal folds enables the HSV-based measurement of the glottal area waveform and other vibratory characteristics in the context of running speech. METHODS: HSV data were obtained from a vocally normal adult during production of the "Rainbow Passage." An algorithm based on an active contour modeling approach was developed for the analysis of HSV data. The algorithm was applied on a series of HSV kymograms at different intersections of the vocal folds to detect the edges of the vibrating vocal folds across the frames. This edge detection method follows a set of deformation rules for the active contours to capture the edges of the vocal folds through an energy optimization procedure. The detected edges in the kymograms were then registered back to the HSV frames. Subsequently, the glottal area waveform was calculated based on the area of the glottis enclosed by the vocal fold edges in each frame. RESULTS: The developed algorithm successfully captured the edges of the vocal folds in the HSV kymograms. This method led to an automated measurement of the glottal area waveform from the HSV frames during vocalizations in connected speech. CONCLUSION: The proposed algorithm serves as an automated method for spatial segmentation of the vocal folds in HSV data in connected speech. This study is one of the initial steps toward developing HSV-based measures to study vocal fold vibratory characteristics and voice production mechanisms in norm and disorder in the context of connected speech.


Assuntos
Laringe , Fala , Fonação , Gravação em Vídeo/métodos , Prega Vocal , Vibração
5.
J Voice ; 2022 Sep 22.
Artigo em Inglês | MEDLINE | ID: mdl-36154973

RESUMO

OBJECTIVE: Adductor spasmodic dysphonia (AdSD) is a neurogenic dystonia, which causes spasms of the laryngeal muscles. This disorder mainly affects production of connected speech. To understand how AdSD affects vocal fold (VF) movements and hence, the speech signal, it is necessary to study VF kinematics during the running speech. This paper introduces an automated method for analysis of VF vibrations in AdSD using laryngeal high-speed videoendoscopy (HSV) in running speech. METHODS: A monochrome HSV system was used to obtain video recordings from vocally normal individuals and AdSD patients during production of the six CAPE-V sentences and the "Rainbow Passage." A deep neural network was designed based on the UNet architecture. The network was developed for glottal area segmentation in HSV data providing a tool for quantitative analysis of VF vibrations in both norm and AdSD. The network was trained and validated using the manually labeled HSV frames. After training the network, the segmentation quality was quantitatively evaluated against visual analysis results of a test dataset including segregated HSV frames and a short sequence of VF vibrations in consecutive frames. RESULTS: The developed convolutional network was successfully trained and demonstrated an accurate segmentation on the testing dataset with a mean Intersection over Union (IoU) of 0.81 and a mean Boundary-F1 score of 0.93. Moreover, the visual assessment of the automated technique showed an accurate detection of the glottal edges/area in the HSV data even with challenging image quality and excessive laryngeal maneuvers of AdSD patients during the running speech. CONCLUSION: The introduced automated approach provides an accurate representation of the glottal edges/area during connected speech in HSV data for norm and AdSD patients. This method facilitates the development of HSV-based measures to quantify VF dynamics in AdSD. Using HSV to automatically analyze VF vibrations in AdSD can allow for understanding AdSD vocal mechanisms and characteristics.

6.
J Speech Lang Hear Res ; 65(6): 2098-2113, 2022 06 08.
Artigo em Inglês | MEDLINE | ID: mdl-35605603

RESUMO

PURPOSE: Voice disorders are best assessed by examining vocal fold dynamics in connected speech. This can be achieved using flexible laryngeal high-speed videoendoscopy (HSV), which enables us to study vocal fold mechanics with high temporal details. Analysis of vocal fold vibration using HSV requires accurate segmentation of the vocal fold edges. This article presents an automated deep-learning scheme to segment the glottal area in HSV from which the glottal edges are derived during connected speech. METHOD: Using a custom-built HSV system, data were obtained from a vocally healthy participant reciting the "Rainbow Passage." A deep neural network was designed for glottal area segmentation in the HSV data. A recently introduced hybrid approach by the authors was utilized as an automated labeling tool to train the network on a set of HSV frames, where the glottis region was automatically annotated during vocal fold vibrations. The network was then tested against manually segmented frames using different metrics, intersection over union (IoU), and Boundary F1 (BF) score, and its performance was assessed on various phonatory events on the HSV sequence. RESULTS: The designed network was successfully trained using the hybrid approach, without the need for manual labeling, and tested on the manually labeled data. The performance metrics showed a mean IoU of 0.82 and a mean BF score of 0.96. In addition, the evaluation assessment of the network's performance demonstrated an accurate segmentation of the glottal edges/area even during complex nonstationary phonatory events and when vocal folds were not vibrating, thus overcoming the limitations of the previous hybrid approach that could only be applied to the vibrating vocal folds. CONCLUSIONS: The introduced automated scheme guarantees accurate glottis representation in challenging color HSV data with lower image quality and excessive laryngeal maneuvers during all instances of connected speech. This facilitates the future development of HSV-based measures to assess the running vibratory characteristics of the vocal folds in speakers with and without voice disorder. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.19798864.


Assuntos
Aprendizado Profundo , Laringe , Distúrbios da Voz , Glote/diagnóstico por imagem , Humanos , Laringoscopia/métodos , Fonação , Fala , Vibração , Gravação em Vídeo , Prega Vocal/diagnóstico por imagem , Distúrbios da Voz/diagnóstico
7.
J Voice ; 2022 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-35304042

RESUMO

OBJECTIVE: Adductor spasmodic dysphonia (AdSD) is a neurogenic voice disorder, affecting the intrinsic laryngeal muscle control. AdSD leads to involuntary laryngeal spasms and only reveals during connected speech. Laryngeal high-speed videoendoscopy (HSV) coupled with a flexible fiberoptic endoscope provides a unique opportunity to study voice production and visualize the vocal fold vibrations in AdSD during speech. The goal of this study is to automatically detect instances during which the image of the vocal folds is optically obstructed in HSV recordings obtained during connected speech. METHODS: HSV data were recorded from vocally normal adults and patients with AdSD during reading of the "Rainbow Passage", six CAPE-V sentences, and production of the vowel /i/. A convolutional neural network was developed and trained as a classifier to detect obstructed/unobstructed vocal folds in HSV frames. Manually labelled data were used for training, validating, and testing of the network. Moreover, a comprehensive robustness evaluation was conducted to compare the performance of the developed classifier and visual analysis of HSV data. RESULTS: The developed convolutional neural network was able to automatically detect the vocal fold obstructions in HSV data in vocally normal participants and AdSD patients. The trained network was tested successfully and showed an overall classification accuracy of 94.18% on the testing dataset. The robustness evaluation showed an average overall accuracy of 94.81% on a massive number of HSV frames demonstrating the high robustness of the introduced technique while keeping a high level of accuracy. CONCLUSIONS: The proposed approach can be used for efficient analysis of HSV data to study laryngeal maneuvers in patients with AdSD during connected speech. Additionally, this method will facilitate development of vocal fold vibratory measures for HSV frames with an unobstructed view of the vocal folds. Indicating parts of connected speech that provide an unobstructed view of the vocal folds can be used for developing optimal passages for precise HSV examination during connected speech and subject-specific clinical voice assessment protocols.

8.
J Voice ; 2022 Jan 31.
Artigo em Inglês | MEDLINE | ID: mdl-35115223

RESUMO

OBJECTIVE: Evidence-based practice and precision medicine can significantly benefit from the ability to perform calibrated spatial measurements (eg, mm) from endoscopic images. However, calibrated measurements are not readily available from laryngeal images. Laser-projection endoscopes can provide the required information for performing calibrated spatial measurements, but their applications require a process known as calibration. During calibration, a set of benchtop recordings are used to determine the effect of confounding factors of spatial measurements, and also to learn their proper compensation strategies. Calibration benchtop recordings are acquired from flat surfaces and at a perpendicular imaging angle which is significantly different from in-vivo situations, where a three-dimensional (3D) surface gets recorded at a semi-unknown imaging angle. The aim of this study was to quantify changes in calibrated vertical and horizontal measurement accuracies as we move from the controlled condition of calibration to more realistic and uncontrolled settings. METHOD: A flat surface was positioned in front of a calibrated laser-projection transnasal fiberoptic endoscope at different working distances and imaging angles. Calibrated vertical and horizontal measurement errors were computed from each condition. Multiple linear regression analyses were used to quantify the dependence of vertical and horizontal measurement errors on the imaging angle and working distance. Next, a 3D-printed surface was positioned in front of the laser-projection endoscope at different working distances. Calibrated vertical and horizontal measurement errors were computed from each condition and then they were compared to measurement errors from a flat surface positioned at comparable working distances. RESULTS: The outcome of analyses supported a significant effect of imaging angle on calibrated vertical measurement accuracy, while no significant effect of imaging angle on calibrated horizontal measurement accuracy was established. Additionally, the result of multiple linear regression analyses showed that the coefficient of imaging angle was two times larger than the working distance, which further highlights the significant effect of imaging angle on vertical measurement accuracy. Comparing the magnitude of calibrated vertical and horizontal measurement errors between the 3D surface and a flat surface suggested a significant effect of surface topology on calibrated measurement accuracies. CONCLUSIONS: The mean percent magnitude of error of vertical and horizontal measurement errors from the 3D surface were respectively around 6% and 11%, at most working distances, which are acceptable for many applications. However, the significant effect of imaging angle and surface topology on measurement errors highlights the need for further research on these confounding factors. It also suggests that significant improvements in measurement accuracies may be achieved if these factors are properly accounted for during the calibration process. Last but not least, this study highlights the need for the evaluation of laser-projection endoscopes in uncontrolled and more realistic settings. Specifically, evaluations of laser-projection endoscopes in very controlled settings could significantly overestimate their accuracies and hence it will not represent their actual performances during in-vivo data acquisitions.

9.
J Voice ; 2022 Jan 02.
Artigo em Inglês | MEDLINE | ID: mdl-34986994

RESUMO

OBJECTIVE: Calibrated horizontal-plane measurements from laryngeal images could contribute significantly to refining evidence-based practice and developing patient-specific models and precision-medicine approaches. Laser-projection endoscopes can address the need for direct calibrated measures; however, these systems are not widely available. This study presents the framework for an alternative indirect horizontal-plane calibration approach. METHOD: A spatial attribute of a common object, a distinct characteristic that is maintained across images, may be used as a scale for the normalization of other spatial measurements. The outcome of this indirect approach could be used for absolute measurements (eg, in units of mm) or relative measurements (eg, percent change), depending on the information that is available from the common attribute. The required conditions of a common attribute for achieving a valid calibration outcome were studied. Three conditions were derived: registration accuracy of the common attribute, size consistency of the common attribute, and similarity in the vertical distance between the region of interest (ROI) (eg, vocal fold) and the common attribute. Any common attribute satisfying these three conditions was called proper and would result in a valid indirect calibration outcome. Three tests were presented for evaluating the properness of a common attribute. A data-driven statistical method was presented that can evaluate the registration accuracy of a common attribute. The second test used variation in calibrated lengths of a common attribute under different phonatory configurations for evaluating the size consistency condition. Finally, the effect of differences between vertical distances of the ROI and the common attribute was mathematically tested and quantified. The application of the proposed framework for indirect calibration was demonstrated using a pre existing dataset with a vocal fold as the ROI and four different common attributes (vocal fold length, vocal fold width, blood vessel on the vocal fold, and blood vessel on nearby tissue). RESULTS: The proposed registration-accuracy test was able to detect and eliminate instances of common attributes with low accuracies. The analysis suggested that among the studied four common attributes, the vocal fold length had the highest (ie, best) registration accuracy; however, the vocal fold length exhibited the lowest (ie, worst) size consistency. The analysis also suggested that, among the studied attributes, the vocal fold width offered the best trade-off among the three conditions and, hence, was a proper common attribute for calibrating spatial aspects of the vocal folds (length, displacement of edges, velocity, etc). CONCLUSION: Indirect calibration is a feasible alternative for calibration of laryngeal endoscopic images, given a proper common attribute is selected. Future work is needed to systematically evaluate the effects of various phonatory conditions on the characteristics of common attributes.

10.
J Voice ; 36(6): 755-769, 2022 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-32958427

RESUMO

Laryngeal images obtained via high-speed videoendoscopy are an invaluable source of information for the advancement of voice science because they can capture the true cycle-to-cycle vibratory characteristics of the vocal folds in addition to the transient behaviors of the phonatory mechanism, such as onset, offset, and breaks. This information is obtained through relating the spatial and temporal features from acquired images using objective measurements or subjective assessments. While these images are calibrated temporally, a great challenge is the lack of spatial calibration. Recently, a laser-projection system allowing for spatial calibration was developed. However, various sources of optical distortions deviate the images from reflecting the reality. The main purpose of this study was to evaluate the effect of the fiberoptic flexible endoscope distortions on the calibration of images acquired by the laser-projection system. Specifically, it is shown that two sources of nonlinear distortions could deviate captured images from reality. The first distortion stems from the wide-angle lens used in flexible endoscopes. It is shown that endoscopic images have a significantly higher spatial resolution in the center of the field of view than in its periphery. The difference between the two could lead to as high as 26.4% error in calibrated horizontal measurements. The second distortion stems from variation in the imaging angle. It is shown that the disparity between spatial resolution in the center and periphery of endoscopic images increases as the imaging angle deviates from the perpendicular position. Furthermore, it is shown that when the imaging angle varies, the symmetry of the distortion is also affected significantly. The combined distortions could lead to calibrated horizontal measurement errors as high as 65.7%. The implications of the findings on objective measurements and subjective visual assessments are discussed. These findings can contribute to the refinement of the methods for clinical assessment of voice disorders. Considering that the studied phenomena are due to optical principles, the findings of this study, especially those related to the effects of the imaging angle, can provide further insights regarding other endoscopic instruments (eg, distal-chip and rigid endoscopes) and procedures (eg, gastroendoscopy and colonoscopy).


Assuntos
Endoscópios , Laringe , Humanos , Prega Vocal/diagnóstico por imagem , Fonação , Endoscopia/métodos
11.
J Voice ; 36(2): 176-182, 2022 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-32712076

RESUMO

OBJECTIVE: To provide data on the measurable vocal fold vibratory differences in children with and without vocal fold lesions using high-speed videoendoscopy. DESIGN: Prospective study, 24 participants (8 healthy; 16 with lesions) between the ages of 5 and 10. METHODS: Rigid high-speed videoendoscopy at the rate of 8,000 frames per second was used to examine participants. Four objective vocal fold phase linearity measures were obtained to establish anterior-posterior contact and separation vibratory patterns. RESULTS: All objective measures showed a difference between nonlesion and bilateral vocal fold lesion groups. Contact-separation patterns in all nonlesion girls and young pre-pubertal boys exhibited an anterior-to-posterior contact and posterior-to-anterior separation; while older boys differed. The objective measures of open quotient, left-right relative phase asymmetry and speed index, showed linear anterior-posterior patterns within the nonlesion group; while the bilateral vocal fold lesion group displayed nonlinear patterns. Patterns in the posterior region of the vocal fold were similar in both groups; while patterns in the anterior region differed. CONCLUSIONS: This study suggests lesions have an effect on the anterior aspect of vocal fold vibratory patterns specifically anterior to the lesions. Age-related differences for males are also evidenced, prompting further investigation of laryngeal development in males and females from childhood to adulthood. This study could serve as a basis for the development of objective clinical measurements of vocal fold vibration in presence of lesions. Further findings could help redefine the theoretical framework of pediatric voice.


Assuntos
Prega Vocal , Voz , Adolescente , Criança , Pré-Escolar , Feminino , Humanos , Masculino , Modalidades de Fisioterapia , Estudos Prospectivos , Vibração , Adulto Jovem
12.
Appl Sci (Basel) ; 11(3)2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-33717604

RESUMO

Investigating the phonatory processes in connected speech from high-speed videoendoscopy (HSV) demands the accurate detection of the vocal fold edges during vibration. The present paper proposes a new spatio-temporal technique to automatically segment vocal fold edges in HSV data during running speech. The HSV data were recorded from a vocally normal adult during a reading of the "Rainbow Passage." The introduced technique was based on an unsupervised machine-learning (ML) approach combined with an active contour modeling (ACM) technique (also known as a hybrid approach). The hybrid method was implemented to capture the edges of vocal folds on different HSV kymograms, extracted at various cross-sections of vocal folds during vibration. The k-means clustering method, an ML approach, was first applied to cluster the kymograms to identify the clustered glottal area and consequently provided an initialized contour for the ACM. The ACM algorithm was then used to precisely detect the glottal edges of the vibrating vocal folds. The developed algorithm was able to accurately track the vocal fold edges across frames with low computational cost and high robustness against image noise. This algorithm offers a fully automated tool for analyzing the vibratory features of vocal folds in connected speech.

13.
Appl Sci (Basel) ; 11(2)2021 Jan 02.
Artigo em Inglês | MEDLINE | ID: mdl-33628469

RESUMO

OBJECTIVE: Calibrated horizontal measurements (e.g., mm) from endoscopic procedures could be utilized for advancement of evidence-based practice and personalized medicine. However, the size of an object in endoscopic images is not readily calibrated and depends on multiple factors, including the distance between the endoscope and the target surface. Additionally, acquired images may have significant non-linear distortion that would further complicate calibrated measurements. This study used a recently developed in-vivo laser-projection fiberoptic laryngoscope and proposes a method for calibrated spatial measurements. METHOD: A set of circular grids were recorded at multiple working distances. A statistical model was trained that would map from pixel length of the object, the working distance, and the spatial location of the target object into its mm length. RESULT: A detailed analysis of the performance of the proposed method is presented. The analyses have shown that the accuracy of the proposed method does not depend on the working distance and length of the target object. The estimated average magnitude of error was 0.27 mm, which is three times lower than the existing alternative. CONCLUSION: The presented method can achieve sub-millimeter accuracy in horizontal measurement. SIGNIFICANCE: Evidence-based practice and personalized medicine could significantly benefit from the proposed method. Implications of the findings for other endoscopic procedures are also discussed.

14.
J Voice ; 35(1): 122-128, 2021 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-31383516

RESUMO

The design specifications and experimental characteristics of a newly developed laser-projection transnasal flexible endoscope coupled with a high-speed videoendoscopy system are provided. The hardware and software design of the proposed system benefits from the combination of structured green light projection and laser triangulation techniques, which provide the capability of calibrated absolute measurements of the laryngeal structures along the horizontal and vertical planes during phonation. Visual inspection of in vivo acquired images demonstrated sharp contrast between laser points and background, confirming successful design of the system. Objective analyses were carried out for assessing the irradiance of the system and the penetration of the green laser light into the red and blue channels in the recorded images. The analysis showed that the system has irradiance of 372 W/m2 at a working distance of 20 mm, which is well within the safety limits, indicating minimal risk of usage of the device on human subjects. Additionally, the color penetration analysis showed that, with probability of 90%, the ratio of contamination of the red channel from the green laser light is less than 0.002. This indicates minimal effect of the laser projection on the measurements performed on the red data channel, making the system applicable for calibrated 3D spatial-temporal segmentation and data-driven subject-specific modeling, which is important for further advancing voice science and clinical voice assessment.


Assuntos
Laringe , Prega Vocal , Humanos , Laringoscopia , Laringe/diagnóstico por imagem , Lasers , Fonação , Vibração , Gravação em Vídeo , Prega Vocal/diagnóstico por imagem
15.
J Voice ; 34(5): 769-782, 2020 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-31005449

RESUMO

OBJECTIVES: A major limitation of comparing the efficacy of videostroboscopy (VS) and high-speed videoendoscopy (HSV) is the lack of an objective reference by which to compare the functional assessment ratings of the two techniques. For patients with vocal fold mass lesions, intraoperative measures of lesion size and depth may serve as this objective reference. This study compared the relationships between the pre- to postoperative change in VS and HSV visual-perceptual ratings to intraoperative measures of lesion size and depth. DESIGN: Prospective visual-perceptual study with intraoperative measures of lesion size and depth. METHODS: VS and HSV samples were obtained preoperatively and postoperatively from 28 patients with vocal fold lesions and from 17 vocally healthy controls. Two experienced clinicians rated amplitude, mucosal wave, vertical phase difference, left-right phase asymmetry, and vocal fold edge on a visual-analog scale using both imaging techniques. The change in perioperative ratings from VS and HSV was compared between groups and correlated to intraoperative measures of lesion size and depth. RESULTS: HSV was as reliable as VS for ratings of amplitude and edge, and substantially more reliable for ratings of mucosal wave and left-right phase asymmetry. Both VS and HSV had mild-moderate correlations between change in perioperative ratings and intraoperative measures of lesion area. Change in function could be obtained in more patients and for more parameters using HSV than VS. Group differences were noted for postoperative ratings of amplitude and edge; however, these differences were within one level of the visual-perceptual rating scale. The presence of asynchronicity in VS recordings renders vibratory features either uninterpretable or potentially distorted and thus should not be rated. CONCLUSIONS: Amplitude and edge are robust vibratory measures for perioperative functional assessment, regardless of imaging modality. HSV is indicated for evaluation of subepithelial lesions or if asynchronicity is present in the VS image sequence.


Assuntos
Doenças da Laringe , Prega Vocal , Humanos , Doenças da Laringe/diagnóstico por imagem , Doenças da Laringe/cirurgia , Laringoscopia , Fonação , Estudos Prospectivos , Estroboscopia , Vibração , Gravação em Vídeo , Prega Vocal/diagnóstico por imagem , Prega Vocal/cirurgia
16.
J Voice ; 34(6): 847-861, 2020 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-31151853

RESUMO

The ability to provide absolute calibrated measurement of the laryngeal structures during phonation is of paramount importance to voice science and clinical practice. Calibrated three-dimensional measurement could provide essential information for modeling purposes, for studying the developmental aspects of vocal fold vibration, for refining functional voice assessment and treatment outcomes evaluation, and for more accurate staging and grading of laryngeal disease. Recently, a laser-calibrated transnasal fiberoptic endoscope compatible with high-speed videoendoscopy (HSV) and capable of providing three-dimensional measurements was developed. The optical principle employed is to project a grid of 7 × 7 green laser points across the field of view (FOV) at an angle relative to the imaging axis, such that (after calibration) the position of each laser point within the FOV encodes the vertical distance from the tip of the endoscope to the laryngeal tissues. The purpose of this study was to develop a precise method for vertical calibration of the endoscope. Investigating the position of the laser points showed that, besides the vertical distance, they also depend on the parameters of the lens coupler, including the FOV position within the image frame and the rotation angle of the endoscope. The presented automatic calibration method was developed to compensate for the effect of these parameters. Statistical image processing and pattern recognition were used to detect the FOV, the center of FOV, and the fiducial marker. This step normalizes the HSV frames to a standard coordinate system and removes the dependence of the laser-point positions on the parameters of the lens coupler. Then, using a statistical learning technique, a calibration protocol was developed to model the trajectories of all laser points as the working distance was varied. Finally, a set of experiments was conducted to measure the accuracy and reliability of every step of the procedure. The system was able to measure absolute vertical distance with mean percent error in the range of 1.7% to 4.7%, depending on the working distance.


Assuntos
Fonação , Prega Vocal , Calibragem , Humanos , Lasers , Reprodutibilidade dos Testes , Prega Vocal/diagnóstico por imagem
17.
J Voice ; 32(2): 216-220, 2018 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-28596101

RESUMO

OBJECTIVE: This study aimed to assess the utility of laryngeal high-speed videoendoscopy (HSV) as a clinical tool. DESIGN: This is a prospective study of 151 patients. METHODS: A total of 151 adult patients (52 male, 99 female) underwent both videoendoscopy with stroboscopy (videostroboscopy) and HSV examination as part of a routine clinical voice assessment. At the time of the examination, ratings for videostroboscopy were reported in the clinical report. Next, the clinicians reviewed the HSV examination and indicated the changes in ratings of HSV relative to videostroboscopy. Finally, the clinical reports were reviewed by a clinician not involved in data collection or clinical care of the patients and noted differences between videostroboscopy and HSV clinical ratings, and resulting diagnoses were identified and grouped. RESULTS: Ratings of all vibratory features showed change between videostroboscopy and HSV. Mucosal wave and amplitude of vibration showed the largest percentage change, respectively, in 74% and 53% of the reports. They were followed by the features of glottal closure (36%), phase closure (32%), glottal edge (25%), and phase symmetry (21%). Ratings of supraglottic compression and vocal fold vertical level showed the least change between videostroboscopy and HSV. Changes in initial diagnosis owing to the inclusion of HSV were indicated in 7% of the cases. CONCLUSIONS: HSV may be an important laryngeal imaging technique for functional assessment of the pathophysiology of certain voice disorders. HSV could enable important refinements in the diagnosis and management of vocal fold pathology.


Assuntos
Doenças da Laringe/diagnóstico por imagem , Laringoscopia , Fonação , Estroboscopia , Gravação em Vídeo , Prega Vocal/diagnóstico por imagem , Distúrbios da Voz/diagnóstico por imagem , Qualidade da Voz , Adulto , Idoso , Idoso de 80 Anos ou mais , Feminino , Humanos , Doenças da Laringe/patologia , Doenças da Laringe/fisiopatologia , Masculino , Pessoa de Meia-Idade , Valor Preditivo dos Testes , Estudos Prospectivos , Vibração , Prega Vocal/patologia , Prega Vocal/fisiopatologia , Distúrbios da Voz/patologia , Distúrbios da Voz/fisiopatologia , Adulto Jovem
18.
J Voice ; 32(2): 256.e1-256.e12, 2018 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-28647431

RESUMO

OBJECTIVE: This study proposes a gradient-based method for temporal segmentation of laryngeal high-speed videoendoscopy (HSV) data obtained during connected speech. METHODS: A custom-developed HSV system coupled with a flexible fiberoptic nasolaryngoscope was used to record one vocally normal female participant during reading of the "Rainbow Passage." A gradient-based algorithm was developed to generate a motion window. When applied to the HSV data, the motion window acted as a filter tracking the location of the vibrating vocal folds. The glottal area waveform was estimated using a statistical-based image-processing approach. The vocal fold vibratory frequency was computed by an autocorrelation-based extraction of the fundamental frequency (f0) from the glottal area waveform. Temporal segmentation was then performed based on the f0 contour and automatic detection of the epiglottic obstructions. Additionally, visual temporal segmentation was performed by viewing the HSV images frame by frame to determine the time points of the vocalization onsets and offsets, and the epiglottic obstructions of the glottis. RESULTS: The time points resulting from the automatic and visual temporal segmentation methods were cross-validated. The f0-contour patterns of rise and fall resulting from the automatic algorithm were found to be in agreement with the visual inspection of the vibratory frequency change in the HSV data. CONCLUSIONS: This study demonstrated the feasibility of automatic temporal segmentation of HSV imaging of connected speech, which allows for mapping the video content into onsets, offsets, and epiglottic obstructions for each vocalization. Automated analysis of HSV imaging of connected speech has significant clinical potential for advancing instrumental voice assessment protocols.


Assuntos
Interpretação de Imagem Assistida por Computador/métodos , Laringoscopia/métodos , Laringe/anatomia & histologia , Laringe/fisiologia , Fonação , Acústica da Fala , Gravação em Vídeo/métodos , Qualidade da Voz , Adulto , Algoritmos , Automação , Estudos de Viabilidade , Feminino , Humanos , Doenças da Laringe/diagnóstico , Doenças da Laringe/fisiopatologia , Valor Preditivo dos Testes , Reprodutibilidade dos Testes , Fatores de Tempo , Vibração , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/fisiopatologia
19.
J Voice ; 31(2): 254.e11-254.e15, 2017 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-27469450

RESUMO

PURPOSE: Clinicians commonly teach patients alternative clearing behaviors to reduce coughing and hard throat clearing with the assumption that these behaviors clear mucus from the vocal folds. Yet there is limited evidence of the effectiveness of these alternative behaviors at clearing mucus. This study's purpose was to evaluate the efficacy of reducing laryngeal mucus aggregation using alternative approaches in comparison with hard coughing and hard throat clearing in people with and without voice disorders. METHOD: Mucus aggregation of 46 participants, 22 with and 24 without voice disorders, was evaluated from stroboscopy recordings taken before and after each of six clearing behaviors: hard coughing, hard throat clearing, silent coughing, soft throat clearing, dry swallowing, and swallowing with a fluid bolus. Each participant performed each clearing behavior twice. Two trained raters evaluated mucus aggregation for type, thickness, and pooling. RESULTS: Of the six clearing behaviors studied, only hard throat clearing changed vocal fold mucus aggregation. The features of mucus aggregation that were changed by hard throat clearing were the severity of mucus thickness and the presence of type 3 mucus. CONCLUSIONS: Despite the widespread clinical use of alternative clearing behaviors, the results of this study indicate that hard throat clearing is the only clearing behavior to have a significant impact on removing mucus aggregation from the vocal folds. This finding should be further investigated in a larger scale study. If the results of this study are replicated, clinicians should consider changing their use and description of alternative clearing behaviors in clinical practice.


Assuntos
Tosse/fisiopatologia , Deglutição , Ingestão de Líquidos , Muco/metabolismo , Prega Vocal/fisiopatologia , Distúrbios da Voz/terapia , Qualidade da Voz , Adulto , Feminino , Humanos , Laringoscopia , Masculino , Pessoa de Meia-Idade , Estroboscopia , Resultado do Tratamento , Prega Vocal/metabolismo , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/fisiopatologia
20.
Am J Speech Lang Pathol ; 25(4): 576-589, 2016 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-27716854

RESUMO

Purpose: Videostroboscopy (VS) uses an indirect physiological signal to predict the phase of the vocal fold vibratory cycle for sampling. Simulated stroboscopy (SS) extracts the phase of the glottal cycle directly from the changing glottal area in the high-speed videoendoscopy (HSV) image sequence. The purpose of this study is to determine the reliability of SS relative to VS for clinical assessment of vocal fold vibratory function in patients with mass lesions. Methods: VS and SS recordings were obtained from 28 patients with vocal fold mass lesions before and after phonomicrosurgery and 17 controls who were vocally healthy. Two clinicians rated clinically relevant vocal fold vibratory features using both imaging techniques, indicated their internal level of confidence in the accuracy of their ratings, and provided reasons for low or no confidence. Results: SS had fewer asynchronous image sequences than VS. Vibratory outcomes were able to be computed for more patients using SS. In addition, raters demonstrated better interrater reliability and reported equal or higher levels of confidence using SS than VS. Conclusion: Stroboscopic techniques on the basis of extracting the phase directly from the HSV image sequence are more reliable than acoustic-based VS. Findings suggest that SS derived from high-speed videoendoscopy is a promising improvement over current VS systems.


Assuntos
Doenças da Laringe/patologia , Estroboscopia , Prega Vocal/patologia , Humanos , Doenças da Laringe/fisiopatologia , Laringoscopia , Fonação , Reprodutibilidade dos Testes , Gravação em Vídeo , Prega Vocal/fisiopatologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...