Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
1.
Laryngoscope ; 128(10): 2367-2374, 2018 10.
Article in English | MEDLINE | ID: mdl-29536548

ABSTRACT

OBJECTIVE/HYPOTHESIS: Laser-based three-dimensional (3D) imaging allows realistic visualization and absolute measurement of the vocal folds for comprehensive characterization of the oscillation pattern. STUDY DESIGN: Exploratory study with descriptive data analysis in healthy cohort. METHODS: The custom-developed miniaturized measuring device is an endoscopic camera-laser setup for stereo triangulation and suitable for in vivo application. The laser projection unit generates a regular laser grid that is projected on the vocal folds and recorded at 4000 fps using a high-speed camera. Recordings are performed during sustained phonation on 10 healthy subjects. RESULTS: We present absolute values for the lateral and vertical displacement amplitudes and maximum velocities during opening and closing phase. On average over all study participants, the vertical parameters are at least 50% higher than their lateral counterparts. The mean male/female amplitudes are 0.93 mm/0.80 mm in the lateral and 1.28 mm/1.45 mm in the vertical direction. Only 20% of the healthy subjects displayed slight asymmetries in the lateral direction, but 70% showed significant asymmetries in the vertical component. In only 30% of all subjects, the asymmetry trends matched in the lateral and vertical direction. CONCLUSIONS: The study illustrates the additional benefit of 3D imaging: consideration of the vertical component and computation of metric units. New, relevant, and even unexpected information was obtained by the incorporation of the additional vertical dimension. Metric units allow for an absolute evaluation of anatomy and dynamics and therefore enables cross-study comparability. The noninvasive acquisition of quantitative measures for evidence-based medicine has the potential to enhance diagnostic and therapeutic procedures as well as basic medical research. LEVEL OF EVIDENCE: 4. Laryngoscope, 128:2367-2374, 2018.


Subject(s)
Imaging, Three-Dimensional/methods , Laryngoscopy/methods , Lasers , Phonation , Vocal Cords/diagnostic imaging , Adult , Equipment Design , Female , Healthy Volunteers , Humans , Imaging, Three-Dimensional/instrumentation , Laryngoscopy/instrumentation , Male , Middle Aged , Video Recording
2.
Ann Otol Rhinol Laryngol ; 127(2): 89-98, 2018 Feb.
Article in English | MEDLINE | ID: mdl-29199445

ABSTRACT

PURPOSE: The aim of this study was to investigate the auditory feedback mechanisms and voice quality during phonation in response to a spontaneous pitch change in the auditory feedback. Does the pitch shift reflex (PSR) change voice pitch and voice quality? Quantitative and qualitative voice characteristics were analyzed during the PSR. METHOD: Twenty-eight healthy subjects underwent transnasal high-speed video endoscopy (HSV) at 8000 fps during sustained phonation [a]. While phonating, the subjects heard their sound pitched up for 700 cents (interval of a fifth), lasting 300 milliseconds in their auditory feedback. The electroencephalography (EEG), acoustic voice signal, electroglottography (EGG), and high-speed-videoendoscopy (HSV) were analyzed to compare feedback mechanisms for the pitched and unpitched condition of the phonation paradigm statistically. Furthermore, quantitative and qualitative voice characteristics were analyzed. RESULTS: The PSR was successfully detected within all signals of the experimental tools (EEG, EGG, acoustic voice signal, HSV). A significant increase of the perturbation measures and an increase of the values of the acoustic parameters during the PSR were observed, especially for the audio signal. CONCLUSIONS: The auditory feedback mechanism seems not only to control for voice pitch but also for voice quality aspects.


Subject(s)
Feedback, Sensory , Phonation , Pitch Perception , Speech Acoustics , Voice Quality , Adult , Electroencephalography , Female , Humans , Laryngoscopy , Male , Video Recording , Young Adult
3.
Comput Math Methods Med ; 2016: 8469139, 2016.
Article in English | MEDLINE | ID: mdl-27066108

ABSTRACT

The human voice originates from oscillations of the vocal folds in the larynx. The duration of the voice onset (VO), called the voice onset time (VOT), is currently under investigation as a clinical indicator for correct laryngeal functionality. Different analytical approaches for computing the VOT based on endoscopic imaging were compared to determine the most reliable method to quantify automatically the transient vocal fold oscillations during VO. Transnasal endoscopic imaging in combination with a high-speed camera (8000 fps) was applied to visualize the phonation onset process. Two different definitions of VO interval were investigated. Six analytical functions were tested that approximate the envelope of the filtered or unfiltered glottal area waveform (GAW) during phonation onset. A total of 126 recordings from nine healthy males and 210 recordings from 15 healthy females were evaluated. Three criteria were analyzed to determine the most appropriate computation approach: (1) reliability of the fit function for a correct approximation of VO; (2) consistency represented by the standard deviation of VOT; and (3) accuracy of the approximation of VO. The results suggest the computation of VOT by a fourth-order polynomial approximation in the interval between 32.2 and 67.8% of the saturation amplitude of the filtered GAW.


Subject(s)
Larynx/physiology , Phonation/physiology , Vocal Cords/physiology , Adult , Communication , Endoscopy , Female , Glottis/metabolism , Humans , Image Processing, Computer-Assisted , Male , Models, Theoretical , Reproducibility of Results , Vibration , Video Recording , Voice , Young Adult
4.
IEEE Trans Med Imaging ; 35(7): 1615-24, 2016 07.
Article in English | MEDLINE | ID: mdl-26829782

ABSTRACT

Standard laryngoscopic imaging techniques provide only limited two-dimensional insights into the vocal fold vibrations not taking the vertical component into account. However, previous experiments have shown a significant vertical component in the vibration of the vocal folds. We present a 3D reconstruction of the entire superior vocal fold surface from 2D high-speed videoendoscopy via stereo triangulation. In a typical camera-laser set-up the structured laser light pattern is projected on the vocal folds and captured at 4000 fps. The measuring device is suitable for in vivo application since the external dimensions of the miniaturized set-up barely exceed the size of a standard rigid laryngoscope. We provide a conservative estimate on the resulting resolution based on the hardware components and point out the possibilities and limitations of the miniaturized camera-laser set-up. In addition to the 3D vocal fold surface, we extended previous approaches with a G2-continuous model of the vocal fold edge. The clinical applicability was successfully established by the reconstruction of visual data acquired from 2D in vivo high-speed recordings of a female and a male subject. We present extracted dynamic parameters like maximum amplitude and velocity in the vertical direction. The additional vertical component reveals deeper insights into the vibratory dynamics of the vocal folds by means of a non-invasive method. The successful miniaturization allows for in vivo application giving access to the most realistic model available and hence enables a comprehensive understanding of the human phonation process.


Subject(s)
Larynx , Female , Humans , Imaging, Three-Dimensional , Laryngoscopy , Male , Phonation , Vibration , Vocal Cords
5.
J Voice ; 29(4): 517.e11-21, 2015 Jul.
Article in English | MEDLINE | ID: mdl-25944290

ABSTRACT

OBJECTIVE: Within the functional assessment of voice disorders, an objective analysis of measured parameters from audio, electroglottographic (EGG), or visual signals is desired. In a typical clinical situation, reliable objective analysis is not always possible due to missing standardization and unknown stability of the clinical parameters. The aim of this study was to investigate the robustness/stability of measured clinical parameters of the audio and EGG signals in a typical clinical setting to ensure a reliable objective analysis. In particular, the influence of F0 and of the sequence length on several definitions of jitter and shimmer will be analyzed. PATIENTS AND METHODS: Seventy-four young healthy women produced a sustained vowel /a/ and an upward triad with abrupt changeovers. Different sequence lengths (100, 150, 500, and 1000 ms) of sustained phonation and triads (100 and 150 ms) were extracted from the audio and EGG signals. In total, six variations of jitter and four variations of shimmer parameters were analyzed. RESULTS: Jitter%, Jitter11p, and JitterPPQ of the audio signal as well as Jittermean, Shimmer, and Shimmer11p of the EGG signal are unaffected by both sequence length and F0. CONCLUSIONS: Influence of F0 and sequence length on several perturbation measures of the audio and EGG signals was identified. For an objective clinical voice assessment, unaffected definitions of jitter and shimmer should be preferred and applied to enable comparability between different recordings, examinations, and studies.


Subject(s)
Speech Acoustics , Voice Disorders/diagnosis , Adolescent , Adult , Female , Healthy Volunteers , Humans , Voice , Young Adult
6.
Biomech Model Mechanobiol ; 14(1): 169-84, 2015 Jan.
Article in English | MEDLINE | ID: mdl-24861998

ABSTRACT

Laryngeal cancer due to, e.g., extensive smoking and/or alcohol consumption can necessitate the excision of the entire larynx. After such a total laryngectomy, the voice generating structures are lost and with that the quality of life of the concerning patients is drastically reduced. However, the vibrations of the remaining tissue in the so called pharyngoesophageal (PE) segment can be applied as alternative sound generator. Tissue, scar, and geometric aspects of the PE-segment determine the postoperative substitute voice characteristic, being highly important for the future live of the patient. So far, PE-dynamics are simulated by a biomechanical model which is restricted to stationary vibrations, i.e., variations in pitch and amplitude cannot be handled. In order to investigate the dynamical range of PE-vibrations, knowledge about the temporal processes during substitute voice production is of crucial interest. Thus, time-dependent model parameters are suggested in order to quantify non-stationary PE-vibrations and drawing conclusions on the temporal characteristics of tissue stiffness, oscillating mass, pressure, and geometric distributions within the PE-segment. To adapt the numerical model to the PE-vibrations, an automatic, block-based optimization procedure is applied, comprising a combined global and local optimization approach. The suggested optimization procedure is validated with 75 synthetic data sets, simulating non-stationary oscillations of differently shaped PE-segments. The application to four high-speed recordings is shown and discussed. The correlation between model and PE-dynamics is ≥ 97%.


Subject(s)
Esophagus/physiopathology , Esophagus/surgery , Laryngectomy , Larynx/physiopathology , Larynx/surgery , Models, Biological , Computer Simulation , Female , Humans , Male , Middle Aged , Oscillometry/methods , Time Factors , Treatment Outcome , Vibration
7.
J Speech Lang Hear Res ; 57(4): 1148-61, 2014 Aug.
Article in English | MEDLINE | ID: mdl-24686496

ABSTRACT

PURPOSE: The aim of this study was to identify parameters that would differentiate healthy from pathological organic-based vocal fold vibrations to emphasize clinical usefulness of high-speed imaging. METHOD: Fifty-five men (M age = 36 years, SD = 20 years) were examined and separated into 4 groups: 1 healthy (26 individuals) and 3 pathological (10 individuals with contact granuloma, 12 with polyps, and 7 with cysts). Vocal fold vibrations were recorded using a high-speed camera during sustained phonation. Twenty objective glottal area waveform and 24 phonovibrogram parameters representing spatiotemporal characteristics were analyzed. Statistical group comparisons were performed to document spatiotemporal changes for organic lesions that cannot be determined visually. To look for specific pattern profiles within organic lesions, the authors performed linear discriminant analysis. RESULTS: Thirteen parameters showed significant differences between the healthy group and at least 1 pathological group. The differences occurred more in temporal than in spatial parameters. Contact granuloma showed the fewest statistical differences (3 parameters), followed by cysts (9 parameters), and polyps (10 parameters). Linear discriminant analysis achieved accuracy performance of 76% (all groups separated) and 82% (healthy vs. pathological). CONCLUSION: The results suggest that for males, the differences between healthy voices and organic voice disorders may be more pronounced within temporal characteristics that cannot be visually detected without high-speed imaging.


Subject(s)
Image Processing, Computer-Assisted/methods , Laryngeal Diseases/diagnostic imaging , Laryngoscopy/methods , Vocal Cords/diagnostic imaging , Adult , Glottis/diagnostic imaging , Granuloma, Laryngeal/diagnostic imaging , Granuloma, Laryngeal/physiopathology , Humans , Laryngeal Diseases/physiopathology , Male , Middle Aged , Phonation/physiology , Polyps/diagnostic imaging , Spatio-Temporal Analysis , Vibration , Video Recording , Vocal Cords/physiopathology , Voice Disorders/diagnostic imaging , Voice Disorders/physiopathology
8.
Laryngoscope ; 123(7): 1686-93, 2013 Jul.
Article in English | MEDLINE | ID: mdl-23649746

ABSTRACT

OBJECTIVES/HYPOTHESIS: Quantitative analysis of endoscopic high-speed video recordings of vocal fold vibrations has been growing in importance in recent years. The videos have mainly been analyzed using subjective evaluation, but this is examiner dependent, and the results show inadequate interobserver agreement. The aims of this study were therefore to identify appropriate objective parameters for analyzing high-speed recordings to differentiate healthy voice production from organic disorders. METHODS: A total of 152 females were examined, divided into 77 healthy and 75 with four different pathological conditions: laryngeal epithelial thickening, Reinke edema, vocal fold polyps, and vocal fold cysts. Vocal fold vibrations were recorded with a high-speed camera (4,000 Hz, 256 × 256 pixels) during sustained phonation. Parameters computed from the glottal area waveform (GAW) and from phonovibrogram (PVG) were analyzed. Multiparametric linear discriminant analysis was performed to classify pathological conditions versus the healthy group. RESULTS: Twenty of 44 parameters were identified that are capable of distinguishing between the individual types of pathology. PVG parameters showed better performance than GAW parameters. Parameters representing vibrational periodicity via standard deviation showed better performance than absolute parameters. In addition, linear discriminant analysis achieved reliable differentiation between healthy and pathological vocal fold vibrations: 72% for the five-class problem (all groups separately) and 88% for the two-class problem (healthy vs. all pathologies taken as one class). CONCLUSIONS: The study succeeded in defining objective parameters for analyzing endoscopic high-speed videos and suggesting first parameters for differentiation between healthy dynamics and dynamics of organic pathologies.


Subject(s)
Laryngeal Diseases/diagnosis , Laryngoscopy/instrumentation , Video Recording , Vocal Cords/pathology , Adult , Case-Control Studies , Discriminant Analysis , Female , Humans , Laryngeal Diseases/pathology , Middle Aged , Phonation , Statistics, Nonparametric , Vibration
9.
Logoped Phoniatr Vocol ; 38(1): 1-10, 2013 Apr.
Article in English | MEDLINE | ID: mdl-22414332

ABSTRACT

PURPOSE: Digit span and sentence repetition are identified as potential markers for specific language impairment (SLI). We investigated if language learning of bilingual children with suspected language impairment (biSLI) was also influenced and led by memory constraints. METHOD: In a retrospective study, 19 children with SLI and 25 controls (ages 4;9-5;9), as well as 15 biSLI children and 14 controls (ages 5;1-8;9) were compared with regard to their performance on a digit span and sentence repetition task. RESULTS: Both groups with language impairment (SLI/biSLI) showed reduced performance on both tasks. Sentence repetition predicted language comprehension, and the digit span task predicted grammar abilities of the SLI, biSLI, and their controls. CONCLUSION: Sentence repetition and short-term memory provide information on language comprehension and grammar abilities in language-impaired mono- and bilingual children and confirm their function as SLI markers.


Subject(s)
Child Behavior , Child Language , Language Disorders/diagnosis , Multilingualism , Child , Child, Preschool , Comprehension , Female , Humans , Infant , Language , Language Disorders/psychology , Language Tests , Linear Models , Male , Memory, Short-Term , Retrospective Studies , Vocabulary
10.
J Voice ; 26(6): 726-33, 2012 Nov.
Article in English | MEDLINE | ID: mdl-22632795

ABSTRACT

SUMMARY: Acoustic and endoscopic voice assessments are routinely performed to determine the vocal fold vibratory function as part of the voice assessment protocol in clinics. More often than not these data are separately recorded, resulting in information being obtained from two different phonation segments and an increase of time for the voice evaluation process. This study explores the use of acoustic data, simultaneously recorded during high-speed endoscopy (HSE), for the evaluation of vocal fold function. PATIENTS AND METHODS: HSE and acoustic data were recorded from the subjects simultaneously during sustained phonation. The data included voices of 73 healthy subjects, 148 paresis, 210 functional dysphonias, and 119 benign lesions of vocal folds. For this study, only acoustic data were analyzed using Dr. Speech software (Tiger electronics Inc., MA). Twelve parameters were computed; 82% of the acoustic voice recordings could be analyzed. Statistical analysis was performed with SPSS 17.0. RESULTS: Acoustic data was easily recorded simultaneously allowing analyses of the same phonation segment to determine vocal fold function and therefore eliminating the need for another voice recording. The acoustic voice parameters differed between genders in the healthy voice group. Most of the parameters showed significant differences between healthy and pathological groups. CONCLUSION: Simultaneously recorded endoscopic and acoustic data is valuable. Differentiation between healthy and pathological groups was possible using acoustic data only. We suggest that the synchronously recorded acoustic signal is of sufficient quality for objective analysis yielding reduced examination time.


Subject(s)
Acoustics , Dysphonia/diagnosis , Laryngoscopy , Phonation , Speech Production Measurement , Vocal Cords/physiopathology , Voice Quality , Adolescent , Adult , Aged , Biomechanical Phenomena , Case-Control Studies , Discriminant Analysis , Dysphonia/pathology , Dysphonia/physiopathology , Factor Analysis, Statistical , Female , Humans , Linear Models , Male , Middle Aged , Predictive Value of Tests , Signal Processing, Computer-Assisted , Time Factors , Vibration , Vocal Cord Paralysis/diagnosis , Vocal Cord Paralysis/pathology , Vocal Cord Paralysis/physiopathology , Vocal Cords/pathology , Young Adult
SELECTION OF CITATIONS
SEARCH DETAIL
...