Results 1 - 11 of 11
1.
J Voice ; 2023 Oct 06.
Article in English | MEDLINE | ID: mdl-37806902

ABSTRACT

INTRODUCTION: Singers use a whistle register to sing at a fundamental frequency above 1000 Hz. In previous studies, vocal fold vibrations with or without complete closure and partial vocal fold vibrations were observed depending on the subject. However, the production mechanism of the whistle register is not yet clearly understood because of the limitations of the imaging device for the glottis and subjects. OBJECTIVES: This study aims to examine vocal fold vibrations in a whistle register. METHODS: The dynamic behavior of the glottis was recorded for six singers (four females and two males) using a high-speed digital imaging device with a frame rate above 10,000 fps. Audio signals were recorded simultaneously. The data were analyzed in the form of topography, glottal area waveforms, spectrograms, and phonovibrography to examine spatiotemporal patterns of glottal motion. RESULTS: The vibratory motion of the vocal folds was classified into six patterns. In the first pattern, the entire vocal fold vibrated with complete closure during the closed phase. In the second to fifth patterns, the entire vocal fold vibrated without complete closure: a gap was observed along the full length of the vocal folds in the second, at the posterior part of the glottis in the third, at the anterior part in the fourth, and at both ends in the fifth. In the sixth pattern, the vocal folds vibrated partially. Our results support the previous findings on the vibration of the vocal folds. In addition, we identified novel vibratory patterns in the vocal folds. CONCLUSION: We conclude that the production of the whistle register is not just an extension of the falsetto register to the higher fundamental-frequency region; rather, the production mechanism of the whistle register appeared to be diverse as a means of vocalization.

2.
J Voice ; 37(2): 298.e11-298.e29, 2023 Mar.
Article in English | MEDLINE | ID: mdl-33518476

ABSTRACT

INTRODUCTION: Typical singing registers are the chest and falsetto; however, trained singers have an additional register, namely, the mixed register. The mixed register, which is also called "mixed voice" or "mix," is an important technique for singers, as it can help bridge from the chest voice to falsetto without noticeable voice breaks. OBJECTIVE: The present study aims to reveal the nature of the voice-production mechanism of the different registers (chest, mix, and falsetto) using high-speed digital imaging (HSDI), electroglottography (EGG), and acoustic and aerodynamic measurements. STUDY DESIGN: Cross-sectional study. METHODS: Aerodynamic measurements were acquired for twelve healthy singers (six men and six women) during the phonation of a variety of pitches using three registers. HSDI and EGG devices were simultaneously used on three healthy singers (two men and one woman), from which an open quotient (OQ) and a speed quotient (SQ) were obtained. Audio signals were recorded for five sustained vowels, and a spectral analysis was conducted to determine the amplitude of each harmonic component. Furthermore, the absolute (not relative) value of the glottal volume flow was estimated by integrating data obtained from the HSDI and aerodynamic studies. RESULTS: For all singers, the subglottal pressure (PSub) was the highest for the chest among the three registers, and the mean flow rate (MFR) was the highest for the falsetto. Conversely, the PSub of the mix was as low as that of the falsetto, and the MFR of the mix was as low as that of the chest. The HSDI analysis showed that the OQ differed significantly among the registers, even when the fundamental frequency was the same; the OQ of the mix was higher than that of the chest but lower than that of the falsetto. The acoustic analysis showed that, for the mix, the harmonic structure was intermediate between the chest and falsetto. The results of the glottal volume-flow analysis revealed that the maximum volume velocity was the least for the mix register at every fundamental frequency. The first and second harmonic (H1-H2) difference of the voice source spectrum was the greatest for the falsetto, then the mix, and finally, the chest. CONCLUSIONS: We found differences in the registers in terms of the aeromechanical mechanisms and vibration patterns of the vocal folds. The mixed register proved to have a distinct voice-production mechanism, which can be differentiated from those of the chest or falsetto registers.
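
Note on OQ and SQ: the abstract does not say how the quotients were extracted from the recordings. The sketch below uses the usual textbook definitions (OQ = open time / cycle period, SQ = opening time / closing time) on one cycle of glottal-area samples. The function name, the zero-area threshold, and the sample-counting convention are assumptions made for illustration, not the authors' procedure.

```python
def open_and_speed_quotient(area_cycle, threshold=0.0):
    """Return (OQ, SQ) for one vibratory cycle of glottal-area samples.

    Hypothetical helper: the threshold and counting convention are
    illustrative assumptions, not taken from the cited study.
    """
    open_indices = [i for i, area in enumerate(area_cycle) if area > threshold]
    if not open_indices:
        raise ValueError("the glottis never opens in this cycle")
    first_open, last_open = open_indices[0], open_indices[-1]
    # Sample with maximum area separates the opening and closing phases.
    peak = max(range(first_open, last_open + 1), key=lambda i: area_cycle[i])
    open_samples = last_open - first_open + 1
    opening_samples = peak - first_open + 1  # up to and including the peak
    closing_samples = last_open - peak       # samples after the peak
    if closing_samples == 0:
        raise ValueError("no closing phase found in this cycle")
    open_quotient = open_samples / len(area_cycle)
    speed_quotient = opening_samples / closing_samples
    return open_quotient, speed_quotient


# Made-up cycle of 10 frames (arbitrary area units).
cycle = [0.0, 0.0, 1.0, 3.0, 4.0, 3.0, 2.0, 1.0, 0.0, 0.0]
oq, sq = open_and_speed_quotient(cycle)
```

With real high-speed recordings the threshold would need to account for segmentation noise, and an incompletely closing glottis would require a different definition of the open phase.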


Subjects
Vocal Cords, Voice Quality, Male, Humans, Female, Cross-Sectional Studies, Phonation, Glottis
3.
J Voice ; 2022 Nov 24.
Article in English | MEDLINE | ID: mdl-36437171

ABSTRACT

OBJECTIVES: Auditory-perceptual evaluation frameworks, such as the grade-roughness-breathiness-asthenia-strain (GRBAS) scale, are the gold standard for the quantitative evaluation of pathological voice quality. However, the evaluation is subjective; thus, the ratings lack reproducibility due to inter- and intra-rater variation. Prior researchers have proposed deep-learning-based automatic GRBAS score estimation to address this problem. However, these methods require large amounts of labeled voice data. Therefore, this study investigates the potential of automatic GRBAS estimation using deep learning with smaller amounts of data. METHODS: A dataset consisting of 300 pathological sustained /a/ vowel samples was created and rated by eight experts (200 for training, 50 for validation, and 50 for testing). A neural network model that predicts the probability distribution of GRBAS scores from an onset-to-offset waveform was proposed. Random speed perturbation, random crop, and frequency masking were investigated as data augmentation techniques, and power, instantaneous frequency, and group delay were investigated as time-frequency representations. RESULTS: Five-fold cross-validation was conducted, and the automatic scoring performance was evaluated using the quadratic weighted Cohen's kappa. The results showed that the kappa values of the automatic scoring performance were comparable to those of the inter-rater reliability of experts for all GRBAS items and the intra-rater reliability of experts for items G, B, A, and S. Random speed perturbation was the most effective data augmentation technique overall. When data augmentation was applied, power was the most effective for items G, R, A, and S; for Item B, combining group delay and power yielded additional performance gains. CONCLUSION: The automatic GRBAS scoring achieved by the proposed model using scant labeled data was comparable to that of experts. 
This suggests that the challenges resulting from insufficient data can be alleviated. The findings of this study can also contribute to performance improvements in other tasks such as automatic voice disorder detection.
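
Note on the evaluation metric: quadratic weighted Cohen's kappa penalizes disagreements by the squared distance between ordinal scores. The sketch below is a plain-Python version of the standard formula, assuming each GRBAS item is scored on the usual four-level scale (0 to 3). The function name and the handling of the degenerate case are illustrative choices, not the authors' code.

```python
def quadratic_weighted_kappa(rater_a, rater_b, num_levels=4):
    """Cohen's kappa with quadratic disagreement weights for ordinal scores."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("need two equally long, non-empty score lists")
    n = len(rater_a)
    # Joint counts: observed[i][j] = samples scored i by A and j by B.
    observed = [[0] * num_levels for _ in range(num_levels)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1
    hist_a = [sum(row) for row in observed]
    hist_b = [sum(observed[i][j] for i in range(num_levels))
              for j in range(num_levels)]
    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(num_levels):
        for j in range(num_levels):
            weight = (i - j) ** 2 / (num_levels - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count under independent raters with the same marginals.
            weighted_expected += weight * hist_a[i] * hist_b[j] / n
    if weighted_expected == 0:
        # Both raters used one identical level throughout; kappa is
        # undefined there, and returning 1.0 is an assumption of this sketch.
        return 1.0
    return 1.0 - weighted_observed / weighted_expected


# Made-up scores for one GRBAS item, rated by an expert and by a model.
expert = [0, 1, 2, 3, 2, 1]
model = [0, 1, 2, 2, 2, 1]
kappa = quadratic_weighted_kappa(expert, model)
```

A value of 1 means perfect agreement, 0 means chance-level agreement, and negative values mean systematic disagreement.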

4.
J Acoust Soc Am ; 143(3): 1514, 2018 Mar.
Article in English | MEDLINE | ID: mdl-29604699

ABSTRACT

Higher-order frequency locking of an organ pipe was investigated in terms of relationships between the locking phenomena and the harmonics of the pipe sound and an external force acting on the pipe. The authors first modeled the pipe as a phase oscillator of the kind used in synchronization theory and predicted frequency ratios that can cause frequency locking. The authors then forced an actual pipe using a pure tone with frequency ratios of 1 : 1, 1 : 2, 2 : 1, 1 : 3, and 2 : 3. In addition, experiments were conducted using complex tones to investigate effects of higher harmonics of the external force on frequency locking. As a result, frequency locking occurred only at frequency ratios of 1 : 1, 1 : 2, and 1 : 3 in the case of the pure tone, in agreement with the prediction of synchronization theory. For the complex tone, the authors succeeded in inducing 2 : 1 locking. The results show that the frequency of a harmonic component of the external force was close to that of the pipe sound when frequency locking occurred. Frequency locking of an organ pipe was therefore enhanced through the proximity of the harmonics of the pipe sound and the external force.
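
Note on the phase-oscillator picture: the abstract does not give the authors' equations. As a minimal illustration of frequency locking in synchronization theory, the sketch below integrates the textbook Adler-type equation dphi/dt = detuning - coupling * sin(phi) for the phase difference between an oscillator and its forcing. It covers only the simplest 1 : 1 case; the function name, parameter values, and the forward-Euler integrator are assumptions of this sketch.

```python
import math


def mean_phase_drift(detuning, coupling, duration=200.0, dt=0.001):
    """Average drift rate (rad/s) of the phase difference phi.

    Integrates dphi/dt = detuning - coupling * sin(phi) with forward Euler
    and averages over the second half of the run to skip the transient.
    A drift near zero means the oscillator is locked to the forcing.
    """
    steps = int(duration / dt)
    half = steps // 2
    phi = 0.0
    phi_at_half = 0.0
    for step in range(steps):
        if step == half:
            phi_at_half = phi
        phi += dt * (detuning - coupling * math.sin(phi))
    return (phi - phi_at_half) / ((steps - half) * dt)


locked = mean_phase_drift(detuning=0.5, coupling=1.0)    # |detuning| <= coupling
drifting = mean_phase_drift(detuning=2.0, coupling=1.0)  # |detuning| > coupling
```

In this model locking occurs when the magnitude of the detuning does not exceed the coupling strength; outside that range the phase difference drifts at a mean rate of sqrt(detuning^2 - coupling^2).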

5.
J Acoust Soc Am ; 130(1): 536-45, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21786919

ABSTRACT

A methodological study is presented to examine the acoustic role of the vocal tract in playing the trumpet. Preliminary results obtained for one professional player are also shown to demonstrate the effectiveness of the method. Images of the vocal tract with a resolution of 0.5 mm (2 mm in thickness) were recorded with magnetic resonance imaging to observe the tongue posture and estimate the vocal-tract area function during actual performance. The input impedance was then calculated for the player's air column including both the supra- and subglottal tracts using an acoustic tube model including the effect of wall losses. Finally, a time-domain blowing simulation by Adachi and Sato [J. Acoust. Soc. Am. 99, 1200-1209 (1996)] was performed with a model of the lips. In this simulation, the oscillating frequency of the lips was slightly affected by using different shapes of the vocal tract measured for the player. In particular, when the natural frequency of the lips was gradually increased, the transition to the higher mode occurred at different frequencies for different vocal-tract shapes. Furthermore, simulation results showed that the minimum blowing pressure required to attain the lip oscillation can be reduced by adjusting the vocal-tract shape properly.
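
Note on the acoustic tube model: an input impedance of this kind can be computed by chaining transmission matrices of short cylindrical sections taken from an area function. The sketch below does this for lossless sections; the study itself includes wall losses, which are omitted here. The constants, the function name, and the example area function are assumptions for illustration only.

```python
import math

AIR_DENSITY = 1.2       # kg/m^3, assumed value
SPEED_OF_SOUND = 343.0  # m/s, assumed value


def input_impedance(sections, frequency_hz, load_impedance=0j):
    """Input impedance of concatenated lossless cylindrical tubes.

    sections: list of (length_m, area_m2) ordered from the input end to
    the far end. load_impedance terminates the far end; 0 models an ideal
    open end. Raises ZeroDivisionError exactly at a resonance, because no
    losses are modeled in this simplified sketch.
    """
    wavenumber = 2.0 * math.pi * frequency_hz / SPEED_OF_SOUND
    impedance = load_impedance
    # Walk from the far end back to the input, one section at a time.
    for length, area in reversed(sections):
        z0 = AIR_DENSITY * SPEED_OF_SOUND / area  # characteristic impedance
        cos_kl = math.cos(wavenumber * length)
        sin_kl = math.sin(wavenumber * length)
        impedance = ((impedance * cos_kl + 1j * z0 * sin_kl)
                     / (1j * impedance * sin_kl / z0 + cos_kl))
    return impedance


# Made-up three-section area function (length in m, area in m^2).
tract = [(0.06, 2.0e-4), (0.05, 0.8e-4), (0.06, 4.0e-4)]
z_in = input_impedance(tract, frequency_hz=300.0)
```

For a single uniform tube with an ideal open end this reduces to the familiar j * Z0 * tan(kL), which is a convenient sanity check.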


Subjects
Music, Vocal Cords/physiology, Biomechanical Phenomena, Computer Simulation, Humans, Lip/physiology, Magnetic Resonance Imaging, Anatomic Models, Biological Models, Computer-Assisted Numerical Analysis, Oscillometry, Pilot Projects, Pressure, Time Factors, Tongue/physiology, Torque, Vocal Cords/anatomy & histology
6.
J Acoust Soc Am ; 129(3): 1554-67, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21428519

ABSTRACT

A voice production model is created in this work by considering essential aerodynamic and acoustic phenomena in human voice production. A precise flow analysis is performed based on a boundary-layer approximation and the viscous-inviscid interaction between the boundary layer and the core flow. This flow analysis can supply information on the separation point of the glottal flow and the thickness of the boundary layer, both of which strongly depend on the glottal configuration and yield an effective prediction of the flow behavior. When the flow analysis is combined with the modified two-mass model of the vocal fold [Pelorson et al. (1994). J. Acoust. Soc. Am. 96, 3416-3431], the resulting acoustic wave travels through the vocal tract and a pressure change develops in the vicinity of the glottis. This change can affect the glottal flow and the motion of the vocal folds, causing source-filter coupling. The property of the acoustic feedback is explicitly expressed in the frequency domain by using an acoustic tube model, allowing a clear interpretation of the coupling. Numerical experiments show that the vocal-tract input impedance and frequency responses representing the source-filter coupling have dominant peaks corresponding to the fourth and fifth formants. Results of time-domain simulations also suggest the importance of these high-frequency peaks in voice production.


Subjects
Glottis/physiology, Biological Models, Phonation, Voice, Biomechanical Phenomena, Computer Simulation, Physiological Feedback, Glottis/anatomy & histology, Humans, Computer-Assisted Numerical Analysis, Oscillometry, Pressure, Rheology, Time Factors, Vocal Cords/physiology
7.
J Acoust Soc Am ; 125(1): 391-404, 2009 Jan.
Article in English | MEDLINE | ID: mdl-19173426

ABSTRACT

The behavior of glottal flow can, to a large extent, be characterized by development and separation of the boundary layer. The point of flow separation is known to vary during the phonatory cycle due to change in channel configuration. To take the movable nature of the separation point into account, the boundary-layer equation is solved numerically, and the values of the characteristic quantities are determined as well as the point of separation. Development of the boundary layer in general reduces the effective size of the channel, and, therefore, increases the core flow velocity, which, in turn provides the boundary condition of the boundary-layer equation. The interaction between the viscous (boundary layer) and inviscid (core flow) parts of the glottal flow is, therefore, strongly indicated. To apply this viscous-inviscid interaction, the expression of the core flow is derived for a two-dimensional flow field, and is solved jointly with the boundary-layer equation. Numerical results are shown to examine the effect of the Reynolds number and glottal configuration, with special emphasis on the comparison of flow models developed for one- and two-dimensional flow fields. Numerical results are also quantitatively compared with data obtained from flow measurement experiments.


Subjects
Glottis/physiology, Phonation/physiology, Humans, Biological Models, Vocal Cords/physiology
8.
J Acoust Soc Am ; 121(2): 745-8, 2007 Feb.
Article in English | MEDLINE | ID: mdl-17348497

ABSTRACT

To analyze the characteristics of voice source signals from speech, a model is presented in the form of polynomial function by expanding the definition of the Rosenberg model. In combination with the all-pole assumption of the vocal-tract filter, methods are described for the pitch-synchronous speech analysis and temporal search of the glottal opening and closing instants. Because the source and filter models are both linear, the parameter estimation problem can be conveniently solved. In addition, the temporal search method can refine the locations of the glottal events and improve the accuracy of the parameter estimation. Analyses of non-nasalized voiced speech are conducted using an electroglottographic device from which the initial estimate of the temporal information is given.
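
Note on the Rosenberg model: the abstract does not spell out the expanded polynomial source model. For orientation, the sketch below implements the classic polynomial Rosenberg glottal pulse that such models start from: a cubic rise during the opening phase and a quadratic fall during the closing phase. The function name and the timing values in the example are assumptions, and this is the baseline pulse, not the authors' extension.

```python
def rosenberg_pulse(t, opening_time, closing_time):
    """Classic polynomial Rosenberg glottal pulse with unit peak amplitude.

    t is the time since glottal opening, in the same unit as
    opening_time and closing_time. Returns 0 outside the open phase.
    """
    if 0.0 <= t <= opening_time:
        x = t / opening_time
        return 3.0 * x * x - 2.0 * x ** 3  # smooth rise from 0 to 1
    if opening_time < t <= opening_time + closing_time:
        x = (t - opening_time) / closing_time
        return 1.0 - x * x                 # fall from 1 back to 0
    return 0.0


# One 10 ms period sampled every 0.5 ms (4 ms opening, 2 ms closing).
period = [rosenberg_pulse(0.5 * n, 4.0, 2.0) for n in range(20)]
```

Because each phase is a polynomial in time, the pulse is linear in its coefficients, which is the property that keeps parameter estimation tractable when such a source is paired with an all-pole vocal-tract filter.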


Subjects
Statistical Models, Phonetics, Sound Localization, Speech Acoustics, Speech Perception, Voice Quality, Electrodiagnosis/instrumentation, Filtration, Glottis/physiology, Humans, Linear Models, Computer-Assisted Signal Processing/instrumentation, Software, Sound Localization/physiology, Sound Spectrography/instrumentation, Speech Perception/physiology, Speech Production Measurement/instrumentation, Voice Quality/physiology
9.
J Acoust Soc Am ; 121(1): 456-68, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17297800

ABSTRACT

A method for synthesizing vocal-tract spectra from phoneme sequences by mimicking the speech production process of humans is presented. The model consists of four main processes and is particularly characterized by an adaptive formation of articulatory movements. First, our model determines the time when each phoneme is articulated. Next, it generates articulatory constraints that must be met for the production of each phoneme, and then it generates trajectories of the articulatory movements that satisfy the constraints. Finally, the time sequence of spectra is estimated from the produced articulatory trajectories. The articulatory constraint of each phoneme does not change with the phonemic context, but the contextual variability of speech is reproduced because of the dynamic articulatory model. The accuracy of the synthesis model was evaluated using data collected by the simultaneous measurement of speech and articulatory movements. The accuracy of the phonemic timing estimates was measured, and the synthesized results were compared with the measured results. Experimental results showed that the model captured the contextual variability of both the articulatory movements and speech acoustics.


Subjects
Biological Models, Phonetics, Speech Acoustics, Vocal Cords/physiology, Humans, Speech
10.
J Acoust Soc Am ; 118(1): 428-43, 2005 Jul.
Article in English | MEDLINE | ID: mdl-16119363

ABSTRACT

A measurement principle of the three-dimensional electromagnetic articulographic device is presented. The state of the miniature receiver coil is described by five variables representing the position in the three-dimensional coordinate system and the rotation angles relative to it. When the receiver coil is placed in the magnetic field produced from the distributed transmitter coils, its state can be optimally estimated by minimizing the difference between the measured strength of the received signal and the predicted one using the known spatial pattern of the magnetic field. Therefore, the design and calibration of the field function inherently determine the accuracy in estimating the state of the receiver coil. The field function in our method is expressed in the form of a multivariate B spline as a function of position in the three-dimensional space. Because of the piecewise property of the basis function and the freedom in the selection of the rank and the number of basis functions, the spline field function has a superior ability to flexibly and accurately represent the actual magnetic field. Given a set of calibration data, the spline function is designed to form a smooth curved surface interpolating all of these data samples. Then, an iterative procedure is employed to solve the nonlinear estimation problem of the receiver state variables. Because the spline basis function is a polynomial, it is also shown that the calculation of the Jacobian or Hessian required to obtain updated quantities for the state variables can be efficiently performed. Finally, experimental results reveal that the measurement accuracy is about 0.2 mm for a preliminary condition, indicating that the method can achieve the degree of precision required for observing articulatory movements in a three-dimensional space. 
It is also experimentally shown that the Marquardt method is a better nonlinear programming technique than the Gauss-Newton or Newton-Raphson method for solving the receiver state problem.
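
Note on the nonlinear estimation step: the sketch below shows a damped least-squares (Marquardt-style) iteration on a toy version of the receiver problem. It is deliberately reduced to two unknowns (a planar position) and replaces the spline field function with an assumed 1/r^3 signal model, so it illustrates only the iteration scheme. All names, the field model, and the numbers are hypothetical.

```python
def residuals_and_jacobian(pos, transmitters, measured):
    """Residuals (predicted - measured) and their Jacobian for a 1/r^3 field."""
    residuals, jacobian = [], []
    for (tx, ty), signal in zip(transmitters, measured):
        dx, dy = pos[0] - tx, pos[1] - ty
        r2 = dx * dx + dy * dy
        residuals.append(r2 ** -1.5 - signal)
        jacobian.append((-3.0 * dx * r2 ** -2.5, -3.0 * dy * r2 ** -2.5))
    return residuals, jacobian


def levenberg_marquardt(start, transmitters, measured, damping=1e-3, iterations=100):
    """Estimate a 2-D position by damped Gauss-Newton steps."""
    pos = list(start)
    res, jac = residuals_and_jacobian(pos, transmitters, measured)
    cost = sum(r * r for r in res)
    for _ in range(iterations):
        if cost < 1e-24:
            break
        # Normal equations J^T J and gradient J^T r for two parameters.
        a11 = sum(j[0] * j[0] for j in jac)
        a12 = sum(j[0] * j[1] for j in jac)
        a22 = sum(j[1] * j[1] for j in jac)
        g1 = sum(j[0] * r for j, r in zip(jac, res))
        g2 = sum(j[1] * r for j, r in zip(jac, res))
        # Marquardt damping scales the diagonal of J^T J.
        d11, d22 = a11 * (1.0 + damping), a22 * (1.0 + damping)
        det = d11 * d22 - a12 * a12
        if det == 0.0:
            break
        step_x = (-g1 * d22 + g2 * a12) / det
        step_y = (-g2 * d11 + g1 * a12) / det
        trial = [pos[0] + step_x, pos[1] + step_y]
        t_res, t_jac = residuals_and_jacobian(trial, transmitters, measured)
        t_cost = sum(r * r for r in t_res)
        if t_cost < cost:
            pos, res, jac, cost = trial, t_res, t_jac, t_cost
            damping /= 10.0  # trust the model more: closer to Gauss-Newton
        else:
            damping *= 10.0  # reject the step: closer to gradient descent
    return pos


# Synthetic noise-free data from a known position (arbitrary length units).
coils = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
truth = (0.4, 0.3)
signals = [((truth[0] - x) ** 2 + (truth[1] - y) ** 2) ** -1.5 for x, y in coils]
estimate = levenberg_marquardt([0.5, 0.5], coils, signals)
```

Setting the damping to zero recovers the plain Gauss-Newton step; the accept-or-reject rule is what keeps the cost from increasing when that undamped step would overshoot.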


Subjects
Electromagnetic Phenomena, Three-Dimensional Imaging, Theoretical Models, Speech Articulation Tests/methods, Humans, Speech Articulation Tests/standards
11.
J Acoust Soc Am ; 111(3): 1414-21, 2002 Mar.
Article in English | MEDLINE | ID: mdl-11931318

ABSTRACT

Electromagnetic articulograph (EMA) devices are capable of measuring movements of the articulatory organs inside and outside the vocal tract with fine spatial and temporal resolutions, thus providing useful articulatory data for investigating the speech production process. The position of the receiver coil is detected in the EMA device on the basis of a field function representing the spatial pattern of the magnetic field in relation to the relative positions of the transmitter and receiver coils. Therefore, the design and calibration of the field function are quite important and influence the accuracy of position detection. This paper presents a nonparametric method for representing the magnetic field, and also describes a method for determining the receiver position from the strength of the induced signal in the receiver coil. The field pattern in this method is expressed by using a multivariate spline as a function of the position in the device's coordinate system. Because of the piecewise property of the basis functions and the freedom in the selection of the rank and the number of the basis functions, the spline function has a superior ability to flexibly and accurately represent the field pattern, even when it suffers from fluctuations caused by the interference between the transmitting channels. The position of the receiver coil is determined by minimizing the difference between the measured strength of the received signal and the predicted one from the spline representation of the magnetic field. Experimental results show that the error in estimating the receiver position is less than 0.1 mm for a 14 x 14-cm measurement area, and this error can be further reduced by using a spline-smoothing technique.


Subjects
Electromagnetic Fields, Speech Articulation Tests/instrumentation, Humans, Jaw/physiology, Lip/physiology, Maxilla/physiology, Soft Palate/physiology, Sensitivity and Specificity, Computer-Assisted Signal Processing/instrumentation, Tongue/physiology