This paper introduces a protocol for the preparation of hemi-larynx specimens facilitating a multi-dimensional view of vocal fold vibration, in order to investigate various biophysical aspects of voice production in humans and non-human mammals.
The voice of humans and most non-human mammals is generated in the larynx through self-sustaining oscillation of the vocal folds. Direct visual documentation of vocal fold vibration is challenging, particularly in non-human mammals. As an alternative, excised larynx experiments provide the opportunity to investigate vocal fold vibration under controlled physiological and physical conditions. However, the use of a full larynx merely provides a top view of the vocal folds, excluding crucial portions of the oscillating structures from observation during their interaction with aerodynamic forces. This limitation can be overcome by utilizing a hemi-larynx setup where one half of the larynx is mid-sagittally removed, providing both a superior and a lateral view of the remaining vocal fold during self-sustained oscillation.
Here, a step-by-step guide for the anatomical preparation of hemi-laryngeal structures and their mounting on the laboratory bench is given. Exemplary phonation of the hemi-larynx preparation is documented with high-speed video data captured by two synchronized cameras (superior and lateral views), showing three-dimensional vocal fold motion and corresponding time-varying contact area. The documentation of the hemi-larynx setup in this publication will facilitate application and reliable repeatability in experimental research, providing voice scientists with the potential to better understand the biomechanics of voice production.
Voice is typically created by vibrating laryngeal tissue (mainly the vocal folds), which converts a steady airflow, supplied by the lungs, into a sequence of airflow pulses. The acoustic pressure waveform (i.e., the primary sound) emerging from this sequence of flow pulses acoustically excites the vocal tract which filters them, and the resulting sound is radiated from the mouth and (to a certain degree) from the nose1. The spectral composition of the generated sound is largely influenced by the quality of vocal fold vibration, governed by laryngeal biomechanics and interactions with the tracheal airflow2. Both in a clinical and a research context, documentation and assessment of vocal fold vibration is thus of foremost interest when studying voice production.
In humans, direct endoscopic investigation of the larynx during sound production in vivo is challenging, and it is virtually impossible in nonhuman mammals, given current technological means. Therefore, and in order to guarantee carefully controlled physical and/or physiological experimental boundary conditions, the use of excised larynges3,4 is in many cases an adequate substitution for investigation of in vivo voice production mechanisms.
Vocal fold vibration is a complex three-dimensional phenomenon5. While conventional investigation methods like laryngeal endoscopy (in vivo) or excised larynx preparations typically provide only a superior view of the vibrating vocal folds6, they do not allow for complete three-dimensional analysis of vocal fold motion. In particular, in the superior view the lower (caudal) margins of the vocal folds are invisible during a major portion of the vibratory cycle. This is due to the phase delay between the inferior (caudal) and the superior (cranial) edge of the vocal folds, a phenomenon which is typically seen during vocal fold oscillation5. As direct empirical evidence for backing up findings from mathematical and physical models is scarce, knowledge of the geometry and motion of the lower vocal fold edge7, and thus the geometry of the subglottal channel8,9,10 is crucial for better understanding the interaction between laryngeal airflow, vocal fold tissue, and the resulting forces and pressures11,12. Another aspect of vocal fold vibration that is hidden from the customary superior view is the vertical (caudo-cranial) depth of the contact between the two vocal folds. The vertical contact depth is related to the vertical thickness of the vocal folds, which is a potential indicator of the vocal register used in singing ("chest" vs. "falsetto" register)13,14.
In order to overcome the shortcomings of conventional (full) excised larynx preparations, a so-called hemi-larynx setup can be utilized, where one half of the larynx is removed, thus facilitating the assessment of the vibratory characteristics of the remaining vocal fold in three dimensions. Surprisingly, since the introduction of this setup in the 1960s15 and an initial validation of the concept in 199316, not many laboratories have performed experiments with this promising experimental approach17,18,19,20,21,22,23. An explanation for this might be found in the difficulties of creating a viable hemi-larynx preparation. While the conventional excised (full) larynx preparation is well documented4, no such in-depth instructions are as yet available for creating a hemi-larynx setup. It is therefore the purpose of this paper to provide a tutorial for establishing a reliably reproducible hemi-larynx setup, supplemented by experimental results from red deer specimens.
A hemi-larynx setup shares many features with a "conventional" excised larynx setup, such as measurement equipment, high-speed or other imaging technology to adequately document the vibrations of the laryngeal structures during sound generation, or proper supply of heated, humidified air. These general setup considerations are described in detail in both a book chapter4 and a technical report from the National Center of Voice and Speech24. Reiteration of these instructions would be beyond the scope of this manuscript. Here, only the specialized directives for generating a hemi-larynx setup are presented.
The animal specimens analyzed in this paper were treated in accordance with the standard ethical requirements of the Palacky University in Olomouc, Czech Republic. They stem from red deer living wildly in forests, which were hunted by the Czech Army Forest Service during a regular hunting season.
1. Preparation of the Hemi-larynx Specimen
Note: Only properly prepared specimens should be used, as indicated in4 . Quick freezing of the larynx25 immediately after excision and storage at -80 °C minimizes the potential of tissue degradation and alteration of biomechanical properties, and allows performing the experiments at any convenient time.
Figure 1: Hemi-larynx preparation and mounting. (A) and (B) Cleaned larynx specimen, medial and posterior view, before removal of left vocal fold; (C) and (D) Prepared hemi-larynx with L-shaped incision (left vocal fold removed), medial and posterior view. Please click here to view a larger version of this figure.
2. Hemi-larynx Experiment
Figure 2: Hemi-larynx setup. (A) Supporting structures: air supply tube, L-shaped glass plate arrangement, adduction prongs. (B) Mounted hemi-larynx preparation with adduction prongs. (C) and (D) Close-ups of hemi-larynx-preparation, viewed from the side and from the top, respectively. Please click here to view a larger version of this figure.
Illustrations of the hemi-larynx preparation and its mounting on the air supply tube, as referenced in the previous section, are provided in Figure 1 and Figure 2, respectively.
Documentation of vocal fold vibration from two camera angles
Airflow-induced self-sustaining oscillation of the hemi-larynx vocal fold was documented from the top and from the side with two synchronized high-speed video (HSV) cameras operated at 6,000 frame/s, complemented by time-synchronous recordings of acoustic and electroglottographic (see below) data sampled at 44.1 kHz. More information on the data acquisition setup including a list of equipment used can be found in previous publications by this group of authors 27,28. Footage from these HSV recordings is shown in the accompanying video. Still images, extracted at representative moments within the vibratory cycle, are shown in Figure 3. The top view (upper half of each panel) shows medio-lateral vocal fold movement, indicating an open glottis in Figure 3A, allowing glottal air flow, whilst in Figure 3B-D the glottis is closed (the vocal fold is in complete contact with the vertical glass plate), thus arresting the glottal air flow. The side view (lower half of each panel) in Figures 3B-D suggests a varying degree of vocal fold contact against the glass plate, as well as a varying geometry and vertical location of that contact.
Figure 3: Hemi-larynx vocal fold vibration. (A-D) Still images from high-speed video footage from the top (upper half of each panel) and the side view cameras (lower half of each panel), extracted at representative points within the vibratory cycle. Note the absence of vocal fold contact in (A), and the varying (both in area, shape, and position) vocal fold contact in (B-D). Please click here to view a larger version of this figure.
Kymographic glottal motion analysis
Quantitative glottal motion analysis is illustrated in Figure 4. The glottis is the variable opening between the (vibrating) vocal folds29, created by their deflections during self-sustained oscillation. State of the art analysis of top view HSV footage allows tracing the lateral deflections of the vocal folds30,31. The hemi-larynx preparation described here adds the facility to assess also the vertical (caudo-cranial) aspects of vocal fold vibration.
Figure 4: Kymographic glottal motion analysis.
(A) and (B) Video stills showing top and side views of the hemi-larynx, taken from high-speed video (HSV) recordings at 6,000 frames/s. The yellow vertical lines indicate the kymographic scan line position for the kymograms shown in panels C and E for the top view, and panels D and F for the side view. (C) and (D) Digital kymograms extracted from the HSV footage of the top and the side view, respectively. (E) The time-varying lateral displacement of the vocal fold extracted from the kymogram and traced with a line (short dashes). (F) The time-varying deflections of the inferior and superior edges of the vocal fold, assessed from the kymogram and traced with a dashed and a dotted line, respectively. (G) Synoptic depiction of the time-varying glottal structures: Lateral vocal fold deflection ("top", pale violet), and vertical deflection of the superior ("side sup.", dark red) and inferior ("side inf.", dark green) vocal fold edges extracted from the kymograms shown in panels E and F. (H) Motion speeds derived from the glottal structure displacement data shown in panel G. (I) and (J) Glottal motion reconstruction derived from the displacement data of the superior and inferior vocal fold margins shown in panel G. The arrows indicate the direction of the rotational movement. Please click here to view a larger version of this figure.
Two digital kymograms were generated from top and side view HSV data (Figures 4C and 4D). In a digital kymogram (DKG)32,33,34,35, the pixel data from a single line (typically at the point of maximum vocal fold vibratory amplitude), taken from a number of consecutive high-speed video frames, are concatenated to form a temporal axis on the abscissa. The time-varying displacement of the structures covered by the DKG scan line is visible on the ordinate. In the example shown in Figure 4C–F, the DKG scan line positions of the top and side view were selected halfway along the antero-posterior (ventro-dorsal) dimension of the vocal fold, utilizing the approach described by Hampala et al., Eq. 127.
The lateral and caudo-cranial deflections of the glottis, delineated by the inferior and superior vocal fold edges, were traced within the DKG data (Figures 4E and 4F) and expressed in metric units based on the video frame rate and calibration information embedded in the videos (Figure 4G and H). A reconstruction of the two-dimensional (lateral and vertical) glottal motion at the middle of the vocal fold (i.e., the place of maximum vibratory amplitude) over three complete glottal cycles is shown in Figure 4E and F. During the majority of the glottal cycle, the vocal fold was in contact with the glass plate (representing glottal closure), but with varying contact depth. During the open phase (i.e., when the vocal fold is not in contact with the glass plate), the traces of the inferior and the superior vocal fold edge fuse, and they exhibit a complex cyclic motion pattern, in partial agreement with results from other studies 5,20,36,37 (the motion pattern found in humans tends to be more elliptical than that of the red deer specimen investigated here). Interestingly, the vertical displacement reached a vibratory amplitude of about 10 mm, i.e., almost an order of magnitude larger than what was found in humans.
Assessment of vocal fold contact area
Electroglottography (EGG)38 is a widely used non-invasive method for measuring changes in relative vocal fold contact area (VFCA) during phonation. A low intensity, high-frequency current is passed between two electrodes placed at vocal fold level on each side of the larynx. The admittance variations resulting from vocal fold (de)contacting during laryngeal sound production are largely proportional to the time-varying relative vocal fold contact area39. The EGG signal is assumed to be a reliable physiological correlate of vocal fold vibration, reflecting the fundamental frequency and the oscillatory regime (irregular or periodic, including bifurcations). Despite its broad application, the possible direct relation between the VFCA and the EGG waveform has, until recently, only been tested in a single study17, suggesting an approximately linear relationship between VFCA and the EGG signal magnitude. However, flow-induced vocal fold vibration was not investigated in that study. Therefore, a rigorous empirical evaluation of EGG as a measure of relative VFCA under proper physiological conditions was therefore still needed.
In addressing this issue, this group of authors has recently investigated three red deer larynges in an excised hemi-larynx preparation utilizing a conducting glass plate27. The time varying contact between the vocal fold and the glass plate was monitored by high-speed video recordings made in the sagittal plane at 6000 fps, synchronized with the EGG signal with an accuracy of ± 0.167 ms. Representative results from that study are shown in Figure 5, indicating an average to good agreement between the EGG signal and VFCA – see reference27 for details).
Figure 5: Comparison of vocal fold contact area (VFCA) and electroglottographic (EGG) waveform. (A-D) Video stills from high-speed video data showing the side view of a red deer hemi-larynx at four instants within a glottal cycle. The manually assessed vocal fold contact area (i.e., the area where the vocal fold was in contact with the vertical glass plate in the hemi-larynx setup) is superimposed in cyan. (E) Comparison of normalized EGG and VFCA data for the vocal fold contact phase of one glottal cycle. The VFCA data stemmed from assessment of vocal fold contact area (counted in pixels) over the glottal cycle. Please click here to view a larger version of this figure.
The hemi-larynx preparation shares the advantages of the "conventional" (full) excised larynx setup: In such an experimental approach, physical and physiological boundary conditions and parameters (such as subglottal pressure or vocal fold elongation) can be controlled fairly well. The behavior of the hemilarynx is homologous to that of a full larynx with a perfect lateral symmetry, with the exception that magnitudes of some parameters (e.g., air flow rate, sound pressure) are reduced by approximately 50 %, yet still being within realistic ranges16. The major disadvantage of the full excised larynx approach, i.e., the lack of visibility of the vocal fold surface along the superior-inferior (caudo-cranial) dimension, is overcome in the hemi-larynx setup by providing a side view of the vibrating vocal fold. The hemi-larynx setup thus allows assessment of vocal fold motion in multiple dimensions, which is crucial when trying to understand the finer details of the biophysical sound generation mechanism in humans and nonhuman mammals.
Here, several exemplary applications of the hemi-larynx setup have been demonstrated. The documentation of vocal fold vibration from two camera angles allows further qualitative and quantitative data analysis. The kymographic glottal motion analysis in the vertical direction, newly introduced in this paper, allows reconstruction of the temporal geometric variations of the glottis along a selected position along the antero-posterior (dorso-ventral) glottal axis. When repeating this analysis for several points equidistantly spaced along the glottal axis, the entire glottal motion could be reconstructed. Note that this approach provides comparable but not identical results as compared to assessment of vocal fold motion by marking and tracking individual "fleshpoints" in the vocal fold tissue (also on points not forming the glottis), e.g., with micro-sutures20 or silicon carbide particles5,40. Precise knowledge about the time-varying glottal geometry in three dimensions is crucial to further investigate details of the glottal airflow and its interaction with the vibrating laryngeal tissue. For example, computational models of self-sustaining vocal fold vibration could be improved as more empirical data concerning the point of airflow jet separation 41,42,43,44,45,46,47,48 become available.
As illustrated in Figure 5, the hemi-larynx preparation enables assessment of the vocal fold contact area (VFCA) during self-sustained vocal fold vibration. For one, knowledge of the time-varying relative magnitude of VFCA is useful to validate the results from electroglottographic measurements27, as EGG is a widely-used method for non-invasive assessment of vocal fold vibration in vivo. Furthermore, measurement of the exact VFCA geometry and its variation over time might prove to be crucial for better understanding the notion of vocal fold contact depth 49 and its potential relation to the speed of the so-called mucosal wave50,51,52,53. There, an airflow-driven travelling wave occurs within the surface cover layer of the vocal fold tissue. This wave moves initially along with the trans-glottal airflow from the inferior to the superior vocal fold edge, and then it propagates laterally across the upper vocal fold surface once every oscillatory cycle54.
All things considered, the hemi-larynx approach is a powerful, yet not widely used constituent of the currently available arsenal of empirical methods for basic voice science. Here, a tutorial for creating a hemi-larynx preparation is presented, and some potential future applications are discussed. The given instructions may help improving the repeatability of experiments across different labs, thus providing voice scientists with the potential to better understand the biomechanics of voice production.
The authors have nothing to disclose.
This work was supported by an APART grant of the Austrian Academy of Sciences (CTH), the Technology Agency of the Czech Republic project no. TA04010877 (CTH, VH and JGS), and the Czech Science Foundation (GACR) project no GA16-01246S (to JGS). We thank W. Tecumseh Fitch for his suggestion to use denture fixative cream, and Ing. P. Liska from the Czech Army Forest Service for his help in acquiring the excised deer larynges.
Surgical blades | Surgeon | Jai Surgical Ltd., New Delhi, India | |
Saw | Hand saw (Lux, 150 mm length) | Lux, Wermelskirchen, Germany | |
Thermometer | Testo 922 | Testo Ltd., Hampshire, UK | K-type Probe, Operating temperature -20 to +50 °C |
Autoclave bags | Autoclave bags | vwr.com, VWR International s.r.o., Stribrna Skalice, Czech republic | |
Conductive glass plates | Custom made | UPOL – Joint laboratory of Optics Trida 17. listopadu 50A, 772 07 Olomouc, the Czech Rep. |
|
Fixative cream | Denture fixative cream | Blend-a-dent Natural | |
Prongs and fastening system | Customized Kanya Al eloxed profiles | Distributor: VISIMPEX a.s.. Seifertova 33, 750 02 Prerov, the Czech Rep.; | Combination of Kanya RVS and PVS fastening systems (http://www.kanya.cz/) + custom made prongs |
Mounting tube | Custom made | UPOL – Joint laboratory of Optics, Trida 17. listopadu 50A, 772 07 Olomouc, the Czech Rep. |
|
LED Light | Verbatim 52204 LED Lamp | Mitsubishi Chemical Holdings Corporation, Tokyo, Japan | |
Camera | Canon EOS1100D | Canon Inc. | 18-55 mm lens |
Airpump | Resun LP100 | Resun | |
Strobe light | ELMED Helio-Strob micro2 | ELMED Dr. Ing. Mense GmbH, Heiligenhaus, Germany | |
Humidifier | Custom made | Voice Research Lab, Dept. Biophysics, Faculty of Sciences, Palacky University Olomouc, Czech republic | |
Subglottic tract | Custom made adjustable subglottic tract | Voice Research Lab, Dept. Biophysics, Faculty of Sciences, Palacky University Olomouc, Czech republic | Hampala, V., Svec, Jan, Schovanek, P., and Mandat, D. Uzitny vzor c. 25585: Model subglotickeho traktu. [Utility model no. 25585: Model of subglottal tract] (In Czech) Soukup, P. 2013-27834(CZ 25505 U1), 1-7. 24-6-2013. Praha, Urad prumysloveho vlastnictvi |