Recent from talks
Nothing was collected or created yet.
Hoarse voice
View on Wikipedia| Hoarse voice | |
|---|---|
| Other names | Hoarseness, dysphonia |
| Specialty | Otolaryngology |
A hoarse voice, also known as dysphonia or hoarseness,[1] is when the voice involuntarily sounds breathy, raspy, or strained, or is softer in volume or lower in pitch.[2][3][clarification needed] A hoarse voice can be associated with a feeling of unease or scratchiness in the throat.[2] Hoarseness is often a symptom of problems in the vocal folds of the larynx.[2] It may be caused by laryngitis, which in turn may be caused by an upper respiratory infection, a cold, or allergies.[2] Cheering at sporting events, speaking loudly in noisy environments, talking for too long without resting one's voice, singing loudly, or speaking with a voice that is too high or too low can also cause temporary hoarseness. Smoking can also contribute to a hoarse voice.[4][2] A number of other causes for losing one's voice exist, and treatment is generally by resting the voice and treating the underlying cause.[2] If the cause is misuse or overuse of the voice, drinking plenty of water may alleviate the problems.[2]
It appears to occur more commonly in females and the elderly.[5] Furthermore, certain occupational groups, such as teachers and singers, are at an increased risk.[6][7]
Long-term hoarseness, or hoarseness that persists over three weeks, especially when not associated with a cold or flu should be assessed by a medical doctor.[2] It is also recommended to see a doctor if hoarseness is associated with coughing up blood, difficulties swallowing, a lump in the neck, pain when speaking or swallowing, difficulty breathing, or complete loss of voice for more than a few days.[2] For voice to be classified as "dysphonic", abnormalities must be present in one or more vocal parameters: pitch, loudness, quality, or variability.[8] Perceptually, dysphonia can be characterised by hoarse, breathy, harsh, or rough vocal qualities, but some kind of phonation remains.[8]
Dysphonia can be categorized into two broad main types: organic and functional, and classification is based on the underlying pathology. While the causes of dysphonia can be divided into five basic categories, all of them result in an interruption of the ability of the vocal folds to vibrate normally during exhalation, which affects the voice. The assessment and diagnosis of dysphonia is done by a multidisciplinary team, and involves the use of a variety of subjective and objective measures, which look at both the quality of the voice as well as the physical state of the larynx.[citation needed] Multiple treatments have been developed to address organic and functional causes of dysphonia. Dysphonia can be targeted through direct therapy, indirect therapy, medical treatments, and surgery. Functional dysphonias may be treated through direct and indirect voice therapies, whereas surgeries are recommended for chronic, organic dysphonias.[9]
Types
[edit]Voice disorders can be divided into two broad categories: organic and functional.[10] The distinction between these broad classes stems from their cause, whereby organic dysphonia results from some sort of physiological change in one of the subsystems of speech (for voice, usually respiration, laryngeal anatomy, and/or other parts of the vocal tract are affected). Conversely, functional dysphonia refers to hoarseness resulting from vocal use (i.e. overuse/abuse).[11] Furthermore, according to ASHA, organic dysphonia can be subdivided into structural and neurogenic; neurogenic dysphonia is defined as impaired functioning of the vocal structure due to a neurological problem (in the central nervous system or peripheral nervous system); in contrast, structural dysphonia is defined as impaired functioning of the vocal mechanism that is caused by some sort of physical change (e.g. a lesion on the vocal folds).[11] Notably, an additional subcategory of functional dysphonia recognized by professionals is psychogenic dysphonia, which can be defined as a type of voice disorder that has no known cause and can be presumed to be a product of some sort of psychological stressors in one's environment.[11][12] It is important to note that these types are not mutually exclusive and much overlap occurs. For example, Muscle Tension Dysphonia (MTD) has been found to be a result of many different causes including the following: MTD in the presence of an organic pathology (i.e. organic type), MTD stemming from vocal use (i.e. functional type), and MTD as a result of personality and/or psychological factors (i.e. psychogenic type).[11][13]
- Organic dysphonia
- Laryngitis (Acute: viral, bacterial) - (Chronic: smoking, GERD, LPR)
- Neoplasm (Premalignant: dysplasia) - (Malignant: Squamous cell carcinoma)
- Trauma (Iatrogenic: surgery, intubation) - (Accidental: blunt, penetrating, thermal)
- Endocrine (Hypothyroidism, hypogonadism)
- Haematological (Amyloidosis)
- Iatrogenic (inhaled corticosteroids)
- Functional dysphonia
- Psychogenic
- Vocal misuse
- Idiopathic
Causes
[edit]The most common causes of hoarseness is laryngitis (acute 42.1%; chronic 9.7%) and functional dysphonia (30%).[14] Hoarseness can also be caused by laryngeal tumours (benign 10.7 - 31%; malignant 2.2 - 3.0%).[14] Causes that are overall less common include neurogenic conditions (2.8 - 8.0%), psychogenic conditions (2.0 - 2.2%), and aging (2%).[14]
A variety of different causes, which result in abnormal vibrations of the vocal folds, can cause dysphonia. These causes can range from vocal abuse and misuse to systemic diseases. Causes of dysphonia can be divided into five basic categories, although overlap may occur between categories.[15][16][17][18] (Note that this list is not exhaustive):
- Neoplastic/structural: Abnormal growths of the vocal fold tissue.
- Inflammatory: Changes in the vocal fold tissue as a result of inflammation.
- Allergy
- Infections
- Reflux
- Smoking
- Trauma
- Voice abuse
- Neuromuscular: Disturbances in any of the components of the nervous system that control laryngeal function.
- Associated Systemic Diseases: Systemic diseases which have manifestations that affect the voice.
- Technical: Associated with poor muscle functioning or psychological stresses, with no corresponding physiological abnormalities of the larynx.
- Psychogenic such as dissociation disorder[19]
- Excess demands
- Stress
- Vocal strain
Employment
[edit]It has been suggested that certain occupational groups may be at increased risk of developing dysphonia[6][7] due to the excessive or intense vocal demands of their work.[20] Research on this topic has primarily focused on teachers and singers, although some studies have examined other groups of heavy voice users (e.g. actors, cheerleaders, aerobic instructors, etc.).[6][21] At present, it is known that teachers and singers are likely to report dysphonia.[20][22] Moreover, physical education teachers, teachers in noisy environments, and those who habitually use a loud speaking voice are at increased risk.[20] The term clergyman's throat or dysphonia clericorum was previously used for painful dysphonia associated with public speaking, particularly among preachers.[23] However, the exact prevalence rates for occupational voice users are unclear, as individual studies have varied widely in the methodologies used to obtain data (e.g. employing different operational definitions for "singer").[20][22]
Mechanism
[edit]Located in the anterior portion of the neck is the larynx (also known as the voice box), a structure made up of several supporting cartilages and ligaments, which houses the vocal folds.[24] In normal voice production, exhaled air moves out of the lungs and passes upward through the vocal tract.[24] At the level of the larynx, the exhaled air causes the vocal folds to move toward the midline of the tract (a process called adduction). The adducted vocal folds do not close completely but instead remain partially open. The narrow opening between the folds is referred to as the glottis.[24][8] As air moves through the glottis, it causes a distortion of the air particles which sets the vocal folds into vibratory motion. It is this vibratory motion that produces phonation or voice.[8] In dysphonia, there is an impairment in the ability to produce an appropriate level of phonation. More specifically, it results from an impairment in vocal fold vibration or the nerve supply of the larynx.[8]
Diagnosis
[edit]The assessment and diagnosis of a dysphonic voice is completed by a multidisciplinary team, such as an otolaryngologist (ear, nose and throat doctor) and Speech-Language Pathologist, involving the use of both objective and subjective measures to evaluate the quality of the voice as well as the condition of the vocal fold tissue and vibration patterns.[25]
Definition
[edit]Dysphonia is a broad clinical term which refers to abnormal functioning of the voice.[24][8] More specifically, a voice can be classified as "dysphonic" when there are abnormalities or impairments in one or more of the following parameters of voice: pitch, loudness, quality, and variability.[8] For example, abnormal pitch can be characterized by a voice that is too high or low whereas abnormal loudness can be characterized by a voice that is too quiet or loud.[8] Similarly, a voice that has frequent, inappropriate breaks characterizes abnormal quality while a voice that is monotone (i.e., very flat) or inappropriately fluctuates characterizes abnormal variability.[8] While hoarseness is used interchangeably with the term dysphonia, it is important to note that the two are not synonymous. Hoarseness is merely a subjective term to explain the perceptual quality (or sound) of a dysphonic voice.[26] While hoarseness is a common symptom (or complaint) of dysphonia,[24] there are several other signs and symptoms that can be present such as: breathiness, roughness, and dryness. Furthermore, a voice can be classified as dysphonic when it poses problems in the functional or occupational needs of the individual or is inappropriate for their age or sex.[8]
Auditory-perceptual measures
[edit]Auditory-perceptual measures are the most commonly used tool by clinicians to evaluate the voice quality due to its quick and non-invasive nature.[27] Additionally, these measures have been proven to be reliable in a clinical setting.[28] Ratings are used to evaluate the quality of a patient's voice for a variety of voice features, including overall severity, roughness, breathiness, strain, loudness and pitch. These evaluations are done during spontaneous speech, sentence or passage reading or sustained vowel productions.[18] The GRBAS (Grade, Roughness, Breathiness, Asthenia, Strain) and the CAPE-V (Consensus Auditory Perceptual Evaluation—Voice) are two formal voice rating scales commonly used for this purpose.[27]
Vocal fold imaging
[edit]Vocal fold imaging techniques are used by clinicians to examine the vocal folds and allows them to detect vocal pathology and assess the quality of the vocal fold vibrations. Laryngeal stroboscopy is the primary clinical tool used for this purpose. Laryngeal stroboscopy uses a synchronized flashing light passed through either a rigid or flexible laryngoscope to provide an image of the vocal fold motion; the image is created by averaging over several vibratory cycles and is thus not provided in real-time.[29] As this technique relies on periodic vocal fold vibration, it cannot be used in patients with moderate to severe dysphonia.[18] High speed digital imaging of the vocal folds (videokymography), another imaging technique, is not subject to the same limitations as laryngeal stroboscopy. A rigid endoscope is used to take images at a rate of 8000 frames per second, and the image is displayed in real time. As well, this technique allows imaging of aperiodic vibrations[18] and can thus be used with patients presenting with all severities of dysphonia.
Acoustic measures
[edit]Acoustic measures can be used to provide objective measures of vocal function. Signal processing algorithms are applied to voice recordings made during sustained phonation or during spontaneous speech.[30] The acoustic parameters which can then be examined include fundamental frequency, signal amplitude, jitter, shimmer, and noise-to-harmonic ratios.[18] However, due to limitations imposed by the algorithms employed, these measures cannot be used with patients who exhibit severe dysphonia.[30]
Aerodynamic measures
[edit]Aerodynamic measures of voice include measures of air volume, air flow and sub glottal air pressure. The normal aerodynamic parameters of voice vary considerably among individuals, which leads to a large overlapping range of values between dysphonic and non-dysphonic patients. This limits the use of these measures as a diagnostic tool.[18] Nonetheless, they are useful when used in adjunct with other voice assessment measures, or as a tool for monitoring therapeutic effects over time.[28]
Prevention
[edit]Given that certain occupations are more at risk for developing dysphonia (e.g., teachers) research into prevention studies have been conducted.[31] Research into the effectiveness of prevention strategies for dysphonia have yet to produce definitive results, however, research is still ongoing.[10][31] Primarily, there are two types of vocal training recognized by professionals to help with prevention: direct and indirect. Direct prevention describes efforts to reduce conditions that may serve to increase vocal strain (such as patient education, relaxation strategies, etc.), while indirect prevention strategies refer to changes in the underlying physiological mechanism for voice production (e.g., adjustments to the manner in which vocal fold adduction occurs, respiratory training, shifting postural habits, etc.).[10][31]
Treatment
[edit]Although there is no universal classification of voice problems, voice disorders can be separated into certain categories: organic (structural or neurogenic), functional, neurological (psychogenic) or iatrogenic, for example.[32] Depending on the diagnosis and severity of the voice problem, and depending on the category that the voice disorder falls into, there are various treatment methods that can be suggested to the patient. The professional has to keep in mind there is not one universal treatment, but rather the clinical approach must find what the optimal effective course of action for that particular patient is.[citation needed]
There are three main type of treatments: medical treatments, voice therapy and surgical treatments.[33] When necessary, certain voice disorders use a combination of treatment approaches.[10] A medical treatment involves the use of botulinum toxin (botox) or anti-reflux medicines, for example. Botox is a key treatment for voice disorders such as Spasmodic Dysphonia.[34] Voice therapy is mainly used with patients who have an underlying cause of voice misuse or abuse.[35] Laryngologists also recommend this type of treatment to patients who have an organic voice disorder - such as vocal fold nodules, cysts or polyps as well as to treat functional dysphonia.[10] Certain surgical treatments can be implemented as well - phono microsurgery (removal of vocal fold lesions performed with a microscope), laryngeal framework surgery (the manipulation of the voice box), as well as injection augmentation (injection of substance to vocal folds to improve closure). Surgical treatments may be recommended for patients having an organic dysphonia.[36][37]
A combination of both an indirect treatment method (an approach used to change external factors affecting the vocal folds)[38] and a direct treatment method (an approach used where the mechanisms functioning during the use of the vocal folds, such as phonation or respiration, are the main focus)[38] may be used to treat dysphonia.[10][13][39][40]
Direct therapies
[edit]Direct therapies address the physical aspects of vocal production.[10] Techniques work to either modify vocal fold contact, manage breathing patterns, and/or change the tension at level of the larynx.[10] Notable techniques include, but are not limited to, the yawn-sigh method, optimal pitch, laryngeal manipulation, humming, the accent method, and the Lee Silverman Voice Treatment.[10][39] An example of a direct therapy is circumlaryngeal manual therapy, which has been used to reduce tension and massage hyoid-laryngeal muscles.[13] This area is often tense from chronic elevation of the larynx.[13] Pressure is applied to these areas as the patient hums or sustains a vowel.[13] Traditional voice therapy is often used to treat muscular tension dysphonia.[13]
Indirect therapies
[edit]Indirect therapies take into account external factors that may influence vocal production.[10] This incorporates maintenance of vocal hygiene practices, as well as the prevention of harmful vocal behaviours.[41] Vocal hygiene includes adequate hydration of the vocal folds, monitoring the amount of voice use and rest, avoidance of vocal abuse (e.g., shouting, clearing of the throat), and taking into consideration lifestyle choices that may affect vocal health (e.g., smoking, sleeping habits).[41] Vocal warm-ups and cool-downs may be employed to improve muscle tension and decrease risk of injury before strenuous vocal activities.[41] It should be taken into account that vocal hygiene practices alone are minimally effective in treating dysphonia, and thus should be paired with other therapies.[41]
Medication and surgery
[edit]Medical and surgical treatments have been recommended to treat organic dysphonias. An effective treatment for spasmodic dysphonia (hoarseness resulting from periodic breaks in phonation due to hyperadduction of the vocal folds) is botulinum toxin injection.[9][42] The toxin acts by blocking acetylcholine release at the thyro-arytenoid muscle. Although the use of botulinum toxin injections is considered relatively safe, patients' responses to treatment differ in the initial stages; some have reported experiencing swallowing problems and breathy voice quality as a side-effect to the injections.[9][42] Breathiness may last for a longer period of time for males than females.[42]
Surgeries involve myoectomies of the laryngeal muscles to reduce voice breaks, and laryngoplasties, in which laryngeal cartilage is altered to reduce tension.[9]
Epidemiology
[edit]Dysphonia is a general term for voice impairment that is sometimes used synonymously with the perceptual voice quality of hoarseness.[14] It is the reason for 1% of all visits to primary care providers.[14] The lifetime risk of hoarse voice complaints among primary care patients is 30%.[14] Since hoarseness is a general symptom, it is associated with a number of laryngeal diagnoses.[14]
There is an interplay of sex and age differences associated with dysphonia. The point prevalence of dysphonia in adults under the age of 65 is 6.6%.[21] Dysphonia is more common in adult females than males,[21][43] possibly due to sex-related anatomical differences of the vocal mechanism.[5] In childhood, however, dysphonia is more often found in boys than girls.[44] As there are no anatomical differences in larynges of boys and girls prior to puberty, it has been proposed that the higher rate of voice impairment found in boys arises from louder social activities, personality factors, or more frequent inappropriate vocal use.[44] The most common laryngeal diagnosis among children is vocal fold nodules,[21] a condition known to be associated with vocally damaging behaviours.[45] However, a causal relationship has not yet been definitively proven.[44] The overall prevalence of dysphonia in children ranges from 3.9% - 23.4%, most commonly affecting children between the ages of 8 - 14.[21] Among the elderly, dysphonia is associated with both naturally occurring anatomical and physiological changes as well as higher rates of pathological conditions.[43] The point prevalence of dysphonia among the elderly is 29%.[failed verification][21] Findings regarding the prevalence of geriatric dysphonia in the general population are very variable, ranging from 4 - 29.1%.[43] This variability is likely due to different methodology used in obtaining information from participants.[21] The most common laryngeal diagnoses among the elderly are polyps, laryngopharyngeal reflux, muscle tension dysphonia, vocal fold paresis or paralysis, vocal fold mass, glottic insufficiency, malignant lesions, and neurologic conditions affecting the larynx.[43]
References
[edit]- ^ "Hoarseness".
- ^ a b c d e f g h i "Hoarseness". NIDCD. 2015-08-18. Retrieved 2017-07-24.
- ^ Johns MM, Sataloff RT, Merati AL, Rosen CA (August 2010). "Shortfalls of the American Academy of Otolaryngology-Head and Neck Surgery's Clinical practice guideline: Hoarseness (Dysphonia)". Otolaryngology–Head and Neck Surgery. 143 (2): 175–7, discussion 175–80. doi:10.1016/j.otohns.2010.05.026. PMID 20647114. S2CID 22553812.
- ^ https://health.clevelandclinic.org/is-smokers-voice-real
- ^ a b Cohen SM, Kim J, Roy N, Asche C, Courey M (February 2012). "Prevalence and causes of dysphonia in a large treatment-seeking population". The Laryngoscope. 122 (2): 343–8. doi:10.1002/lary.22426. PMID 22271658. S2CID 31457683.
- ^ a b c Williams, N. R. (2003). "Occupational groups at risk for voice disorders: A review of the literature". Occupational Medicine. 53 (7): 456–460. doi:10.1093/occmed/kqg113. PMID 14581643.
- ^ a b Verdolini K, Ramig LO (2001). "Review: occupational risks for voice problems". Logopedics, Phoniatrics, Vocology. 26 (1): 37–46. doi:10.1080/14015430119969. PMID 11432413. S2CID 7436065.
- ^ a b c d e f g h i j Aronson AE, Bless DM (2009). Clinical voice disorders. New York: Thieme. pp. 1–5.
- ^ a b c d Ludlow CL (June 2009). "Treatment for spasmodic dysphonia: limitations of current approaches". Current Opinion in Otolaryngology & Head and Neck Surgery. 17 (3): 160–5. doi:10.1097/moo.0b013e32832aef6f. PMC 2763389. PMID 19337127.
- ^ a b c d e f g h i j Ruotsalainen J, Sellman J, Lehto L, Verbeek J (May 2008). "Systematic review of the treatment of functional dysphonia and prevention of voice disorders". Otolaryngology–Head and Neck Surgery. 138 (5): 557–65. doi:10.1016/j.otohns.2008.01.014. PMID 18439458.
- ^ a b c d "Voice Disorders Overview". American Speech-Language Hearing Association. Retrieved October 2, 2016.
- ^ Duffy JR, Yorkston KM (2003). "Medical interventions for Spasmodic Dysphonia and some related conditions: A systematic review". Journal of Medical Speech-Language Pathology. 11.
- ^ a b c d e f Van Houtte E, Van Lierde K, Claeys S (March 2011). "Pathophysiology and treatment of muscle tension dysphonia: a review of the current knowledge". Journal of Voice. 25 (2): 202–7. doi:10.1016/j.jvoice.2009.10.009. PMID 20400263.
- ^ a b c d e f g Reiter R, Hoffmann TK, Pickhard A, Brosch S (May 2015). "Hoarseness-causes and treatments". Deutsches Ärzteblatt International. 112 (19): 329–37. doi:10.3238/arztebl.2015.0329. PMC 4458789. PMID 26043420.
- ^ Feierabend RH, Shahram MN (August 2009). "Hoarseness in adults". American Family Physician. 80 (4): 363–70. PMID 19678604.
- ^ Pylypowich A, Duff E (2016). "Differentiating the Symptom of Dysphonia". The Journal for Nurse Practitioners. 12 (7): 459–466. doi:10.1016/j.nurpra.2016.04.025.
- ^ Harries M (2013). "Hoarseness and Voice Disorders". In Ludman HS, Bradley PJ (eds.). ABC of ear, nose, and throat. West Sussex, UK: Wiley-Blackwell. p. 95.
- ^ a b c d e f Morris R, Bernard Harmon A (2010). "Describing Voice Disorders". In Damico J, Muller N, Ball MJ (eds.). Handbook of language and speech disorders. Chichester, U.K.: Wiley-Blackwell. pp. 455–473. ISBN 978-1-4051-5862-6.
- ^ Dr Pragya Semwal, Dr Shobit Garg Chronic Dissociation Presenting as Dysphonia Plica Ventricularis: An Atypical Presentation. Int. j. med. case reports Vol 6 Issue 3 Jul-Sep 2020 1-3. http://www.ijomcr.net Archived 2021-02-07 at the Wayback Machine
- ^ a b c d Cantor Cutiva LC, Vogel I, Burdorf A (March 2013). "Voice disorders in teachers and their associations with work-related factors: a systematic review". Journal of Communication Disorders. 46 (2): 143–55. doi:10.1016/j.jcomdis.2013.01.001. PMID 23415241.
- ^ a b c d e f g Stachler RJ, Francis DO, Schwartz SR, Damask CC, Digoy GP, Krouse HJ, McCoy SJ, Ouellette DR, Patel RR, Reavis CC, Smith LJ, Smith M, Strode SW, Woo P, Nnacheta LC (March 2018). "Clinical Practice Guideline: Hoarseness (Dysphonia) (Update)". Otolaryngology–Head and Neck Surgery. 158 (1_suppl): S1 – S42. doi:10.1177/0194599817751030. PMID 29494321.
- ^ a b Pestana PM, Vaz-Freitas S, Manso MC (November 2017). "Prevalence of Voice Disorders in Singers: Systematic Review and Meta-Analysis". Journal of Voice. 31 (6): 722–727. doi:10.1016/j.jvoice.2017.02.010. PMID 28342677.
- ^ Broadwater, Kimberly (2021). "Clergyman's Sore Throat". Journal of Singing. 78 (1): 113–117. doi:10.53830/CNLB1302. ISSN 2769-4046. S2CID 239663449.
- ^ a b c d e Colton, R. H., Casper, J. K., Leonard, R. (2011). Understanding voice problems: A physiological perspective for diagnosis and treatment. Baltimore, MB: Lippincott Williams & Wilkins. pp. 372–385.
{{cite book}}: CS1 maint: multiple names: authors list (link) - ^ Mehta DD, Hillman RE (June 2008). "Voice assessment: updates on perceptual, acoustic, aerodynamic, and endoscopic imaging methods". Current Opinion in Otolaryngology & Head and Neck Surgery. 16 (3): 211–5. doi:10.1097/moo.0b013e3282fe96ce. PMC 3775647. PMID 18475073.
- ^ Schwartz SR, Cohen SM, Dailey SH, Rosenfeld RM, Deutsch ES, Gillespie MB, Granieri E, Hapner ER, Kimball CE, Krouse HJ, McMurray JS, Medina S, O'Brien K, Ouellette DR, Messinger-Rapport BJ, Stachler RJ, Strode S, Thompson DM, Stemple JC, Willging JP, Cowley T, McCoy S, Bernad PG, Patel MM (September 2009). "Clinical practice guideline: hoarseness (dysphonia)". Otolaryngology–Head and Neck Surgery. 141 (3 Suppl 2): S1 – S31. doi:10.1016/j.otohns.2009.06.744. PMID 19729111. S2CID 14655786.
- ^ a b Oates J (2009). "Auditory-perceptual evaluation of disordered voice quality: pros, cons and future directions". Folia Phoniatrica et Logopaedica. 61 (1): 49–56. doi:10.1159/000200768. PMID 19204393.
- ^ a b Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, Van De Heyning P, Remacle M, Woisard V (February 2001). "A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS)". European Archives of Oto-Rhino-Laryngology. 258 (2): 77–82. doi:10.1007/s004050000299. PMID 11307610. S2CID 3795132.
- ^ Kendall KA (March 2009). "High-speed laryngeal imaging compared with videostroboscopy in healthy subjects". Archives of Otolaryngology–Head & Neck Surgery. 135 (3): 274–81. doi:10.1001/archoto.2008.557. PMID 19289706.
- ^ a b Little MA, McSharry PE, Roberts SJ, Costello DA, Moroz IM (June 2007). "Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection". BioMedical Engineering OnLine. 6: 23. arXiv:0707.0086. doi:10.1186/1475-925X-6-23. PMC 1913514. PMID 17594480.
- ^ a b c Ruotsalainen JH, Sellman J, Lehto L, Jauhiainen M, Verbeek JH (October 2007). "Interventions for preventing voice disorders in adults". The Cochrane Database of Systematic Reviews. 4 (4) CD006372. doi:10.1002/14651858.cd006372.pub2. PMC 8923518. PMID 17943906.
- ^ Morrison MD, Rammage LA (May 1993). "Muscle misuse voice disorders: description and classification". Acta Oto-Laryngologica. 113 (3): 428–34. doi:10.3109/00016489309135839. PMID 8517149.
- ^ Maryn Y, De Bodt M, Roy N (May 2010). "The Acoustic Voice Quality Index: toward improved treatment outcomes assessment in voice disorders". Journal of Communication Disorders. 43 (3): 161–74. doi:10.1016/j.jcomdis.2009.12.004. PMID 20080243.
- ^ Kim JW, Park JH, Park KN, Lee SW (2014). "Treatment efficacy of electromyography versus fiberscopy-guided botulinum toxin injection in adductor spasmodic dysphonia patients: a prospective comparative study". TheScientificWorldJournal. 2014 327928. doi:10.1155/2014/327928. PMC 4213399. PMID 25383369.
- ^ Chow YW, Pietranico R, Mukerji A (October 1975). "Studies of oxygen binding energy to hemoglobin molecule". Biochemical and Biophysical Research Communications. 66 (4): 1424–31. doi:10.1016/0006-291x(75)90518-5. PMID 6.
- ^ Nacci A, Romeo SO, Berrettini S, Matteucci J, Cavaliere MD, Mancini V, Panicucci E, Ursino F, Fattori B (August 2017). "Stabilometric findings in patients affected by organic dysphonia before and after phonomicrosurgery". Acta Otorhinolaryngologica Italica. 37 (4): 286–294. doi:10.14639/0392-100x-1035. PMC 5584100. PMID 28872158.
- ^ Encyclopedia of otolaryngology, head and neck surgery. Kountakis, Stilianos E. Berlin: Springer. 2013. ISBN 978-3-642-23499-6. OCLC 837855971.
{{cite book}}: CS1 maint: others (link) - ^ a b "Voice Disorders: Treatment". American Speech-Language-Hearing Association. Retrieved 2018-10-03.
- ^ a b Hicks DM (June 1999). "The efficacy of voice treatment". Current Opinion in Otolaryngology & Head and Neck Surgery. 7 (3): 125. doi:10.1097/00020840-199906000-00005.
- ^ Ulis JM, Yanagisawa E (June 2009). "What's new in differential diagnosis and treatment of hoarseness?". Current Opinion in Otolaryngology & Head and Neck Surgery. 17 (3): 209–15. doi:10.1097/moo.0b013e32832a2230. PMID 19469052. S2CID 26072006.
- ^ a b c d Behlau M, Oliveira G (June 2009). "Vocal hygiene for the voice professional". Current Opinion in Otolaryngology & Head and Neck Surgery. 17 (3): 149–54. doi:10.1097/moo.0b013e32832af105. PMID 19342952. S2CID 38511217.
- ^ a b c Boutsen F, Cannito MP, Taylor M, Bender B (June 2002). "Botox treatment in adductor spasmodic dysphonia: a meta-analysis". Journal of Speech, Language, and Hearing Research. 45 (3): 469–81. doi:10.1044/1092-4388(2002/037). PMID 12069000.
- ^ a b c d de Araújo Pernambuco L, Espelt A, Balata PM, de Lima KC (October 2015). "Prevalence of voice disorders in the elderly: a systematic review of population-based studies". European Archives of Oto-Rhino-Laryngology. 272 (10): 2601–9. doi:10.1007/s00405-014-3252-7. PMID 25149291. S2CID 24668793.
- ^ a b c Maia AA, Gama AC, Kümmer AM (April 2014). "Behavioral characteristics of dysphonic children: integrative literature review". CoDAS. 26 (2): 159–63. doi:10.1590/2317-1782/2014408IN. PMID 24918510.
- ^ Mansuri B, Tohidast SA, Soltaninejad N, Kamali M, Ghelichi L, Azimi H (September 2018). "Nonmedical Treatments of Vocal Fold Nodules: A Systematic Review". Journal of Voice. 32 (5): 609–620. doi:10.1016/j.jvoice.2017.08.023. PMID 29032130. S2CID 37755681.
External links
[edit]Hoarse voice
View on GrokipediaOverview
Definition
A hoarse voice, medically termed dysphonia, refers to an alteration in the normal quality, pitch, loudness, or resonance of the voice, resulting in sounds that are rough, raspy, strained, breathy, weak, or inconsistent.[7] This condition arises from disruptions in the vibration or closure of the vocal folds within the larynx, impairing the efficient production of voiced sounds during phonation.[8] Unlike temporary vocal changes from minor irritation, dysphonia encompasses a spectrum of voice disorders that can be acute or persistent, often affecting communication and quality of life.[9] Key characteristics of dysphonia include a breathy or hoarse timbre due to incomplete vocal fold adduction, reduced vocal intensity leading to softer speech, and variations in pitch that may make the voice higher, lower, or unsteady.[7] It may also manifest as vocal fatigue, where the voice tires quickly during use, or as intermittent breaks in phonation, distinguishing it from aphonia, which involves complete loss of voice.[8] These features stem from either organic causes, such as structural lesions on the vocal folds, or functional issues, like improper muscle tension, and are commonly evaluated through perceptual assessment by clinicians.[9] Dysphonia affects approximately one-third of individuals at some point in their lives, with higher incidence among vocal professionals such as teachers and singers, as well as those with risk factors like smoking or frequent voice overuse.[10] While often benign and self-resolving, persistent dysphonia beyond two weeks warrants medical evaluation to rule out underlying pathology.[7] The term "hoarseness" is frequently used interchangeably with dysphonia in clinical contexts, though it specifically emphasizes the raspy quality over other voice alterations.[8]Signs and symptoms
Hoarseness, also known as dysphonia, primarily manifests as an alteration in voice quality, making it difficult to produce clear vocal sounds. The voice may sound breathy, raspy, strained, rough, or husky, often accompanied by changes in pitch that can be higher or lower than usual, or a reduction in volume that results in a softer or weaker tone.[1][11][12] Individuals with a hoarse voice frequently experience associated sensations in the throat, such as scratchiness, rawness, tickling, dryness, or soreness, which can contribute to an urge to clear the throat repeatedly.[1][11] These symptoms often arise from inflammation or irritation of the vocal cords and typically resolve within 2 to 3 weeks if caused by acute conditions like infections.[12][11] In some cases, hoarseness may progress to partial or complete short-term voice loss, where speaking becomes strained or impossible. Additional symptoms can include a dry cough, pain during speaking or swallowing, or difficulty swallowing (dysphagia).[11][3] More concerning signs that warrant medical evaluation include persistent hoarseness lasting over 2 to 3 weeks in adults or 1 week in children, trouble breathing, coughing up blood, a lump in the neck, or fever, as these may indicate underlying issues beyond simple irritation.[1][12]Classification
Types of dysphonia
Dysphonia refers to any impairment in the ability to produce voice sounds using the vocal organs, and it is primarily classified into three main categories: organic, functional, and neurologic, based on the underlying etiology and presence of structural or physiological changes. This classification helps guide diagnosis and treatment by distinguishing between disorders caused by physical lesions, improper voice use without structural damage, and neurological impairments affecting vocal cord control.[8][13] Organic dysphonia arises from identifiable structural or physiological abnormalities in the larynx, vocal folds, or surrounding tissues, often due to inflammation, trauma, or growths. Common subtypes include inflammatory conditions such as acute or chronic laryngitis, which account for a significant portion of cases (e.g., acute laryngitis in approximately 42% of dysphonia presentations), resulting from viral infections, overuse, or irritants like smoking.[8] Benign lesions, such as vocal fold nodules, polyps, or cysts, frequently develop from chronic voice abuse and lead to irregular vocal fold vibration.[8] Neoplastic causes encompass malignant tumors like squamous cell carcinoma, which represents 85-95% of laryngeal cancers and is strongly associated with tobacco use and alcohol consumption. Traumatic organic dysphonia may occur from surgical interventions, intubation, or external injuries, causing scarring or hemorrhage.[8] Functional dysphonia involves normal anatomical structures but disordered voice production due to behavioral or psychological factors, such as excessive muscle tension or inefficient phonatory patterns. The most prevalent subtype is muscle tension dysphonia (MTD), characterized by hyperfunction of laryngeal muscles leading to strained or breathy voice quality, often linked to stress, vocal fatigue, or compensatory habits from prior minor injuries.[8][13] Psychogenic dysphonia, a rarer form, stems from emotional or psychological distress without organic basis, manifesting as sudden voice loss or inconsistency. These functional types typically respond well to voice therapy aimed at retraining proper muscle coordination.[8] Neurologic dysphonia results from disruptions in the neural control of the larynx, often as a subset of organic causes but distinguished by involvement of the central or peripheral nervous system. Key subtypes include vocal fold paralysis, where one or both vocal folds fail to move due to nerve damage from conditions like stroke, surgery, or viral infections, leading to breathy or weak voice.[8][13] Spasmodic dysphonia, a focal dystonia, causes involuntary spasms in the vocal folds during speech, producing a strained, interrupted, or tremulous voice, and is considered a primary neurologic disorder.[8] Other examples involve neurodegenerative diseases such as Parkinson's disease or multiple sclerosis, which impair laryngeal coordination and contribute to hypophonic or tremulous dysphonia.[8]Severity assessment
Severity assessment of hoarse voice, or dysphonia, involves evaluating the degree of vocal impairment to guide diagnosis, treatment planning, and monitoring progress. This process typically combines subjective auditory-perceptual ratings by trained clinicians with objective acoustic and aerodynamic measures to quantify the extent of voice deviation from normal. Perceptual evaluations focus on attributes such as overall severity, roughness, breathiness, asthenia, and strain, while objective tools provide numerical indices based on voice parameters. These assessments are essential for determining functional impact and are recommended in clinical guidelines for patients presenting with altered vocal quality, pitch, loudness, or effort. A primary method for auditory-perceptual evaluation is the GRBAS scale, developed by Hirano in 1981, which rates five voice characteristics on a 4-point ordinal scale from 0 (normal) to 3 (severe). The components include Grade (overall hoarseness severity), Roughness (irregularity of vocal fold vibration), Breathiness (air escape during phonation), Asthenia (weak voice quality), and Strain (effortful phonation). This scale is widely used due to its simplicity and reliability in clinical settings, with the Grade component directly indicating overall dysphonia severity; for example, a Grade of 1 represents mild dysphonia, while 3 indicates severe impairment affecting communication. Inter-rater reliability is moderate to high when performed by experienced speech-language pathologists, making it suitable for initial assessments.[14] Another standardized perceptual tool is the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V), introduced in 2009 and revised as CAPE-Vr in 2025, which employs a 100-mm visual analog scale (VAS) for rating voice attributes including Overall Severity, Roughness, Breathiness, Strain, Pitch, and Loudness. The 2025 revision includes modifications to tasks, stimuli, and rating procedures for improved standardization and fidelity in clinical use. Severity anchors on the VAS range from "mild" to "severe," allowing for finer gradations than ordinal scales; for instance, ratings below 30 mm typically indicate mild dysphonia, while above 70 mm suggest severe levels. The CAPE-V promotes consistent documentation across clinicians and has demonstrated strong validity and reliability in distinguishing normal from disordered voices, particularly when using sustained vowels, connected speech, and reading tasks for evaluation. It is preferred in research and multidisciplinary settings for its comprehensive capture of perceptual deviations.[15][16] Objective severity assessment often utilizes the Dysphonia Severity Index (DSI), a multiparameter acoustic index calculated from four voice measures: maximum phonation time (MPT), highest fundamental frequency (F0-high), lowest intensity (I-low), and jitter percentage. The formula is DSI = 0.13 × MPT + 0.0053 × F0-high - 0.26 × I-low - 1.18 × jitter + 12.4, yielding scores from +5 (perceptually normal voice) to -5 (severe dysphonia). A DSI below 0 indicates clinically significant impairment, with values around -2.5 representing moderate severity; this tool correlates well with perceptual ratings and is valuable for tracking changes post-treatment, though it requires specialized software like Praat for computation. Limitations include its focus on male-normed parameters, prompting adaptations for diverse populations.[17] In practice, the American Academy of Otolaryngology–Head and Neck Surgery (AAO-HNS) clinical practice guideline recommends that clinicians with dysphonia undergo evaluation by a speech-language pathologist, incorporating both perceptual and objective methods to assess severity and associated factors like vocal effort or quality of life impacts via tools such as the Voice Handicap Index. Combining these approaches enhances accuracy, as perceptual ratings provide clinical insight into functional severity, while objective indices offer quantifiable, reproducible data for longitudinal monitoring.Causes
Acute causes
Acute causes of hoarseness, also known as acute dysphonia, generally onset rapidly and resolve within two weeks, often without intervention. The predominant etiology is acute laryngitis, which involves inflammation of the larynx and vocal cords, leading to altered voice quality due to edema and impaired vibration.[18][19] Acute laryngitis is most frequently triggered by viral upper respiratory infections, such as those caused by rhinoviruses, parainfluenza viruses, or influenza viruses, which directly infect the laryngeal mucosa or provoke an inflammatory response. These infections account for the majority of cases, with hoarseness emerging as a key symptom alongside sore throat and cough. Bacterial causes, such as group A Streptococcus, are less common but can occur, particularly in cases of supraglottic involvement or concurrent pharyngitis.[19][4][20] Noninfectious acute causes include vocal overuse or strain, often seen in individuals engaging in prolonged loud speaking, shouting, singing, or cheering, which mechanically irritates the vocal folds and induces temporary edema. Allergic reactions or acute exposure to irritants like smoke or chemicals can also provoke rapid laryngeal swelling, mimicking infectious laryngitis. In rare instances, acute hoarseness may stem from trauma, such as intubation during surgery or foreign body aspiration, disrupting normal vocal cord function.[21][7][4] Acute laryngitis represents approximately 40% of all hoarseness presentations in primary care settings, with viral etiologies comprising nearly all infectious cases. These conditions are typically self-limiting, but persistence beyond two weeks warrants further evaluation to rule out progression to chronic forms.[4][10]Chronic causes
Chronic hoarseness, defined as persistent dysphonia lasting more than three weeks, arises from ongoing laryngeal inflammation, structural alterations, neurological impairments, or systemic conditions that disrupt normal voice production.[11] Unlike acute causes, chronic etiologies often involve cumulative damage from environmental, behavioral, or pathological factors, leading to vocal fold edema, fibrosis, or impaired vibration.[1] A primary category of chronic causes is prolonged laryngeal inflammation, known as chronic laryngitis, which accounts for a significant portion of persistent hoarseness cases. This condition frequently results from repeated exposure to irritants, including chemical fumes, allergens, tobacco smoke, and excessive alcohol consumption, which erode the vocal fold mucosa over time.[11] Ongoing gastroesophageal reflux disease (GERD) contributes substantially, as stomach acid refluxes into the larynx, causing irritation and swelling of the vocal folds, often exacerbated at night or after meals.[1] Chronic sinusitis with postnasal drip can also perpetuate inflammation by allowing mucus to irritate the larynx continuously.[11] Less commonly, persistent infections from bacteria, fungi, or parasites may underlie chronic laryngitis in immunocompromised individuals.[11] Structural changes to the vocal folds represent another major chronic etiology, often stemming from mechanical stress or trauma. Vocal nodules, polyps, and cysts—benign growths—develop due to friction and pressure from habitual voice overuse, such as in singers or teachers, leading to irregular vocal fold closure and hoarse timbre.[1] Vocal fold paralysis, affecting one or both folds, impairs vibration and often results from nerve damage caused by surgical complications, infections, tumors, or trauma; in older adults, it is a leading cause alongside vocal fold bowing, where age-related atrophy weakens fold tension.[1][22] Neurological and systemic disorders further contribute to chronic hoarseness by affecting laryngeal innervation or muscle function. Conditions like Parkinson's disease, stroke, multiple sclerosis, and spasmodic dysphonia induce involuntary spasms or weakness in laryngeal muscles, disrupting phonation. Post-COVID-19 syndrome can also result in chronic hoarseness, potentially through vocal fold paralysis or ongoing laryngeal inflammation, affecting a notable proportion of recovered patients.[23] Hypothyroidism can cause vocal fold edema through mucopolysaccharide deposition, while rheumatoid arthritis leads to cricoarytenoid joint inflammation, both resulting in persistent breathiness or strain.[1][24] Neoplastic causes, though less common, are critical chronic contributors requiring prompt evaluation. Benign tumors or recurrent respiratory papillomatosis—noncancerous growths linked to human papillomavirus—can obstruct vocal fold movement, while malignant laryngeal cancer, often associated with smoking and alcohol, presents with progressive hoarseness in up to 3% of chronic cases.[1][4]Occupational and environmental causes
Occupational causes of hoarseness primarily stem from vocal overuse and strain in professions requiring prolonged or intense voice production, such as teaching, acting, singing, and customer service roles. These demands can lead to vocal fatigue, inflammation of the vocal folds, and conditions like nodules or polyps, with studies showing higher prevalence among educators and healthcare workers who speak for extended periods daily.[25][26][27] Environmental factors contributing to hoarseness include exposure to airborne irritants that inflame the larynx, such as chemical fumes, dust, smoke, and pollutants in industrial or urban settings. Dry or polluted indoor air, often encountered in workplaces like factories or offices with poor ventilation, exacerbates vocal cord irritation by reducing moisture and promoting dehydration of the mucosal lining. Additionally, background noise in noisy environments forces individuals to raise their voice volume, indirectly increasing strain and risk of dysphonia.[28][29]00125-9/fulltext) In occupational contexts, combined exposures—such as vocal demands alongside irritants like secondhand smoke or allergens—heighten risk, with research indicating that healthcare professionals and teachers report hoarseness rates up to 20-30% due to these synergistic effects. Moisture-damaged indoor environments, leading to mold and microbial growth, have also been linked to increased hoarseness frequency through chronic laryngeal irritation.[27][30][31]Pathophysiology
Normal voice production
Normal voice production, or phonation, relies on the coordinated interaction of the respiratory, phonatory, and resonatory systems to generate audible sound. The process begins with airflow generated by the lungs, which provides the power source for voice. As air is exhaled from the lungs through the trachea into the larynx, it encounters the vocal folds—two bands of muscular tissue that extend horizontally across the laryngeal lumen, forming a narrow passageway called the glottis.[32][33] The larynx, located in the anterior neck above the trachea and below the pharynx, houses the vocal folds and is composed of cartilages such as the thyroid (forming the Adam's apple) and cricoid, with the vocal folds attached to the arytenoid cartilages in a V-shaped configuration. During phonation, subglottal air pressure builds below the closed or adducted vocal folds, forcing them apart and allowing a puff of air to escape. This creates a region of low pressure above the folds due to the Bernoulli effect, causing them to snap back together rapidly. The vocal folds then vibrate as air continues to flow, alternately opening and closing the glottis in a self-sustaining oscillation.[34][35][32] This vibration, occurring at frequencies typically ranging from 60 to over 1,000 cycles per second depending on pitch, produces a buzzing sound source at the glottis known as the fundamental frequency. The rate of vibration determines pitch: slower vibrations (around 60–120 Hz for adult males) yield lower pitches, while faster ones (200–250 Hz for adult females) produce higher pitches. The amplitude and tension of the vocal folds, controlled by intrinsic laryngeal muscles innervated by branches of the vagus nerve (recurrent and superior laryngeal nerves), modulate volume and pitch. The sound waves then travel through the pharynx, mouth, and nasal cavities, where they are resonated and articulated into intelligible speech.[33][36][34] In normal function, the vocal folds' multilayered structure—consisting of epithelium, superficial lamina propria (mucosa), vocal ligament, and thyroarytenoid muscle—allows for efficient mucosal wave propagation during vibration, minimizing effort and ensuring clear tone. This myoelastic-aerodynamic theory of phonation underscores how muscle tension (myoelastic) and airflow (aerodynamic) interact to sustain oscillation without excessive strain.[35][37]Mechanisms of hoarseness
Hoarseness, or dysphonia, arises primarily from disruptions in the normal aerodynamic and myoelastic properties of the vocal folds, which impair their ability to vibrate efficiently during phonation. In healthy voice production, the vocal folds approximate and vibrate symmetrically under controlled airflow from the lungs, generating a clear, periodic sound wave. When these processes are altered, the resulting irregular mucosal wave propagation leads to a rough, strained, or breathy vocal quality. Key mechanisms include incomplete glottal closure, alterations in vocal fold mass or stiffness, and imbalances in neuromuscular control, each contributing to turbulent airflow and asymmetric vibration patterns.[38] Incomplete glottal closure is a fundamental mechanism, where the vocal folds fail to meet fully along their length during phonation, allowing excess air to escape and creating a breathy or weak voice. This can occur due to vocal fold paralysis, atrophy, or structural defects, reducing the Bernoulli effect necessary for sustained vibration and leading to increased phonatory effort. For instance, unilateral vocal fold paralysis disrupts adduction, resulting in a glottal gap that causes air turbulence and irregular fold oscillation, often manifesting as persistent hoarseness.[39][40] Alterations in vocal fold mass or tension represent another core mechanism, where lesions such as nodules, polyps, or edema increase tissue bulk or change viscoelastic properties, thereby disrupting the regularity of the mucosal wave. These changes cause phase asymmetries in fold vibration, producing a rough or harsh quality as the sound source becomes aperiodic. Inflammation from acute laryngitis, for example, induces superficial edema that alters the fold's cover-body interaction, reducing vibration amplitude and frequency stability. Similarly, chronic irritation can lead to hyperkeratosis or fibrosis, stiffening the folds and further irregularizing their oscillatory pattern.[4][40] Neuromuscular imbalances also play a critical role, often resulting in hyper- or hypotonicity of the laryngeal muscles that control fold adduction and tension. In functional dysphonia, excessive muscle tension (hyperfunctional state) leads to supraglottic constriction or ventricular phonation, where secondary structures vibrate irregularly instead of the true folds, producing strained hoarseness. Neurologic conditions like spasmodic dysphonia introduce involuntary spasms, causing intermittent glottal closure disruptions and tremorous voice quality. These mechanisms collectively reduce vocal efficiency, increasing subglottal pressure requirements and risking further tissue trauma.[41][42]Diagnosis
In Japan, patients experiencing hoarseness (dysphonia) or difficulty producing voice (声が出にくい) are recommended to consult an otolaryngologist (耳鼻咽喉科, jibiinkoka), the medical specialty that addresses disorders of the ear, nose, throat, and larynx. Otolaryngologists specialize in evaluating vocal cord and laryngeal issues through examinations such as laryngoscopy, which is essential for diagnosing underlying causes and ruling out serious conditions like laryngeal cancer or vocal cord polyps, particularly in persistent or severe cases. Mild hoarseness associated with acute upper respiratory infections, such as colds, may initially be managed by internal medicine physicians, but prolonged symptoms require otolaryngological evaluation.[5][43]History and physical examination
The diagnosis of hoarseness, or dysphonia, begins with a comprehensive history and physical examination to identify potential underlying causes, assess severity, and determine factors that may necessitate expedited further evaluation.[44] This initial assessment helps differentiate between acute and chronic etiologies, such as infection, vocal overuse, gastroesophageal reflux, or malignancy, while guiding decisions on whether immediate laryngoscopy is warranted.[21] In taking the history, clinicians should inquire about the onset and duration of symptoms, distinguishing sudden from gradual development to suggest acute events like viral laryngitis or trauma versus progressive conditions like tumors.[21] Associated symptoms, including pain on speaking or swallowing (odynophagia), difficulty swallowing (dysphagia), ear pain (otalgia), cough, hemoptysis, shortness of breath, or unexplained weight loss, are critical to elicit, as they may indicate complications such as airway obstruction or neoplasm.[21] Risk factors must be explored, including recent upper respiratory infection, smoking or tobacco use, excessive voice use (e.g., in teachers or singers), gastroesophageal reflux disease symptoms (e.g., heartburn, regurgitation), recent endotracheal intubation or head/neck/chest surgery, occupational exposures, and medication history (e.g., inhaled corticosteroids or ACE inhibitors).[7][45] Neurologic symptoms like weakness or tremor, as well as social history including alcohol consumption, should also be documented to uncover contributing elements.[21] The physical examination starts with an auditory-perceptual assessment of the voice, where the clinician listens to the patient's spontaneous speech, sustained vowel production (e.g., /i/), and reading or counting to characterize quality (e.g., breathy, rough), pitch, loudness, and effort.[46] A general examination includes vital signs and evaluation for signs of systemic illness, such as fever or cachexia.[47] The head and neck are inspected and palpated for masses, tenderness, crepitus, or lymphadenopathy, with particular attention to the thyroid, larynx, and hyoid bone; the oral cavity and oropharynx are examined for lesions, edema, or erythema.[21] If feasible, indirect mirror laryngoscopy may be attempted to visualize the larynx, though flexible nasendoscopy is often preferred for better views in primary care settings.[46] Chest auscultation may be performed if respiratory symptoms suggest pulmonary involvement.[47] According to the American Academy of Otolaryngology—Head and Neck Surgery guideline, history and physical examination should specifically identify indicators for expedited laryngeal evaluation, including recent head, neck, or chest surgery; endotracheal intubation; presence of a neck mass; respiratory distress or stridor; tobacco abuse; or professional voice use, as these raise concern for urgent pathology like vocal fold paralysis or cancer.[44][48] In such cases, expedited referral for visualization of the larynx is recommended to avoid delays in diagnosis.[44]Auditory-perceptual evaluation
Auditory-perceptual evaluation is a fundamental component of voice disorder assessment, involving the clinician's subjective listening and rating of voice characteristics to quantify hoarseness or dysphonia. This method relies on the trained ear of speech-language pathologists or otolaryngologists to identify deviations from normal voice quality, such as roughness, breathiness, or strain, which are hallmarks of hoarseness. It provides an initial, non-invasive measure of voice impairment severity and guides further diagnostic and therapeutic decisions.[42] The evaluation typically occurs during standardized vocal tasks to elicit consistent samples for comparison. These include sustaining vowels (e.g., /i/ or /a/ for 3-5 seconds), producing connected speech like reading a passage or counting, and spontaneous conversation to capture natural prosody. Such tasks allow clinicians to assess hoarseness across phonetic contexts, revealing patterns like increased breathiness during prolonged phonation or roughness in running speech. Reliability improves when evaluations are conducted in a quiet environment with high-quality audio recordings.[49] Two widely adopted scales dominate auditory-perceptual evaluation: the GRBAS scale and the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). The GRBAS scale, developed by Hirano, rates five parameters—Grade (overall severity of hoarseness), Roughness (irregularity in vocal fold vibration), Breathiness (air escape through glottal gaps), Asthenia (weakness or lack of power), and Strain (effortful hyperfunction)—on a 4-point ordinal scale from 0 (normal) to 3 (severe). It is particularly valued for its simplicity and has demonstrated high inter- and intra-rater reliability among experienced clinicians, making it suitable for quick clinical assessments of hoarseness.[50][51] In contrast, the CAPE-V employs a 100-mm visual analog scale for more nuanced ratings of Severity (overall dysphonia), Roughness, Breathiness, Strain, Pitch, and Loudness, along with anchors for reference (e.g., "mild" at 30 mm, "moderate" at 60 mm). Additional sections address resonance and visible structures if applicable. Developed through expert consensus, the CAPE-V offers greater sensitivity for tracking changes in hoarseness over time, such as post-therapy improvements, and shows strong validity when correlated with acoustic measures. Its protocol emphasizes anchoring judgments to predefined voice samples for consistency.[49][52] Both scales emphasize that auditory-perceptual judgments are inherently subjective, yet training and calibration enhance agreement, with intra-class correlation coefficients often exceeding 0.70 for trained raters. For hoarseness specifically, these evaluations help differentiate organic causes (e.g., vocal fold lesions causing roughness) from functional ones (e.g., strain from misuse). Limitations include rater bias and reduced reliability for novice evaluators, underscoring the need for interdisciplinary collaboration in diagnosis.[53]Laryngeal imaging
Laryngeal imaging plays a crucial role in the diagnosis of hoarseness by providing direct visualization of the larynx to identify structural abnormalities, lesions, or functional impairments of the vocal folds. It is typically recommended after initial history and physical examination, particularly for persistent hoarseness lasting more than two weeks without an obvious benign cause, or sooner in cases with risk factors such as smoking, excessive alcohol use, or symptoms like dysphagia or hemoptysis.[10] The primary goal is to differentiate between benign conditions like vocal nodules or polyps and more serious pathologies such as tumors or vocal cord paralysis, guiding appropriate management.[54] Endoscopic techniques, including laryngoscopy, form the cornerstone of laryngeal imaging due to their ability to offer real-time assessment of vocal fold mobility and mucosal integrity. Indirect laryngoscopy, performed using a laryngeal mirror, allows for non-invasive examination in an office setting but may be limited by patient tolerance or anatomical obstructions. Flexible fiberoptic laryngoscopy, inserted transnasally, provides superior visualization of the glottis and supraglottic structures, enabling evaluation of subtle asymmetries or lesions, and is routinely used in otolaryngology clinics for initial assessment. It achieves a diagnostic accuracy of approximately 68% in cases of dysphonia.[10][55] Direct laryngoscopy, often under general anesthesia, is reserved for more detailed intraoperative evaluation or biopsy when office-based methods are inadequate. Videostroboscopy enhances standard laryngoscopy by incorporating stroboscopic light to capture high-speed images of vocal fold vibration, revealing subtle mucosal wave disruptions or asymmetries not visible with continuous illumination. It is particularly valuable for diagnosing benign vocal fold pathologies such as nodules, cysts, or early carcinoma, where vibratory abnormalities indicate functional deficits contributing to hoarseness. Videostroboscopy achieves approximately 68% diagnostic accuracy. It aids in the differentiation of organic from functional voice disorders and monitoring treatment response through serial examinations.[56][55] Limitations include the need for patient cooperation to produce phonation and potential inadequacy in cases of severe edema or poor glottic closure.[57] Radiologic imaging, such as computed tomography (CT) and magnetic resonance imaging (MRI), is indicated when endoscopic findings suggest deeper invasion, neurologic involvement, or when laryngoscopy is inconclusive, rather than as a first-line tool. Contrast-enhanced CT is preferred for evaluating laryngeal tumors, cartilage invasion, or vocal cord paralysis (VCP) etiology, offering detailed assessment of soft tissue and nodal involvement with high sensitivity for malignancy.[58] It excels in identifying recurrent laryngeal nerve lesions or extrinsic masses causing hoarseness, though it involves radiation exposure and may overestimate subtle mucosal changes.[54] MRI provides superior soft tissue contrast for assessing tumor extent, perineural spread, or inflammatory conditions like laryngitis, without radiation, but is more time-consuming and less accessible. Laryngeal ultrasound, an emerging non-invasive option, can detect vocal fold edema or masses but lacks the resolution for detailed vibratory analysis and is not routinely recommended over endoscopy. Overall, imaging selection depends on clinical suspicion, with guidelines emphasizing endoscopic primacy to avoid unnecessary radiation in benign cases.[10]Acoustic and aerodynamic measures
Acoustic measures provide an objective analysis of the voice signal to identify perturbations associated with hoarseness, such as irregular vibration of the vocal folds leading to rough or breathy quality. These non-invasive assessments involve recording sustained vowel phonations (e.g., /a/ at comfortable pitch and loudness) or running speech using a calibrated microphone in a quiet environment, followed by computerized analysis. They quantify frequency, amplitude, and spectral characteristics that correlate with perceptual ratings of dysphonia severity. Emerging applications of machine learning on acoustic data show promise for automated hoarseness severity assessment and disorder classification as of 2025.[59][60][61] Fundamental parameters include fundamental frequency (F0), which measures average pitch stability, typically showing greater variability in hoarseness due to inconsistent vocal fold oscillation. Jitter assesses cycle-to-cycle frequency perturbations, with values exceeding 1% indicating irregularity common in dysphonic voices. Shimmer evaluates amplitude variations, often elevated above 3-5% in hoarse voices, reflecting turbulent airflow. The harmonics-to-noise ratio (HNR) gauges the balance between periodic harmonics and aperiodic noise, where reduced HNR (below 12-15 dB) signifies increased noise from incomplete glottal closure. These metrics are sensitive to early voice changes and aid in differentiating organic from functional causes of hoarseness.[62][63] Advanced acoustic indices enhance diagnostic precision by integrating multiple features. Cepstral peak prominence smoothed (CPPS) analyzes the cepstral domain to capture overall periodicity, demonstrating high reliability (intraclass correlation >0.9) and sensitivity to dysphonia across tasks like vowel prolongation and sentence reading. The acoustic voice quality index (AVQI) combines jitter, shimmer, HNR, and spectral tilt into a composite score, with values above 2.0 signaling significant hoarseness; it outperforms individual parameters in classifying voice quality and tracking treatment outcomes. These measures are recommended in consensus protocols for standardized clinical evaluation.[64][65] Aerodynamic measures assess the physiological efficiency of phonation by quantifying airflow, pressure, and volume dynamics between the respiratory and laryngeal systems. Performed with instruments like a face mask pneumotachograph for airflow and pressure transducers for subglottal estimation, they reveal inefficiencies such as air escape or excessive effort in hoarse voices. Tasks include sustained phonation, syllable strings (e.g., /pɑ/), and vital capacity maneuvers, providing data on glottal resistance and closure.[66] Core parameters encompass maximum phonation time (MPT), the longest duration of sustained /i/ or /a/ on residual air volume, typically reduced below 15-20 seconds in dysphonia due to glottal incompetence or fatigue. Mean airflow rate (MFR) during phonation, measured in liters per second, is often elevated (>0.2 L/s) in breathy hoarseness, indicating poor adduction. Subglottal pressure (Psub), estimated via intraoral pressure during lip-trilled or syllable tasks, rises in compensatory hyperfunction but falls in paralytic conditions. The phonation quotient (PQ), calculated as vital capacity divided by MPT, highlights laryngeal economy; values below 6-8 mL/s suggest inefficiency. These metrics correlate with perceptual hoarseness grades and inform therapeutic interventions.[67][68]| Measure Type | Key Parameters | Typical Task | Significance in Hoarseness |
|---|---|---|---|
| Acoustic | Jitter, Shimmer, HNR | Sustained /a/ (3 trials) | Quantify irregularity and noise; elevated values indicate rough/breathy quality[62] |
| Acoustic | CPPS, AVQI | Vowel + sentence reading | Composite quality scores; sensitive to severity and change[64] |
| Aerodynamic | MPT, MFR | Sustained /i/ at comfortable level | Assess endurance and air conservation; reduced MPT signals inefficiency[67] |
| Aerodynamic | Psub, PQ | /pɑ/ repetitions or vital capacity | Evaluate pressure-flow balance; altered in glottal disorders[68] |
