A Meta-Analysis: Acoustic Measurement of Roughness and Breathiness
Journal of Speech, Language & Hearing Research
Barsties v. Latoszek, B., Maryn, Y., et al. (2018).
Journal of Speech, Language & Hearing Research, 61(2), 298-323.
This meta-analysis investigates the correlations between auditory-perceptual and acoustic measures of roughness and breathiness in sustained vowels and continuous speech.
Not stated
1960 to 2013
Published and unpublished studies (not further specified)
33 (roughness); 34 (breathiness)
<p>Acoustic measures with higher correlation coefficients (weighted mean<em> r</em> ≥ .60) with auditory perceptual measures were considered to be more valid; that is, they better "quantify/objectify the subjective/perceptual concept" (p. 2) of roughness or breathiness.</p>
<p>For roughness, this included the following as the most promising measures:</p>
<ul>
<li>Six spectral noise level (SNL) parameters:
<ul>
<li>100-5100 Hz (r = 0.91); </li>
<li>100-8100 Hz (r = 0.90); </li>
<li>100-2600 Hz (r = 0.89); </li>
<li>100-3000 Hz (r = 0.85); </li>
<li>2600-5100 Hz ( r= 0.82); and </li>
<li>5100-8000 Hz (r = 0.75);</li>
</ul>
</li>
<li>Second harmonic amplitude (H2A; r = 0.73); </li>
<li>Pearson <em>r</em> at autocorrelation peak (RPK; r = 0.71);</li>
<li>Normalized noise energy (NNE; r = 0.68*);</li>
<li>Jitter Factor (r = 0.67);</li>
<li>Glottal-to-noise excitation ratio (GNE) 1000 Hz (r = 0.65);</li>
<li>Amplitude variability index (AVI; r = 0.63);</li>
<li>Energy perturbation quotient (EPQ; r = 0.62*); and</li>
<li>Smoothed pitch perturbation quotient (sPPQ; r = 0.61).</li>
</ul>
For breathiness, this included:
<ul>
<li>Natural log of period standard deviation (LNPSD; r = 0.76);</li>
<li>GNE 3000 Hz (r = 0.71); </li>
<li>Differences between the amplitude of Formant 0 and Formant 1 (F0-F1 [L0-L1]; r = 0.72);</li>
<li>Relative energy level of high frequency noise (Hfno; r = 0.70); </li>
<li>Cepstral peak prominence (CPP; r = 0.66*);</li>
<li>Differences between the amplitude of the first and second harmonics (H1-H2; r = 0.66);</li>
<li>Smoothed cepstral peak prominence (CPPs; r = 0.64*); </li>
<li>Smoothed pitch perturbation quotient (sPPQ; r = 0.64);</li>
<li>Harmonic-to-noise ratio from Dejonckere & Lebacq (1987; HNR Dejonckere; r = 0.63)</li>
<li>Amplitude perturbation quotient-5 (APQ5; r = 0.62);</li>
<li>Normalized noise energy (NNE) 1000-5000 Hz (r = 0.61*); and</li>
<li>Smoothed amplitude perturbation quotient (sAPQ; r = 0.60). </li>
</ul>
Asterisks represent heterogeneity in results.