Goal of Tutorial 2:
In this tutorial, you will explore how to measure
Voice Onset Time (VOT), one of the main acoustic correlates of
plosive voicing. Languages differ in how they implement voicing
distinctions, so you will be measuring both English and French
consonants in order to compare them. VOT is typically measured in
seconds or milliseconds.
You’ll also be learning how to measure the fundamental
frequency of vowels, also known as F0. This is the number
of times per second (measured in Hz) that the overall waveform cycle
repeats, and it is perceived auditorily as pitch.
INSTRUCTIONS
Part 1: Measuring VOT
Download Tutorial2_measurements.xlsx from eClass
and open it in Excel.
Download Tutorial2_Eng_stop_a.wav from eClass
and open the sound in Praat. Select it in the Objects Window and click
View and Edit.
Take a listen to the syllables spoken in the file. You should hear
“pa ba ta da ka ga”.
- Zoom in so that you can clearly see the transition between the stop
and vowel.
- Measure the time between the end of the stop closure ( = the
beginning of the release burst) and the onset of voicing ( = the onset
of regular pitch pulses in the waveform). This is the voice onset time
or VOT.
- If the onset of voicing follows the release of the
stop closure, then VOT is calculated as positive; stops
with positive VOT are termed voiceless.
- VOTs of 0ms to about 20ms or 30ms correspond to voiceless
unaspirated stops.
- VOTs above 20-30ms correspond to voiceless aspirated
stops.
- If the onset of voicing precedes the release of the
stop closure, then VOT is calculated as negative; stops
with negative VOT are termed prevoiced or more
generally, voiced.
Positive and negative VOT measurements
TIP:
Praat reports time values in seconds, but in phonetic analysis we report
in milliseconds, so the values need to be converted. Once you have
converted to milliseconds, you do not need to report any decimal
places.
You might also want to change the dynamic range to
50 dB from its default setting of 70 dB in
Spectrogram > Spectrogram settings. This helps to
remove background noise that might otherwise look like a voicing
bar.
- Download Tutorial2_Fr_stop_a.wav and open the sound
in Praat. Select it in the Objects Window and click View and
Edit. Take a listen to the syllables spoken in the file. You should
hear “pa ba ta da ka ga”. Repeat the steps as above and record your VOT
measurements (positive or negative, in ms) in the Excel sheet.
Consider:
- How does VOT differ between voiced vs. voiceless stops?
- How do English and French seem to differ in terms of how they use
VOT as a cue to voiced vs. voiceless stops?
- How would we transcribe each of these stops in IPA narrow
transcription?
- Now that you’ve explored some sounds up close, can you think of any
potential issues with the IPA’s goal to have a one-to-one mapping
between symbol and sound?
Part 2: Measuring F0
Now, measure the pitch (F0) in each of the vowels in the
syllables. As you do so, note down your measurements in the Excel
spreadsheet. As with ms, you can usually round your Hz measurements to
the nearest whole number unit – no decimal points necessary.
Click Pitch in the View & Edit window. If
Show Pitch is not selected, click it so that a
checkmark appears next to it. You should now see a blue line
superimposed on your spectrogram.
We now need to adjust the pitch range to see the pitch contor
clearly. Click Pitch > Pitch settings…
- Every voice is going to have a different ideal pitch range to enter
here.
- The voice of the male speaker of Eng_stop_a.wav
seems to do well with a pitch tracking range of 75 Hz to 200 Hz.
- The female speaker of Fr_stop_a.wav seems to do
better at 100 Hz to 300 Hz.
- Also ensure that your ‘analysis method’ in Pitch settings is set to
autocorrelation and the Drawing method is automatic to get
the correct display for the pitch contours.
- Find the midpoint of each vowel. Remember that one way to do this is
to drag your cursor so that you have selected the vowel from its start
to its endpoint, then click Time > Move cursor to…
and click OK.
Note that creaky voice is a natural part of many, if not most,
speakers’ vocal performance. The irregular vocal fold phonation may
wreak havoc with F0 measurements and can make it hard to measure exactly
where a vowel ends. If this is the case for some vowels, you can find a
point slightly before the midpoint, where there the blue pitch line is
accurately tracking pitch.
- Look at the right side of the spectrogram. You will see here the F0
measurement at the moment you have selected as the midpoint.
On the right side of the spectrogram is the F0
measurement at the selected moment
TIP:
The fundamental frequency value is always displayed on the
right side of the window in dark red text (or, on some older
versions of Praat, blue text). The bright red text at the left
is the spectrogram frequency at the vertical point selected. These are
two very different things – the value on the right corresponds
to a reading along the blue line, which is visualized within the scale
indicated by the blue text above and below the F0 reading. The value on
the left is based on a scale indicated by the black text above
and below it. Be careful not to get these confused!
- After you record the F0 value from the right side of the screen in
your Excel spreadsheet, confirm the pitch measurement manually.
- Zoom in to a small section of the waveform around the midpoint of
the vowel so that 2 or 3 full cycles are showing.
- Measure the period by highlighting one complete cycle, going
from zero-crossing to zero-crossing, and noting the time associated with
it (in the panel above the waveform).
- To get the frequency f from the period
T, remeber the formula: f=1÷T. Or if
you’re zoomed in far enough, the panel above the waveform will include
both the period and the frequency (in parentheses), so you don’t have to
do any calculations.
Selecting one cycle will let you measure the F0
manually as a sanity check on Praat’s pitch tracking algorithm
Consider:
- Are the words you measured in this tutorial uttered with level,
rising, or falling intonation? How do can we tell from the blue pitch
track?
- Why do these two speakers have different mean F0 measurements? Is it
a property of the difference between English and French (like VOT) or
something else?
- Why does creaky voice (also known as glottalisation) appear
to be problematic for measuring fundamental frequency?