The Formants of Monophthong Vowels in Standard Southern British English Pronunciation
David Deterding, Universiti Brunei Darussalam
Journal of the International Phonetic Association (1997) 27: 47–55
This file contains some details of the measurements of the vowels, as reported in the JIPA paper shown above. Please feel free to use these measurements in any way you find useful.
This directory contains 10 files in Excel (XL) format. Each file contains the measurements of the first 3 formants of the 11 monophthong vowels of one speaker from the MARSEC database. Within each file, the measurements for each monophthong are held in a separate sheet (so there are 11 sheets in each file).
Speakers
The measurements are from 5 male and 5 female BBC broadcasters. Each speaker
is the one whose voice is heard at the start of the first file in one of the
following MARSEC directories:
ASIG female
BSIG male
CSIG male
DSIG female
ESIG female
FSIG female
GSIG female
HSIG male
JSIG male
KSIG male
The speaker from directory ASIG is referred to as A, from BSIG as B, etc.
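For anyone scripting against these files, the directory-to-speaker mapping above can be written down as a small lookup table. This is only a convenience sketch in Python: the names SPEAKER_SEX and speaker_label are made up for illustration and do not appear in MARSEC or in the spreadsheets.

```python
# Sex of each speaker, keyed by MARSEC directory (copied from the list above).
SPEAKER_SEX = {
    "ASIG": "female",
    "BSIG": "male",
    "CSIG": "male",
    "DSIG": "female",
    "ESIG": "female",
    "FSIG": "female",
    "GSIG": "female",
    "HSIG": "male",
    "JSIG": "male",
    "KSIG": "male",
}


def speaker_label(directory):
    """Return the single-letter speaker label (A, B, ...) for a directory."""
    if directory not in SPEAKER_SEX:
        raise KeyError(f"unknown MARSEC directory: {directory}")
    # The label is simply the first letter of the directory name.
    return directory[0]
```

Note that there is no ISIG directory in the list, so the labels run A to H and then J and K.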
Selection of words
In most cases, there are many instances of each vowel that could be selected for
measurement. Wherever possible, vowels following /j/, /w/, and /r/ or preceding
/l/ are avoided, to minimize the effects of coarticulation. For some vowels,
particularly /u:/ and /U/, it is not always possible to avoid such environments.
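The selection preference described above can be sketched as a simple check on the sounds either side of the vowel. This is a hypothetical helper, not the procedure actually used (the tokens were chosen by hand); representing phones as plain strings such as "j" and "l" is an assumption made for illustration.

```python
# Contexts avoided where possible, as stated in the text above.
AVOID_BEFORE_VOWEL = {"j", "w", "r"}  # the vowel follows one of these
AVOID_AFTER_VOWEL = {"l"}             # the vowel precedes this


def is_preferred_context(previous_phone, next_phone):
    """Return True if the vowel is in neither of the avoided environments."""
    follows_glide = previous_phone in AVOID_BEFORE_VOWEL
    precedes_l = next_phone in AVOID_AFTER_VOWEL
    return not (follows_glide or precedes_l)
```

For example, the vowel of 'book' (between "b" and "k") passes the check, while the vowels of 'wood' (after "w") and 'full' (before "l") do not. For /u:/ and /U/, tokens that fail the check may still have to be accepted.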
At least 5 measurements are made for each vowel of each speaker, with the
exception of the /U/ of two speakers: for speakers A and E, only 2 clear
instances of this vowel could be found. (Maybe the BBC should be encouraged to
broadcast more programmes on 'Good Books on Cooking'!)
Methods of measurement
The measurements were made from digital spectrograms with overlaid LPC
formant tracks, using the CSL software (Version 5) from Kay Elemetrics Corp. A
pre-emphasis coefficient of 0.9 was used, and a 16th order filter for the linear
prediction.
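As a rough illustration of what a pre-emphasis coefficient of 0.9 means, the sketch below applies the conventional first-order filter y[n] = x[n] - 0.9 * x[n-1]. The exact filter implemented inside CSL is not documented here, so the form of the filter, the function name, and the handling of the first sample are all assumptions.

```python
def pre_emphasize(samples, coefficient=0.9):
    """Apply a first-order pre-emphasis filter to a list of samples.

    Assumed form: y[n] = x[n] - coefficient * x[n-1], with y[0] = x[0].
    """
    if not samples:
        return []
    output = [samples[0]]  # no previous sample exists for the first one
    for previous, current in zip(samples, samples[1:]):
        output.append(current - coefficient * previous)
    return output
```

The filter attenuates slowly varying (low-frequency) content and so boosts the higher frequencies relative to it, which is the usual reason for applying it before linear prediction.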
The Application Notes of the CSL documentation (page 384) recommend 2 LPC
coefficients for each expected formant, with an extra 2 coefficients for the DC
component. With the default sampling rate of 10 kHz, where one might expect 5
formants up to the Nyquist frequency of 5 kHz, they therefore recommend an LPC
order of 12 (but maybe less for female speech).
The MARSEC data is sampled at 16 kHz, and one might expect up to 8 formants
below the Nyquist frequency of 8 kHz. The default LPC order of 12 is therefore
clearly insufficient, which is why 16th order was used for these measurements.
In fact, for some speakers, 18th or even 20th order might be tried, particularly
when measurement of the first formant is problematic for open vowels such as /ae/.
However, 16th order was used for all these data, to ensure consistency.
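The arithmetic in the two paragraphs above can be sketched as follows. The function names are made up for illustration, and the figure of one expected formant per 1000 Hz is an assumption inferred from the numbers quoted above (5 formants below 5 kHz, 8 below 8 kHz).

```python
def expected_formants(sample_rate_hz, hz_per_formant=1000):
    """Number of formants expected below the Nyquist frequency.

    Assumes roughly one formant per 1000 Hz, as implied by the text.
    """
    nyquist_hz = sample_rate_hz / 2
    return int(nyquist_hz // hz_per_formant)


def rule_of_thumb_lpc_order(sample_rate_hz):
    """2 coefficients per expected formant, plus 2 for the DC component."""
    return 2 * expected_formants(sample_rate_hz) + 2
```

With these assumptions, rule_of_thumb_lpc_order(10000) gives 12, matching the CSL default, and rule_of_thumb_lpc_order(16000) gives 18. The measurements reported here used 16th order throughout, which is above the default but slightly below what this rule of thumb would suggest.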
It should be emphasized that measurement is occasionally not possible for a
given vowel token. When there are clear problems, for instance when there is no
formant track anywhere near the expected frequency, or when the measured value
is clearly spurious, such tokens are ignored and others are found. It is also
questionable whether each and every vowel measurement is indicative of the
quality of that token. Attempts have been made to provide 10 reasonably
consistent measurements of most vowels for each speaker, in the hope that the
average values represent a reliable measure for the speaker.
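A minimal sketch of the averaging step, for one vowel of one speaker, might look like this. The function name and the representation of each token as an (F1, F2, F3) tuple in Hz are assumptions; the layout of the actual spreadsheets may differ.

```python
def mean_formants(tokens):
    """Average F1, F2 and F3 over a list of (F1, F2, F3) tuples in Hz."""
    if not tokens:
        raise ValueError("at least one measured token is required")
    count = len(tokens)
    # zip(*tokens) groups all F1 values, all F2 values, and all F3 values.
    return tuple(sum(values) / count for values in zip(*tokens))


# Illustrative numbers only; these are NOT taken from the data files.
example_tokens = [(300, 2300, 3000), (320, 2200, 2900)]
example_mean = mean_formants(example_tokens)
```

Tokens judged spurious would be left out of the list before averaging, in line with the procedure described above.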
If you have any comments, suggestions, or criticisms, please contact me at
dhdeter@gmail.com