Description

Our data consists of recordings of a single male native speaker of English reading 1899 sentences, for a total of 2.5 hours of speech audio. A subset of 720 sentences is from the Harvard set, which was read twice, once at a normal pace and once at a fast pace. The remaining sentences are a subset of the TIMIT dataset.
Acoustics and articulatory movement were recorded using a Carstens AG501 EMA device. Passive transducers were attached to the speech articulators using medical-grade cyanoacrylate glue. Three sensors were placed midsagittally on the tongue surface: one on the tongue dorsum (TD), one on the tongue blade (TB), and one behind the tongue tip (TT). Two more sensors were placed parasagittally to the left (BL) and right (BR) of the tongue blade. Three additional sensors were placed on the lips: two were attached midsagittally to the upper (UL) and lower (LL) lips at the vermilion border, and one to the right corner (LC) of the lips. Finally, two sensors were placed on the jaw, on the gingiva below the medial incisors (LI) and between the canine and first premolar (LJ).

Sensor Placement

Landmark	Position
TD	Tongue Dorsum, Midsagittal
TB	Tongue Blade, Midsagittal
BR	Tongue Blade, Right Parasagittal
BL	Tongue Blade, Left Parasagittal
TT	Tongue Tip, Midsagittal
UL	Upper Lip, Midsagittal
LC	Lip Corner, Right Commissure, Parasagittal
LL	Lower Lip, Midsagittal
LI	Jaw, Incisors, Midsagittal
LJ	Jaw, 1st Premolar and Canine, Parasagittal
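
For scripted analysis, the sensor table above can be written as a small lookup structure. The following is a minimal sketch in Python; the dictionary layout, the name SENSORS, and the helper sensors_in_plane are illustrative assumptions and not part of the dataset or its tooling. The labels and placements are transcribed from the description above.

```python
# Sensor labels and placements transcribed from the sensor placement table.
# SENSORS and sensors_in_plane are hypothetical names chosen for this sketch;
# the dataset itself does not ship this structure.
SENSORS = {
    "TD": ("Tongue Dorsum", "midsagittal"),
    "TB": ("Tongue Blade", "midsagittal"),
    "BR": ("Tongue Blade, Right", "parasagittal"),
    "BL": ("Tongue Blade, Left", "parasagittal"),
    "TT": ("Tongue Tip", "midsagittal"),
    "UL": ("Upper Lip", "midsagittal"),
    "LC": ("Right Lip Corner", "parasagittal"),
    "LL": ("Lower Lip", "midsagittal"),
    "LI": ("Jaw, Incisors", "midsagittal"),
    "LJ": ("Jaw, 1st Premolar and Canine", "parasagittal"),
}


def sensors_in_plane(plane):
    """Return sensor labels placed in the given plane, in table order."""
    return [label for label, (_, p) in SENSORS.items() if p == plane]


if __name__ == "__main__":
    # List the sensors that lie on the midline of the vocal tract.
    print(sensors_in_plane("midsagittal"))
```

Grouping by plane is convenient because the midsagittal sensors are the ones usually plotted in a two-dimensional vocal tract view, while the parasagittal sensors carry lateral information.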