Yes/No v1.2

Structure | Protocol | Versions | Misc



General Description
The Yes/No Corpus is a collection of answers to yes/no questions from other CSLU corpora.

Recording Details
The data in this corpus were collected over telephone lines. They were collected from both analog and digital phone lines.

The analog data were recorded using a Gradient Technologies analog-to- digital conversion box. These files were recorded as 16-bit, 8 khz and stored in a linear format.

The digital data were recorded with the CSLU T1 digital data collection system. These files were sampled at 8 khz 8-bit and stored as ulaw files.

All of the data use the RIFF standard file format. This file format is 16-bit linearly encoded.

Directory Structure
There are five top-level directories in this distribution:
  • docs -- the docs directory contains assorted documentation files
  • labels -- the labels directory contains .phn files containing time align phoneme transcription.
  • misc -- the misc directory contains scripts and archival information
  • speech -- the speech directory contains the .wav files containing speech data
  • trans -- the trans directory contains .txt files containing transcriptions of the corresponding .wav files in the speech directory


The speech and trans directories contain the data files, which have the following name structure:
YN-xxxxx.yyyyy.zzz
xxxxx = call number
y = utterance code
zzz = file extension (txt/wav)

For example:
YN-10023.hispanic.wav


This utterance is from caller 10023 and contains the answer to the question "Are you of Hispanic origin?".

A corresponding transcription can be found in the file:
YN-10023.hispanic.txt


These audio and text files are subdivided into directories based on their call number divided by 100 (with remainders dropped). So, these files would be found in /yesno/speech/100 and /yesno/txt/100, respectively.

Transcriptions
The text transcriptions were performed according to the non time-aligned word-level conventions described in the CSLU Labeling Guide. This document is available at the CSLU web site. Some of the files were machine-transcribed using a yes/no recognizer. These files were spot-checked, and their accuracy was comparable to those transcribed by humans.