9 July 2004Regrettably our costs associated with producing the corpora on CD for non-commercial use were not adequately covered by the previous price. Effective immediately, the cost per corpus is now $300 US.
16 June 2003CSLU Center memberships, commercial licensing and not-for-profit licensing agreements have been updated.
22 September 2002Release of Natcell-2.3 corpus with time align phoneme for available orthographical word transcriptions and final release of Stories-1.2 with time align phoneme transcriptions (force time align) for all files with orthographical word transcriptions.
23 August 2002Final releases of Alphadigits, CWP, Names, Numbers, PortCell, SRX4, SSW, YesNo corpora, including time align phoneme transcriptions (force time align). Release of Apple Words and Phrases v1.3 with time align phoneme transcriptions (force time align) for all files with orthographical word transcriptions.
19 August 2002Final release of Isolet corpus, including time align phoneme transcriptions.
3 June 2002Several corpora got updated with new files and all corpora went through consistency check. We are currently verifying all corpora for accuracy. Next step will be working on force alignment for all English corpora with existing orthographic transcription so we can provide automatic time-align phoneme transcription.
25 February 2002A new corpus The Spoltech Brazilian Portuguese was released and is available now.
14 February 2002Note, that an license agreement must be signed by an authorized representative of the organization, whether a not for-profit organization or a university.
7 February 2002A new corpus VOICES was released and is available for commercial via a special licensing agreement, not as part of the standard membership agreement. The corpus consists of 12 speakers, 50 phonetically rich sentences per speaker. Recording procedure involved a mimicking.
28 January 2002CSLU released National Cellular v2.2. Corpus contains cellular telephone speech from 2337 speakers from locations throughout the Unined States from which 1996 speakers are transcribed.
7 December 2001We are pleased to announce that the final version of Kid's Speech Corpus was released. This corpus contains of spontaneous and scripted utterances from kids grades k to 10.
15 October 2000Regrettably our costs associated with producing the corpora on CD were not adequately covered by the previous price. Effective immediately, the cost per corpus is now $30 US.
15 October 2000All outstanding corpora orders have been filled and shipped. Watch the skies.
1 October 2000We are please to announce the first release of the Kid's Speech Corpus. This corpus contains about 1000 2 minute spontaneous conversations with kids grades 3-10.
15 September 2000CSLU hosts over 50 speech technology professionals from more than 20 companies. Yes, we have finally finished the comprehensive review and CD conversion of all our corpora. We are burning CD's now and catching up on orders. Thanks for your patience.
24 July 2000In order to provide easier access to our corpora, CSLU is transferring all of its corpora to CD format. Before burning the corpora onto CD, we will be performing the following tasks:
Our goal is to have the corpora updated and burned to CD as soon as possible; however, this will entail a delay of three to four weeks. Since we do not wish to further delay orders, we can send the corpora in the original DAT format. We will be updating this web site each time a new corpus has been updated and burned to CD. Current orders are entitled to the upgraded version of a corpora on CD when it becomes available.
To receive the updated corpora, email firstname.lastname@example.org. BE SURE TO INCLUDE THE NAME OF THE ORIGINAL CORPORA YOU ORDERED AS WELL AS YOUR ORIGINAL INVOICE NUMBER.
Please accept our apologies for these delays while we improve the quality of our corpora for our fellow researchers.
May 30, 2000Our corpora is moving to CDROM. In the past we've distributed corpora on DAT. The process is expected to take about a week. All corpus shipments will be delayed until the transition is complete. We'll make an announcement when the job is finished.
May 27, 2000Welcome four new phonetic transcribers to our team! See People to meet Flink, Lisa, Trina, and Kay.
May 11, 2000We are currently compiling a cellular speech corpus and a corpus for research into speaker recognition, as part of an initiative in human language technology supported by the National Science Foundation and DARPA.
JOB ANNOUNCEMENT: Phonetic Transcriber We are currently hiring phonetic transcribers. The OGI Employment page has the official announcement.
Center members are entitled to commercial use of our corpora as part of the membership agreement. Non-members may request our corpora for non-commercial research purposes. We have instituted a $20 US per corpus fee to cover our costs of shipping, media, and handling.