Return to the archive index

TIMIT speech corpora available

From: Vito Miliano <>
Date: Fri, 17 Jan 2003 16:38:59 -0600

wear-hard--

I picked up a copy of the TIMIT Acoustic-Phonetic Continuous Speech
Corpus for a work-related project, and since it's okay for me to
redistribute it, I thought I'd offer it to list members.  If you're
training speech recognition systems, probably no better place to
start.  :)

"The TIMIT corpus of read speech is designed to provide speech data
for acoustic-phonetic studies and for the development and evaluation
of automatic speech recognition systems.  TIMIT contains broadband
recordings of 630 speakers of 8 major dialects of American English,
each reading 10 phonetically rich sentences.  The TIMIT corpus
includes time-aligned orthographic, phonetic and word transcriptions
as well as a 16-bit, 16kHz speech waveform file for each utterance."

It's only like ~US$100 from NTIS, but if you've got the bandwidth to
rsync ~613MB, I'll give you a URL for it.  Just remember that although
it's redistributable, it's still copyrighted, so you need to cite its
use in papers, etc.

Thanks,
--Vito

--
Subscription/unsubscription/info requests: send e-mail with subject of
"subscribe", "unsubscribe", or "info" to 
Wear-Hard Mailing List Archive (searchable): http://wearables.blu.org
Please, *PLEASE* don't subscribe through a forward/expander/false domain

+Previous Message in Thread | Next Message in Thread

From Wear-Hard Mailing list Archive (WH)
Maintained by R. Paul McCarty

Archive created with babymail