联系客服
客服二维码

联系客服获取更多资料

微信号:LingLab1

客服电话:010-82185409

意见反馈
关注我们
关注公众号

关注公众号

linglab语言实验室

回到顶部
英文语料库--LibriSpeech ASR corpu

1616 阅读 2020-07-10 14:08:24 上传 0KB

该数据集是包含大约1000小时的英语语音的大型语料库。这些数据来自LibriVox项目的有声读物。它已被分割并正确对齐,如果你正在寻找一个起点,请查看已准备好的声学模型,这些模型在kaldi-asr.org和语言模型上进行了训练,适合评估。

LibriSpeech ASR corpus

Identifier: SLR12

Summary: Large-scale (1000 hours) corpus of read English speech

Category: Speech

License: CC BY 4.0

Downloads (use a mirror closer to you):
dev-clean.tar.gz [337M]   (development set, "clean" speech )   Mirrors: [China] 
dev-other.tar.gz [314M]   (development set, "other", more challenging, speech )   Mirrors: [China] 
test-clean.tar.gz [346M]   (test set, "clean" speech )   Mirrors: [China] 
test-other.tar.gz [328M]   (test set, "other" speech )   Mirrors: [China] 
train-clean-100.tar.gz [6.3G]   (training set of 100 hours "clean" speech )   Mirrors: [China] 
train-clean-360.tar.gz [23G]   (training set of 360 hours "clean" speech )   Mirrors: [China] 
train-other-500.tar.gz [30G]   (training set of 500 hours "other" speech )   Mirrors: [China] 
intro-disclaimers.tar.gz [695M]   (extracted LibriVox announcements for some of the speakers )   Mirrors: [China] 
original-mp3.tar.gz [87G]   (LibriVox mp3 files, from which corpus' audio was extracted )   Mirrors: [China] 
original-books.tar.gz [297M]   (Project Gutenberg texts, against which the audio in the corpus was aligned )   Mirrors: [China] 
raw-metadata.tar.gz [33M]   (Some extra meta-data produced during the creation of the corpus )   Mirrors: [China] 
md5sum.txt [600 bytes]   (MD5 checksums for the archive files )   Mirrors: [China] 

About this resource:

LibriSpeech is a corpus of approximately 1000 hours of 16kHz read English speech, prepared by Vassil Panayotov with the assistance of Daniel Povey. The data is derived from read audiobooks from the LibriVox project, and has been carefully segmented and aligned.

Acoustic models, trained on this data set, are available at kaldi-asr.org and language models, suitable for evaluation can be found at http://www.openslr.org/11/.

For more information, see the paper "LibriSpeech: an ASR corpus based on public domain audio books", Vassil Panayotov, Guoguo Chen, Daniel Povey and Sanjeev Khudanpur, ICASSP 2015 (submitted) (pdf)


点赞
收藏
表情
图片
附件