联系客服
客服二维码

联系客服获取更多资料

微信号:LingLab1

客服电话:010-82185409

意见反馈
关注我们
关注公众号

关注公众号

linglab语言实验室

回到顶部
英文语料库--TED-LIUM

1337 阅读 2020-07-10 14:15:59 上传 0KB

TED-LIUM 是来自 TED 讲座的语音识别训练语料库,它带有转录,采样频率为 16kHz 的音频片段, 合计包含大约 118 个小时的演讲。 该数据集由缅因大学计算机科学实验室(LIUM)于 2012 年创建。

TED-LIUM

Identifier: SLR7

Summary: English speech recognition training corpus from TED talks, created by Laboratoire d’Informatique de l’Université du Maine (LIUM) (mirrored here)

Category: Speech

License: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives).

Download: TEDLIUM_release1.tar.gz [21G]   (The first release )   Mirrors: [China] 

About this resource:

The TED-LIUM corpus (mirrored here) is English-language TED talks, with transcriptions, sampled at 16kHz. It contains about 118 hours of speech.

The original page requests that you cite the following paper if you make use of this corpus:

A. Rousseau, P. Deléglise, and Y. Estève, "TED-LIUM: an automatic speech recognition dedicated corpus",
in Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), May 2012.


External URL: http://www-lium.univ-lemans.fr/en/content/ted-lium-corpus   Original source


点赞
收藏
表情
图片
附件