联系客服
客服二维码

联系客服获取更多资料

微信号:LingLab1

客服电话:010-82185409

意见反馈
关注我们
关注公众号

关注公众号

linglab语言实验室

回到顶部
中文语音库-THCHS-30

2673 阅读 2020-07-24 18:21:10 上传 0KB

THCHS-30是在安静的办公室环境下,通过单个碳粒麦克风录取的,总时长超过30个小时。大部分参与录音的人员是会说流利普通话的大学生。采样频率16kHz,采样大小16bits。 THCHS-30的文本选取自大容量的新闻,目的是为了扩充863语音库。我们选取1000句来录音

THCHS-30

Identifier: SLR18

Summary: A Free Chinese Speech Corpus Released by CSLT@Tsinghua University

Category: Speech

License: Apache License v.2.0

Downloads (use a mirror closer to you):
data_thchs30.tgz [6.4G]   ( speech data and transcripts )   Mirrors: [China] 
test-noise.tgz [1.9G]   ( standard 0db noisy test data )   Mirrors: [China] 
resource.tgz [24M]   ( supplementary resources, incl. lexicon for training data, noise samples )   Mirrors: [China] 

About this resource:

THCHS30 is an open Chinese speech database published by Center for Speech and Language Technology (CSLT) at Tsinghua University. The origional recording was conducted in 2002 by Dong Wang, supervised by Prof. Xiaoyan Zhu, at the Key State Lab of Intelligence and System, Department of Computer Science, Tsinghua Universeity, and the original name was 'TCMSD', standing for 'Tsinghua Continuous Mandarin Speech Database'. The publication after 13 years has been initiated by Dr. Dong Wang and was supported by Prof. Xiaoyan Zhu. We hope to provide a toy database for new researchers in the field of speech recognition. Therefore, the database is totally free to academic users. You can cite the data using the following BibTeX entry:
@misc{THCHS30_2015,  title={THCHS-30 : A Free Chinese Speech Corpus},  author={Dong Wang, Xuewei Zhang, Zhiyong Zhang},  year={2015},  url={http://arxiv.org/abs/1512.01882} }

PEOPLE

Dong Wang, Xuewei Zhang, Zhiyong Zhang @CSLT, Tsinghua Univ.

CONTACTOR

ROOM1-303, BLDG FIT

CSLT, Tsinghua University

http://cslt.org

http://cslt.riit.tsinghua.edu.cn

External URLs:
http://data.cslt.org/thchs30/README.html  (Original URL from CSLT )
http://pan.baidu.com/s/1hqKwE00   ( Baidu disk )

点赞
收藏
表情
图片
附件