Human-Computer Speech Interaction Lab at Tsinghua University (THUHCSI) targets artificial intelligence (AI) technologies for smart voice user interface (VUI). We specializes in speech synthesis, speech recognition, speaker recognition, speech enhancement, affective computing, natural launguage understanding and generation, multimodal multimedia information processing. THUHCSI jointly establishes "Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems" with Faculty of Engineering of The Chinese University of Hong Kong since 2006. It showcases Guangdong-Hong Kong, Shenzhen-Hong Kong collaborative example in science and technology initiative promoted by Shenzhen municipal government.
THUHCSI took lead or particiapated in series of projects, including National Key Technology R&D Program, National High-tech R&D Program (863 Program), National Key Basic Research Project Program (973 Program), National Natural Science Foundation of China (NSFC), as well as NSFC-RGC (Research Grants Council of Hong Kong) joint research scheme. THUHCSI also has diversified collaborations with major academic instistutions and industrial partners, e.g. Microsoft, Tencent, ByteDance, XVerse, MI, TAL, Databaker.
THUHCSI received the 2009 and 2016 Ministry of Education (MoE) Higher Education Outstanding Scientific Research Output Award in Technological Advancements. THUHCSI owned more than 20 patents, and published more than 100 papers on peer-reviewed international conferences and journals, including IEEE/ACM TASLP, TAC, TMM, AAAI, ACM Multimedia, IJCAI, EMNLP, ICASSP, INTERSPEECH, ICME, ICIP, etc.
Dozens of master and PhD students have graduated from THUHCSI. Many of them won several honors such as Outstanding Dissertation of Tsinghua University, Outstanding Graduate Students, etc.
清华大学人机语音交互实验室(THUHCSI)聚焦人工智能场景下的智能语音交互技术研究,包括语音合成、语音识别、说话人识别、语音增强、情感计算、自然语言理解与生成、多模态人机交互等。实验室与香港中文大学工程学院联合成立了“清华大学-香港中文大学媒体科学、技术与系统联合研究中心”,自2006年以来一直积极开展学术交流与合作,是深圳市政府加强粤港、深港科技合作交流政策的具体体现。
实验室曾多次负责或参加国家科技攻关、国家高技术研究发展计划(863)各类课题、国家重点基础研究发展计划(973)研究项目,以及国家自然科学基金重点项目、面上项目、地区合作项目等课题,并取得多项国内外领先的研究成果。实验室与多个重点大学、互联网智能语音交互公司如腾讯、微软、虎牙、字节跳动、思必驰、小米、标贝等有着紧密的友好合作关系。
多次荣获教育部科技进步奖等部委级以上奖项。在IEEE/ACM TASLP、TAC、TMM、AAAI、ACM Multimedia、IJCAI、EMNLP、ICASSP、INTERSPEECH、ICME、ICIP等国内外主流学术刊物和会议上发表论文100余篇。拥有发明专利20余项。
培养硕士博士研究生数十人,多人次获得清华大学优秀学位论文、清华大学优秀毕业生、北京市优秀毕业生等荣誉,在2017全球极客大赛“AI仿声验声攻防赛”中斩获桂冠。获清华大学教学优秀奖。