Associate professor of Shenzhen International Graduate School, Tsinghua University. Coordinator with the Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems.
Dr. Zhiyong Wu is an associate professor of Shenzhen International Graduate School, Tsinghua University. He is also a coordinator with the Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems. He received the B.S. and Ph.D. degrees in computer science and technology from Tsinghua University in 1999 and 2005 respectively. From 2005 to 2007, he was a postdoctoral fellow with the Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong (CUHK).
His main research interests cover the areas of intelligent speech interactions, specifically, speech synthesis, voice conversion, speaker recognition, speech recognition, speech enhancement and separation, audio-visual bimodal modeling, affective computing, natural language understanding and generation.
He has been awarded the Annual Teaching Excellence Award of Tsinghua University (2020), and the Higher Education Outstanding Scientific Research Output Award in Technological Advancements from the National Ministry of Education twice (2009, 2016). He won the first prize in the “Spoofing Attack Task” in the GeekPwn 2017 Shanghai Contest (2017).
He has authored more than 100 papers on peer-reviewed top international conferences and journals, including IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Speech Communications, Multimedia Tools and Applications (MTAP), AAAI Conference on Artificial Intelligence (AAAI), International Joint Conference on Artificial Intelligence (IJCAI), ACM International Conference on Multimedia (ACM Multimedia), IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Annual Conference of the International Speech Communication Association (INTERSPEECH), etc.
He also has wide research collaborations with Tencent, Microsoft, ByteDance, XVerse, TAL, MI, Huya, etc.
吴志勇,清华大学深圳国际研究生院副研究员,博士生导师。清华大学-香港中文大学媒体科学、技术与系统联合研究中心副主任。 研究方向为面向人工智能的智能言语交互技术,包括:语音处理、表现力语音合成、个性化表现力可视语音合成(语音合成及虚拟说话人唇动、表情、头动等相关技术)、 语音转换、歌唱合成、语音识别、自然语言理解与生成、音视联合建模、情感计算、机器学习等。 中国计算机学会(CCF)、中国人工智能学会(CAAI)、IEEE、ACM、International Speech Communication Association(ISCA)会员, 担任中国计算机学会语音对话与听觉专业委员会(CCF TFSDAP)委员、全国人机语音通讯学术会议(NCMMSC)常设机构委员、 IEEE计算智能协会智能系统应用委员会委员、国际互联网联盟语音合成标记语言(SSML)国际化工作组成员。 IEEE/ACM TASLP、Speech Communications、MTAP等国际学术期刊及NeurIPS、COLING、ICASSP、INTERSPEECH、APSIPA、ISCSLP、AAAI、IJCNLP等国际学术会议审稿人。 国家自然科学基金网评专家。 承担国家自然科学基金、香港特区政府研究资助局基金、国家社会科学基金等多项课题。 获得2009及2016年度教育部科学技术进步奖,2020年度清华大学年度教学优秀奖。 在IEEE/ACM TASLP、Speech Communications、MTAP、AAAI、IJCAI、ACM Multimedia、ICASSP、ICME、INTERSPEECH等领域内主流学术刊物和会议上发表论文100余篇。 指导的学生多人次获得优秀学位论文、国家奖学金、优秀毕业生,在2017全球极客大赛“AI仿声验声攻防赛”中斩获桂冠。