
Study on spoken language understanding framework integrating knowledge among multiple layers


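Basic information

Grant number:
21300066
Principal investigator:
LEE Akinobu
Amount:
$112,300
Host institution:
Nagoya Institute of Technology
Host institution country:
Japan
Project category:
Grant-in-Aid for Scientific Research (B)
Fiscal year:
2009
Funding country:
Japan
Project period:
2009-04-01 to 2014-03-31
Project status:
Completed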

Project abstract

This study focuses on developing a framework that integrates the handling of multiple knowledge layers, from speech signal processing to spoken language understanding, directly into the speech recognition process in a statistical manner. Statistical models at the language model, acoustic model, and dialogue model layers were widely investigated. For integration, the study investigated speech decoding based on Bayes-risk minimization, in which all constraints can be expressed as Bayes risk, as well as integration methods that use speech information for dialogue management and turn-taking. Some of the results are publicly available as part of the open-source voice interaction building tool MMDAgent and of Julius.


Project outputs

Journal articles (0)

Monographs (0)

Research awards (0)

Conference papers (0)

Patents (0)

Open answer scoring for S-CAT automated speaking test system using support vector regression

DOI:
Publication date:
2012-12
Venue:
Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference
Impact factor:
0
Authors:
Yutaka Ono;Misuzu Otake;T. Shinozaki;R. Nisimura;Takeshi Yamada;K. Ishizuka;Y. Horiuchi;S. Kuroiwa;S. Imai
Corresponding authors:
Yutaka Ono;Misuzu Otake;T. Shinozaki;R. Nisimura;Takeshi Yamada;K. Ishizuka;Y. Horiuchi;S. Kuroiwa;S. Imai

Detecting child speaker based on auditory feature vectors for VTL estimation
