权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

高次統計量追跡に基づくハンズフリー音声対話システムの教師なし自律最適化

基于高阶统计跟踪的免提语音对话系统无监督自主优化

基本信息

批准号：
13J05110
负责人：
宮﨑亮一
金额：
$ 0.64万
依托单位：
Nara Institute of Science and Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for JSPS Fellows
财政年份：
2013
资助国家：
日本
起止时间：
2013 至无数据
项目状态：
已结题

项目摘要

本研究は, 音声強調部と音響モデル・言語モデルなどの音声認識部を含めたハンズフリー音声対話システムを想定し, システム全体の統合的な自律的最適化を行うことを目的とするものである. 音声対話システムは様々な環境で運用されることより, 各環境に応じて最高の音声認識性能を引き出すために, 手作業での調整を余儀なくされている.そこで, 音声強調部の内部パラメータに対応する音声認識率を予測する理論を確立する. さらに, それに基づき, 音声認識率が最も高くなるように内部パラメータを自動的に最適化する. 音声認識性能を左右する要因の一つは「SN比の改善量」であることは以前より広く知られているが, この要素は目的音声に混入される雑音の「量」に関するもので, 「質」に関しては評価していない. 音声認識性能の良し悪しは「SN比の改善量」に加えて, 音声強調後の「雑音の品質」・「音声の品質」によっても決定される. そのため, これら三要素の値を事前に算出することが出来れば, それに対応する音声認識率を予測できると考えられ, かっ最も音声認識率が高くなるような内部パラメータの自動最適化にっながる.これまでに, 私は, 「音声の品質」を評価するための高次統計量についての検討を行ってきた. 目的音声に混入する背景雑音の種類が定常か非定常か, また, SN比の良し悪しによっても音声の統計量を安定的に求める手法は異なることを明らかにした. この結果は, 「音声の品質」を評価する上で安定して統計量を求めることが出来るという意味で大きな成果であり, 今後の研究につながると言える.

In this study, the sound emphasis part and the sound recognition part are included in the optimization of the integration of the sound and speech systems. The sound quality of the sound system is the highest in the environment. The theory of acoustic recognition rate prediction is established. In this case, the basic information, the highest sound recognition rate, the internal optimization, automatic optimization. Sound recognition performance is influenced by one of the main factors: SN ratio improvement, quality evaluation, etc. The improvement of SN ratio is added to the sound recognition performance, and the quality of sound is determined after the sound is emphasized. The value of the three elements is calculated in advance, and the sound recognition rate is predicted. The sound recognition rate is high. This is the first time that we have discussed the high-order statistics of "sound quality". The target sound is mixed into the background sound, and the type of background sound varies from constant to unsteady. The results show that the quality of sound can be evaluated on the basis of stability statistics.

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Musical-noise-free blind speech extraction integrating microphone array and iterative spectral subtraction

DOI：
10.1016/j.sigpro.2014.03.010
发表时间：
2014-09
期刊：
Signal Process.
影响因子：
0
作者：
Ryoichi Miyazaki;H. Saruwatari;Satoshi Nakamura;K. Shikano;Kazunobu Kondo;J. Blanchette;M. Bouchard
通讯作者：
Ryoichi Miyazaki;H. Saruwatari;Satoshi Nakamura;K. Shikano;Kazunobu Kondo;J. Blanchette;M. Bouchard

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

宮﨑亮一其他文献

宮﨑亮一的其他文献

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

立即体验

相似海外基金

対称性に基づくアレイ信号処理に関する研究

基于对称性的阵列信号处理研究

批准号：
10J06927
财政年份：
2010
资助金额：
$ 0.64万
项目类别：
Grant-in-Aid for JSPS Fellows

会员权益说明：