权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

高次元特徴量と低自由度識別器を用いた高精度音声認識器の開発と研究

利用高维特征和低自由度分类器的高精度语音识别器的开发与研究

基本信息

批准号：
09J04190
负责人：
久保陽太郎
金额：
$ 0.9万
依托单位：
Waseda University
依托单位国家：
日本
项目类别：
Grant-in-Aid for JSPS Fellows
财政年份：
2009
资助国家：
日本
起止时间：
2009 至 2010
项目状态：
已结题

项目摘要

本年度は,高次元特徴量と低自由度識別器を用いた音声認識器の大規模音声データへの適用を目的とした研究を行なった.特に,昨年度提案した高次元特徴量の利用法であるカーネル法に基づく音声処理法をデータ数に対してスケーラブルに動作するよう,拡張を行なった.提案手法はカーネル部分空間法に基づく音響モデルの近似と,近似を精密に保つための部分空間追跡法から成る.従来のカーネル法に基づくデータ解析は,一般的にトレーニングデータ数の二乗に比例して計算量が必要となる.音声の諸問題では大量のデータサンプルを用いて処理を行なう必要があり,従来法ような二乗に比例して計算時間を要するアルゴリズムでは適用することが非常に困難である.提案法によって導出されたアルゴリズムでは,トレーニングデータ数に比例する時間で計算が完了することが示されており,音声処理に適した拡張となっている.さらに,提案法は並列化可能なアルゴリズムとなっており,近年のマルチコア/メニーコア環境や,クラウドコンピューティングによる音響モデル学習が容易になっている.提案法はTIMIT音素認識タスクで評価した.TIMIT音素認識タスクで用いられているデータセットは近年の音声認識タスクとしては非常に小さいが,カーネル法をベースとした音声認識タスクでは従来の実験と比べ6倍のサイズにスケールアップすることができた.提案法は従来のさまざまな部分空間法(線形識別分析,Relief特徴選択法)を上回る精度を得られることを確認した.

This year, high-dimensional features, low-degree-of-freedom discriminators, large-scale sound devices, large-scale sound devices, high-dimensional features, low-degree-of-freedom discriminators, high-dimensional features, low-degree-of-freedom discriminators, large-scale mode sounds, high-dimensional features, low-degree-of-freedom discriminators, high-dimensional features, low-degree-of-freedom discriminators, high-dimensional features, low-degree-of-freedom discriminators, large-scale mode sounds, high-dimensional features, low-degree-of-freedom discriminators, high-dimensional features, low-degree-of-freedom discriminators, high-dimensional features, low-degree-of-freedom discriminators, large-scale sound devices, high-dimensional features, low-degree-of-freedom discriminators, high-dimensional features, low-degree-of-freedom discriminators, high- In particular, last year's proposal was made in terms of the number of sound, sound and sound. The modus operandi of the proposal is similar to that of partial space tracking, which is similar to that of precision protection and partial space tracking. In general, it is necessary to calculate the amount of money you need to calculate the quantity. There are a lot of problems with sound and sound. It is very difficult to calculate the time required for the calculation of a large number of sound and sound problems. In the proposed law, it is necessary to calculate the proportion of the number of cars, the proportion of the number of cars, the number of cars, the number of dollars, and the sound of sound. It is possible that the proposed law should be paralleled. In recent years, there has been an increase in the number of people in the environment. In recent years, it is easy to improve the quality of the environment. The proposed method is that TIMIT phonemes and phonemes are very small in recent years, and that they are six times better than those in recent years. The proposed method is used to verify the accuracy of the partial space method (shape analysis, Relief special method).

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Temporal AM-FM combination for robust speech recognition

时域 AM-FM 组合可实现稳健的语音识别

DOI：
发表时间：
2011
期刊：
Speech Communication
影响因子：
3.2
作者：
Masahiro Yokoyama;Watalu Yamamoto and Kazuyuki Suzuki;鈴木治和;喜多隆;許晋碩;Shigeki Okawa et al. (2/4)
通讯作者：
Shigeki Okawa et al. (2/4)

隠れマルコフカーネルマシンを用いた系列データの識別とその音素認識タスクへの適用

使用隐马尔可夫核机识别序列数据及其在音素识别任务中的应用