权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

低認識精度発声に対する音声認識に関する研究

针对低识别准确率话语的语音识别研究

基本信息

批准号：
15700163
负责人：
柘植覚
金额：
$ 1.73万
依托单位：
The University of Tokushima
依托单位国家：
日本
项目类别：
Grant-in-Aid for Young Scientists (B)
财政年份：
2003
资助国家：
日本
起止时间：
2003 至 2005
项目状态：
已结题

项目摘要

本研究の研究の目的は以下の2点である.◆低認識精度発声の原因解明◆低認識精度発声の認識精度向上この目的を実現するために、次のことを実施した。原因解明のため、現在定期的に収録を行っている特定話者長期間音声データベースを用い、様々な要因との相関分析を行った。この結果より、特定話者の場合、発話速度は音声認識精度への相関が低いことがわかった。これは、発話速度は置換誤りと相関が低いが、挿入誤りとは高い負の相関を持ち、脱落誤りとは高い正の相関を持つため、挿入誤りと脱落誤りが相殺し、発話速度と音声認識精度の相関が低いことがわかった。また、音声認識精度と母音の各正解率との相関をしらべ、母音/a/、/u/は音声認識精度との相関が高いことがわかった。低認識精度発声の認識精度向上のため、原因解明のために使用したデータと同様のデータを使用して、認識精度向上のため、各発声日、発声時間帯に音響モデルを適応することを試みた。これは、認識率改善のためには、一日内の音声変動が有効化、同じ時間帯の音声が有効化を検討した。この検討の結果、音声認識精度改善のためには同一内に発声された音声を用い、音響モデルを適応することが有効であることがわかった。

The purpose of this study is to study the following two points.◆ Reasons for low recognition accuracy sound ◆ Low recognition accuracy sound recognition accuracy up to the goal of the implementation of the first, second and third The reason is clear, the current regular recording is in progress, the specific speaker is in progress, the important reason is related to the analysis The result of this, the situation of the specific speaker, the speed of the speech, the accuracy of the sound recognition, and the correlation are low. For example, if the error rate of the transmission speed is low, and the error rate of the transmission speed is low. The accuracy of vowel recognition and the correlation between vowel/a/and vowel/u/are high. Low recognition accuracy sound recognition accuracy upward, cause solution upward, recognition accuracy upward, sound transmission day, sound transmission time upward, sound transmission time upward, sound transmission time upward, The improvement of the recognition rate and the change of the sound in one day are discussed. The results of this research, the improvement of acoustic recognition accuracy, and the improvement of acoustic recognition accuracy.

项目成果

期刊论文数量（18）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Nonparametric speaker recognition method using Earth Mover's Distance

DOI：
10.1093/ietisy/e89-d.3.1074
发表时间：
2006-03-01
期刊：
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
影响因子：
0.7
作者：
Kuroiwa, S;Umeda, Y;Ren, F
通讯作者：
Ren, F

A lost speech reconstruction method using linguistic information,

一种使用语言信息的丢失语音重建方法，

DOI：
发表时间：
期刊：
Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), Wuhan.Oct.2005
影响因子：
0
作者：
Shingo Kuroiwa;Satoru Tsuge;Fuji Ren
通讯作者：
Fuji Ren

Frequency Characteristic Normalization Method using Blind Equalization Technique with Multiple References for DSR,

使用盲均衡技术和多参考 DSR 的频率特性归一化方法，

DOI：
发表时间：
期刊：
Proceedings of 10th International Conference SPEECH and COMPUTER (SPECOM2005), Patras, Greece, Oct.2005
影响因子：
0
作者：
Satoru Tsuge;Masami Shishibori;Fuji Ren;Kenji Kita;Shingo Kuroiwa
通讯作者：
Shingo Kuroiwa

Acoustic Model Adaptation for Cedec Speech based on Leaning-by-Doing Concept

基于边做边学概念的 Cedec 语音声学模型自适应

DOI：
发表时间：
2006
期刊：
Advances in Natural Language Processing Research in Computing Science Vol.18
影响因子：
0
作者：
Shingo Kuroiwa;Shingo Kuroiwa
通讯作者：
Shingo Kuroiwa

Shingo Kuroiwa: "Blind equalization via minimization of VQ distortion for ETSI standard DSR front-end"Proceedings of Natural Language Processing and Knowledge Engineering. 1. 585-590 (2003)

Shingo Kuroiwa：“通过最小化 ETSI 标准 DSR 前端的 VQ 失真实现盲均衡”自然语言处理和知识工程论文集。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

柘植覚其他文献

Evaluation of Fundamental Frequency Prediction from MFCC Using Japanese Utterance

使用日语话语对 MFCC 基频预测进行评估

DOI：
发表时间：
2009
期刊：
影响因子：
0
作者：
黒岩眞吾;小林邦嘉;柘植覚;任福継;Andrei Doncescu;Hidetomo Nabeshima;Mohamed Abdel Fattah;Katsumi Inoue;Katsumi Inoue;Katsumi Inoue(Hidetomo Nabeshima);Hua Xiang;Haiqing Hu;Koji Iwanuma;Shunji Mitsuyoshi;Katsumi Inoue(Hidetomo Nabeshima);Zhi Teng;Koji Iwanuma;Katsumi Inoue(Eds.);Yu Zhang;Mohamed Abdel Fattah;黒岩眞吾;Mohamed Abdel Fattah;黒岩眞吾;柘植覚;黒岩眞吾;黒岩眞吾;Peilin Jiang;黒岩員吾;Yasunori Kashihara;Takafumi YUI
通讯作者：
Takafumi YUI

種々のテキスト検索モデルの頑健性向上による音声ドキュメント検索の高精度化

通过提高各种文本检索模型的鲁棒性来提高音频文档检索的准确性

DOI：
发表时间：
2014
期刊：
影响因子：
0
作者：
北岡教英;市川賢;柘植覚;武田一哉;北研二
通讯作者：
北研二

話者認識技術の紹介と最近の研究動向

说话人识别技术简介及最新研究动态

DOI：
发表时间：
2009
期刊：
影响因子：
0
作者：
黒岩眞吾;小林邦嘉;柘植覚;任福継;Andrei Doncescu;Hidetomo Nabeshima;Mohamed Abdel Fattah;Katsumi Inoue;Katsumi Inoue;Katsumi Inoue(Hidetomo Nabeshima);Hua Xiang;Haiqing Hu;Koji Iwanuma;Shunji Mitsuyoshi;Katsumi Inoue(Hidetomo Nabeshima);Zhi Teng;Koji Iwanuma;Katsumi Inoue(Eds.);Yu Zhang;Mohamed Abdel Fattah;黒岩眞吾;Mohamed Abdel Fattah;黒岩眞吾;柘植覚;黒岩眞吾;黒岩眞吾;Peilin Jiang;黒岩員吾
通讯作者：
黒岩員吾