Robust HMMs against environmental variation for speech recognition
针对语音识别环境变化的鲁棒 HMM
基本信息
- 批准号:10680376
- 负责人:
- 金额:$ 1.92万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (C)
- 财政年份:1998
- 资助国家:日本
- 起止时间:1998 至 1999
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
(1) A Study on Variance Expansion of HMMs Robust to Environmental VariationThis project addresses the problem of making HMMs robust to variation of SNR. This study developed a noise varicance expansion technique for HMMs, which consists of simply expanding the variace of cepstral coefficients for the noise model in HMM composition. The effect of this technique is examined through speaker independent digit recognition tests using NOISEX-92 noise data. The results show that the variance expansion of the 0th order cepstrum extremely improves robustness to a wide range of SNR mismatch over the standard HMM. The appropriate expansion factor is determined irrespective of noise types such that the expanded variance of the zeroth cepstrum is around 5 to 6dB with respect to its geometric mean.(2) A Stuty on a Robust Spectral Analysis to Additive NoiseA part of this project also developed a simple and efficient time domain technique to estimate an all-poll model on a mel-frequency axis (Mel-LPC). This method requires only two-fold computational cost as compared to conventional linear prediction analysis. Gender-dependent phoneme recognition tests show that the Mel-LPC cepstrum attains a significant improvement in recognition accuracy over conventional LP mel-cepstra and the mel-frequency cepstrum coefficients (MFCC). Furthermore, noisy word recognition tests revealed that the Mel-LPC cepstrum is robust to wide-band additive noise over conventional LP mel-cepstrum and MFCC.
(1)本课题研究的是如何使HISTORY算法对信噪比的变化具有鲁棒性。本文提出了一种用于HMM的噪声方差扩展技术,它包括简单地扩展HMM组成中噪声模型的倒谱系数的方差。使用NOISEX-92噪声数据,通过与说话人无关的数字识别测试,检查这种技术的效果。结果表明,0阶倒谱的方差扩展极大地提高了标准HMM对大范围信噪比失配的鲁棒性。适当的扩展因子被确定为与噪声类型无关,使得第零倒谱的扩展方差相对于其几何平均值约为5至6dB。(2)抗加性噪声的抗差谱分析研究本项目的一部分还发展了一种简单有效的时域方法来估计梅尔频率轴上的全轮询模型(Mel-LPC)。与传统的线性预测分析相比,这种方法只需要两倍的计算成本。性别相关的音素识别测试表明,梅尔LPC倒谱实现了一个显着的改善,在识别精度比传统的LP梅尔倒谱和梅尔频率倒谱系数(MFCC)。此外,噪声字识别测试表明,梅尔LPC倒谱是强大的宽带加性噪声比传统的LP梅尔倒谱和MFCC。
项目成果
期刊论文数量(6)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
H.Matsumoto, et al.: "An efficient MEL-LPC analysis method for speech recognition" Proc.of Int.Conference on Spoken Language Processing. 1051-1054 (1998)
H.Matsumoto 等人:“一种用于语音识别的高效 MEL-LPC 分析方法”Proc.of Int.Conference on Spoken Language Processing。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
H. Matsumoto, et al.: "Robust HMM to variation of noisy environments based on variance extension of noise models"Proc. of 6th ESCA. 2387-2390 (1999)
H. Matsumoto 等人:“基于噪声模型的方差扩展对噪声环境变化的稳健 HMM”Proc。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
H. Matsumoto, et al: "Robust HMM to variation of noisy environments based on variance extension of noise models"Proc. of 6th European Speech Conference. 2387-2390 (1999)
H. Matsumoto 等人:“基于噪声模型的方差扩展对噪声环境变化的稳健 HMM”Proc。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
H. Matsumoto, et al.: "An efficient Mel-LPC analysis method for speech recognition"Proc. of ICSLP'98. 1051-1054 (1998)
H. Matsumoto 等人:“一种用于语音识别的高效 Mel-LPC 分析方法”Proc。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
H.Matsumoto, et al: "An efficient Mel-LPC analysis method for speech recognition"Proc. of International Conference on Spoken Language Processing. 1051-1054 (1998)
H.Matsumoto 等人:“一种用于语音识别的高效 Mel-LPC 分析方法”Proc。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
MATSUMOTO Hiroshi其他文献
MATSUMOTO Hiroshi的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('MATSUMOTO Hiroshi', 18)}}的其他基金
Study of treatment effect for sleep bruxism with using two different type of splint alternately.
两种不同类型夹板交替使用治疗睡眠磨牙症的效果研究
- 批准号:
25861852 - 财政年份:2013
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
The Relationship between Visceral Fat, Adipoctokines and colorectal neoplasma development evaluated by computed tomography (CT) colonography
通过计算机断层扫描(CT)结肠成像评估内脏脂肪、脂肪因子与结直肠肿瘤发展之间的关系
- 批准号:
23591805 - 财政年份:2011
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A System Dynamics Model of Urban Energy Flow in Asian Core Cities
亚洲核心城市城市能量流系统动力学模型
- 批准号:
21560611 - 财政年份:2009
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Disturbance of innate immune system in alcoholic sudden death and organ damage
先天免疫系统紊乱导致酒精性猝死和器官损伤
- 批准号:
20390196 - 财政年份:2008
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
アルコール濫用による細胞・組織防御システムの破綻機構
酗酒导致细胞和组织防御系统崩溃的机制
- 批准号:
18390206 - 财政年份:2006
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Molecular mechanisms of disturbance in cell survival system by alcohol abuse
酗酒扰乱细胞生存系统的分子机制
- 批准号:
16390196 - 财政年份:2004
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Hands free speech recognition method based on auditory characteristics
基于听觉特征的免提语音识别方法
- 批准号:
15500106 - 财政年份:2003
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Study on the development of an air cleaner using adsorption/desorption effect and its performance evaluation
利用吸附/解吸效应的空气净化器的研制及其性能评价
- 批准号:
15560512 - 财政年份:2003
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Study on ion-wave-plasmas interactions in space plasmas via spacecraft observations and computer experiments-Breakthrough of the magnetospheric physics
通过航天器观测和计算机实验研究空间等离子体中离子波等离子体相互作用——磁层物理的突破
- 批准号:
15204044 - 财政年份:2003
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Molecular Mechanisms of Tolerance to Reactive Oxygen Species and Its Induction in Weeds
杂草对活性氧的耐受及其诱导的分子机制
- 批准号:
15380011 - 财政年份:2003
- 资助金额:
$ 1.92万 - 项目类别:
Grant-in-Aid for Scientific Research (B)














{{item.name}}会员




