Hands free speech recognition method based on auditory characteristics
基于听觉特征的免提语音识别方法
基本信息
- 批准号:15500106
- 负责人:
- 金额:$ 2.37万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (C)
- 财政年份:2003
- 资助国家:日本
- 起止时间:2003 至 2004
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Firstly, we proposed a forward masking of Mel-LPC based spectrum on the generalized logarithmic scale. Besides, the variance normalization and a mashing control with the estimated SNR are examined for improving noise robustness.The experimental results on the Aurora-2 database showed that Mel-LPC based cepstrum on generalized log-scale with cepstrum mean and variance normalization for γ=0.1 provides the best performance over the normalized forward masking parameter under any condition.Secondly, We developed a frequency warped Wiener filter to enhance Mel-LPC spectra in presence of additive noise. The proposed filter is directly estimated from the signal on the linear frequency scale and then is efficiently implemented in the autocorrelation domain without denoising input speech. As a result of evaluation using Aurora 2 database, the optimum filter order is shown to be comparable to that of Mel-LPC analysis, and thus filtering is computationally inexpensive. Word accuracy is improved by about 20% at most with the proposed Wiener filter.Thirdly, in order to reduce the influence of reverberation, we examined a reverberation model on the power trajectory domain at the output of a mel-filter in the MFCC analysis. The model parameters consists of the decay rate representing reverberation, the ratio of reverberant power to the direct sound, and the frequency response of the channel including some parts of coloration. Recognition experiments show that the dereverberation method based on this model attains about 10% improvement in Ace. compared to non-processed conditions.
首先,我们提出了一种基于广义对数尺度的Mel-LPC谱的前向掩蔽方法。在Aurora-2数据库上的实验结果表明,基于广义对数尺度倒谱的Mel-LPC方法在任何情况下都比归一化前向掩蔽参数具有更好的抗噪性能。我们开发了一个频率弯曲维纳滤波器,以增强梅尔LPC频谱中存在的加性噪声。该滤波器直接从线性频率尺度上的信号中估计,然后在自相关域中有效地实现,而无需对输入语音进行去噪。作为使用极光2数据库的评估结果,最佳滤波器的顺序被证明是可比的梅尔LPC分析,因此过滤是计算成本低廉。第三,为了减少混响的影响,我们在MFCC分析中,研究了一种在Mel滤波器输出端功率轨迹域上的混响模型。模型参数包括代表混响的衰减率、混响功率与直达声的比率以及包括某些着色部分的通道的频率响应。识别实验表明,基于该模型的去混响方法在Ace中获得了约10%的改善。与非加工条件相比。
项目成果
期刊论文数量(24)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Reverberation modeling on power spectral trajectory for distant speech recogntion
用于远程语音识别的功率谱轨迹混响建模
- DOI:
- 发表时间:2005
- 期刊:
- 影响因子:0
- 作者:H.Matsumoto;T.Takei;K.Yamamoto
- 通讯作者:K.Yamamoto
Reverberation modeling on power spectral trajectory for distant Speech recognition
用于远程语音识别的功率谱轨迹混响建模
- DOI:
- 发表时间:2005
- 期刊:
- 影响因子:0
- 作者:H.Matsumoto;T.Takei;K Yamamoto
- 通讯作者:K Yamamoto
K.Yamamoto, T.Ikeda, H.Matsumoto, et al.: "Syllable-connected models for Japanese speech recognition"Proc.of 18^<th> International Congress on Acoustics. (発表予定). (2004)
K.Yamamoto、T.Ikeda、H.Matsumoto 等人:“日语语音识别的音节连接模型”Proc.of 第 18 届国际声学大会(待提交)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Improved forward masking on a generalized logarithmic scale for robust speech recognition
改进了广义对数尺度上的前向掩蔽,以实现稳健的语音识别
- DOI:
- 发表时间:2004
- 期刊:
- 影响因子:0
- 作者:H.Matsumoto;T.Ichikawa;K.Yamamoto
- 通讯作者:K.Yamamoto
H.Matsumoto, T.Ichikawa, K.Yamamoto: "Improved forward masking on a generalized logarithmic scale for robust speech recognition"Proc.of 18^<th> International Congress on Acoustics. (発表予定). (2004)
H.Matsumoto、T.Ichikawa、K.Yamamoto:“在广义对数尺度上改进前向掩蔽以实现稳健的语音识别”第 18 届国际声学大会(即将提交)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
MATSUMOTO Hiroshi其他文献
MATSUMOTO Hiroshi的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('MATSUMOTO Hiroshi', 18)}}的其他基金
Study of treatment effect for sleep bruxism with using two different type of splint alternately.
两种不同类型夹板交替使用治疗睡眠磨牙症的效果研究
- 批准号:
25861852 - 财政年份:2013
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
The Relationship between Visceral Fat, Adipoctokines and colorectal neoplasma development evaluated by computed tomography (CT) colonography
通过计算机断层扫描(CT)结肠成像评估内脏脂肪、脂肪因子与结直肠肿瘤发展之间的关系
- 批准号:
23591805 - 财政年份:2011
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A System Dynamics Model of Urban Energy Flow in Asian Core Cities
亚洲核心城市城市能量流系统动力学模型
- 批准号:
21560611 - 财政年份:2009
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Disturbance of innate immune system in alcoholic sudden death and organ damage
先天免疫系统紊乱导致酒精性猝死和器官损伤
- 批准号:
20390196 - 财政年份:2008
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
アルコール濫用による細胞・組織防御システムの破綻機構
酗酒导致细胞和组织防御系统崩溃的机制
- 批准号:
18390206 - 财政年份:2006
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Molecular mechanisms of disturbance in cell survival system by alcohol abuse
酗酒扰乱细胞生存系统的分子机制
- 批准号:
16390196 - 财政年份:2004
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Study on the development of an air cleaner using adsorption/desorption effect and its performance evaluation
利用吸附/解吸效应的空气净化器的研制及其性能评价
- 批准号:
15560512 - 财政年份:2003
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Molecular Mechanisms of Tolerance to Reactive Oxygen Species and Its Induction in Weeds
杂草对活性氧的耐受及其诱导的分子机制
- 批准号:
15380011 - 财政年份:2003
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Study on ion-wave-plasmas interactions in space plasmas via spacecraft observations and computer experiments-Breakthrough of the magnetospheric physics
通过航天器观测和计算机实验研究空间等离子体中离子波等离子体相互作用——磁层物理的突破
- 批准号:
15204044 - 财政年份:2003
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Molecular mechanisms of inhibition of ubiquitination by alcohol
酒精抑制泛素化的分子机制
- 批准号:
14570393 - 财政年份:2002
- 资助金额:
$ 2.37万 - 项目类别:
Grant-in-Aid for Scientific Research (C)