Spontaneous speech recognition

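Basic information

  • Grant number:
    15500098
  • Principal investigator:
  • Amount:
    $20.5K
  • Host institution:
  • Host institution country:
    Japan
  • Project category:
    Grant-in-Aid for Scientific Research (C)
  • Fiscal year:
    2003
  • Funding country:
    Japan
  • Period:
    2003 to 2005
  • Project status:
    Completed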

Project abstract

We investigated spontaneous speech recognition on an academic lecture task and obtained the following results.

(1) Lecture speech recognition using pronunciation variant modeling and unsupervised adaptation

We focused on the pronunciation variations observed in spontaneous speech. To introduce context dependence of pronunciation variants, we proposed a new language modeling method based on morphological analysis data designed for pronunciation variants. The proposed method was evaluated on the Corpus of Spontaneous Japanese (CSJ) and reduced the word error rate (WER) by 4.74% absolute. In addition, unsupervised adaptation of both the acoustic and language models was introduced to improve recognition performance further. WER decreased from 19.96% without adaptation to 15.41% with unsupervised adaptation.

(2) Lecture speech recognition using discrete-mixture HMMs

We have investigated noisy speech recognition using discrete-mixture HMMs (DMHMMs) and found that DMHMMs outperformed continuous-mixture HMMs under environmental noise or impulsive noise conditions. However, it was not clear whether this method is effective in clean conditions. The aim of this investigation was to evaluate the performance of the DMHMM system in clean conditions. We used the Corpus of Spontaneous Japanese (CSJ) for evaluation because we wanted to compare our system with other recognition systems on a common speech corpus and to clarify its performance on this more difficult task. In the recognition experiments, 3000-state DMHMMs (16 mixture components per state) were used as acoustic models. The language model, which represents pronunciation variation, was trained on 6.86 million words from 2668 lectures in CSJ and used for recognition. As a result, the system obtained a WER of 20.30% on 10 academic lectures by male speakers, demonstrating the effectiveness of the proposed method.
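
The WER figures quoted in the abstract can be read with the standard definition: the number of word substitutions, deletions and insertions needed to turn the recognizer output into the reference, divided by the number of reference words. The following is a minimal sketch of that definition, not the scoring tool used in the project. The function names `word_error_rate` and `relative_reduction` are hypothetical, and the code assumes the text is already split into words by whitespace (Japanese transcripts such as CSJ would first need morphological segmentation).

```python
def word_error_rate(reference, hypothesis):
    """Return (substitutions + deletions + insertions) / reference length.

    Assumes both strings are already tokenized with whitespace.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline, improved):
    """Relative error reduction, e.g. for comparing two WER values."""
    return (baseline - improved) / baseline


if __name__ == "__main__":
    # One substitution (b -> x) and one deletion (d) over 4 reference words.
    print(word_error_rate("a b c d", "a x c"))
    # WER values reported in the abstract for unsupervised adaptation.
    print(round(relative_reduction(19.96, 15.41), 3))
```

Under this reading, the drop from 19.96% to 15.41% is 4.55 points absolute, or roughly 22.8% relative to the unadapted system.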

Project outcomes

Journal papers (75)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Robust Speech Recognition Using Discrete-Mixture HMMs
Tetsuo Kosaka (小坂 哲夫): "Noisy speech recognition with discrete-mixture HMMs based on MAP estimation", 18th International Congress on Acoustics, Tu.P2.8 (2004)
  • DOI:
  • Publication date:
  • Journal:
  • Impact factor:
    0
  • Authors:
  • Corresponding author:
Kazuki Matsumoto (松本 和樹): "Removal of microphone characteristic variation in distributed speech recognition clients" (分散音声認識のクライアントにおけるマイク特性変動の除去), Information Processing Society of Japan, Tohoku Branch Workshop, 03-5-B2-2, 1-8 (2004)
  • DOI:
  • Publication date:
  • Journal:
  • Impact factor:
    0
  • Authors:
  • Corresponding author:
A study of speaker recognition methods under noise using speaker vectors (話者ベクトルを用いた雑音下話者認識手法の検討)
Fast optimization of language model weight and insertion penalty from n-best candidates
  • DOI:
    10.1250/ast.26.384
  • Publication date:
    2005-07
  • Journal:
  • Impact factor:
    0.7
  • Authors:
    Akinori Ito;M. Kohda;S. Makino
  • Corresponding author:
    Akinori Ito;M. Kohda;S. Makino

Other grants by KOHDA Masaki

Large-vocabulary continuous speech recognition on spontaneous speech task
  • Grant number:
    18500126
  • Fiscal year:
    2006
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for Scientific Research (C)
Large Vocabulary Continuous Speech Recognition System on Japanese Newspaper Reading Task
  • Grant number:
    10680368
  • Fiscal year:
    1998
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for Scientific Research (C)
Algorithm of Spontaneous Speech Recognition Based on A* Search
  • Grant number:
    07680379
  • Fiscal year:
    1995
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for Scientific Research (C)
Speech Recognition Based on Intelligent Beam Search Algorithm
  • Grant number:
    01460254
  • Fiscal year:
    1989
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for General Scientific Research (B)

Similar overseas grants

Study of automatic captioning based on unified modeling of spontaneous speech recognition and automatic editing
  • Grant number:
    25730112
  • Fiscal year:
    2013
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for Young Scientists (B)
Study on automatic speech recognition based on domain-independent modeling of spontaneous speech
  • Grant number:
    18700177
  • Fiscal year:
    2006
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for Young Scientists (B)
Large-vocabulary continuous speech recognition on spontaneous speech task
  • Grant number:
    18500126
  • Fiscal year:
    2006
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for Scientific Research (C)
Automatic Speech Recognition and Understanding of Lectures and Discussions for Effective Multi-media Archiving
  • Grant number:
    16200011
  • Fiscal year:
    2004
  • Funding amount:
    $20.5K
  • Project category:
    Grant-in-Aid for Scientific Research (A)
Acoustic analysis and modeling of spontaneous speech for speech recognition applications
  • Grant number:
    914-1996
  • Fiscal year:
    2002
  • Funding amount:
    $20.5K
  • Project category:
    Discovery Grants Program - Individual