Audio Visual Speech Recognition
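Basic information
- Grant number: LP0562101
- Principal investigator:
- Amount: $162,900
- Host institution:
- Host institution country: Australia
- Project category: Linkage Projects
- Fiscal year: 2005
- Funding country: Australia
- Project period: 2005-11-01 to 2009-08-31
- Project status: Completed
- Source:
- Keywords: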
Project abstract
Although automatic speech recognition from acoustic information has advanced significantly, recognition accuracy remains poor in noisy and hostile environments such as crowds, traffic and factory floors. In many of these applications, visual information is available, or can easily be made available, in addition to the audio. The aim of this project is to achieve an order-of-magnitude improvement in speech recognition accuracy in adverse environments by jointly processing and modelling the acoustic modality with visual information in the form of lip shapes and movements. The outcomes will be useful for human-computer interaction in adverse environments, as well as for the transcription and mining of multimedia data.
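Project outcomes
Number of journal articles (0)
Number of monographs (0)
Number of research awards (0)
Number of conference papers (0)
Number of patents (0)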
Other grants held by Em/Prof Sridha Sridharan
Solve it or Ignore it? The Challenge of Alignment Distortion and Creating Next Generation Automatic Facial Expression Detection
- Grant number: DP140100793
- Fiscal year: 2014
- Funding amount: $162,900
- Project category: Discovery Projects
The next generation speaker recognition system
- Grant number: LP130100110
- Fiscal year: 2013
- Funding amount: $162,900
- Project category: Linkage Projects
Omniscient face recognition for uncooperative subjects
- Grant number: DP110100827
- Fiscal year: 2011
- Funding amount: $162,900
- Project category: Discovery Projects
Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information
- Grant number: LP0991238
- Fiscal year: 2009
- Funding amount: $162,900
- Project category: Linkage Projects
Robust speaker recognition with reduced utterance duration and intersession variability
- Grant number: DP0877835
- Fiscal year: 2008
- Funding amount: $162,900
- Project category: Discovery Projects
Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods
- Grant number: DP0557387
- Fiscal year: 2005
- Funding amount: $162,900
- Project category: Discovery Projects
Automatic audio segmentation, classification, identification, search and retrieval
- Grant number: LP0235648
- Fiscal year: 2002
- Funding amount: $162,900
- Project category: Linkage Projects
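Similar grants from the National Natural Science Foundation of China
Research on algorithms for visual hull reconstruction and surface attribute modelling from multiple images
- Grant number: 60373031
- Approval year: 2003
- Funding amount: 230,000 yuan
- Project category: General Program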
Similar overseas grants
The role of audio-visual and auditory-motor integration in speech perception: Is what we hear dominated by what we see or how we move?
- Grant number: 2386111
- Fiscal year: 2020
- Funding amount: $162,900
- Project category: Studentship
Development of visual feedback speech training method based on real-time audio visualization system in cleft palate
- Grant number: 20H03891
- Fiscal year: 2020
- Funding amount: $162,900
- Project category: Grant-in-Aid for Scientific Research (B)
Audio-Visual Speech Enhancement and Speaker Separation
- Grant number: 2243852
- Fiscal year: 2019
- Funding amount: $162,900
- Project category: Studentship
Audio-visual prosody of whispered and semi-whispered speech
- Grant number: 426673330
- Fiscal year: 2019
- Funding amount: $162,900
- Project category: Research Grants
Improving audio-visual speech recognition with augmented facial-mapping.
- Grant number: 1964209
- Fiscal year: 2017
- Funding amount: $162,900
- Project category: Studentship
Audio-visual influences on infant speech perception
- Grant number: 482168-2015
- Fiscal year: 2015
- Funding amount: $162,900
- Project category: University Undergraduate Student Research Awards
Innvestigate on audio-visual integration of speech sound
- Grant number: 26750215
- Fiscal year: 2014
- Funding amount: $162,900
- Project category: Grant-in-Aid for Young Scientists (B)
Audio-visual speech processing in bilinguals across the lifespan
- Grant number: 449943-2013
- Fiscal year: 2013
- Funding amount: $162,900
- Project category: University Undergraduate Student Research Awards
Analysis and synthesis method of phonetic/emotional information in audio-visual speech information
- Grant number: 24650100
- Fiscal year: 2012
- Funding amount: $162,900
- Project category: Grant-in-Aid for Challenging Exploratory Research
A multi-modal sensor fusion architecture for audio-visual speech understanding
- Grant number: 184129-2007
- Fiscal year: 2011
- Funding amount: $162,900
- Project category: Discovery Grants Program - Individual