ITR-(NHS+ASE)-(int+dmc+sim) Automatic Speech Attribute Transcription (ASAT): A Collaborative Speech Research Paradigm and Cyberinfrastructure with Applications to Automatic Speech
ITR-(NHS ASE)-(int dmc sim) 自动语音属性转录 (ASAT):协作语音研究范式和网络基础设施及其在自动语音中的应用
基本信息
- 批准号:0427413
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2004
- 资助国家:美国
- 起止时间:2004-09-15 至 2011-02-28
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
It has long been postulated that a human determines the linguistic identity of a sound based on detected evidences that exist at various levels of the speech knowledge hierarchy, from acoustics to pragmatics. Indeed, people do not continuously convert a speech signal into words as an automatic speech recognition (ASR) system attempts to do. Instead, they detect acoustic and auditory evidences, weigh them and combine them to form cognitive hypotheses, and then validate the hypotheses until consistent decisions are reached. The above human-based model of speech processing suggests a candidate framework for developing next generation speech technologies that have the potential to go beyond the current limitations.In order to bridge the performance gap between ASR systems and humans, the narrow notion of speech-to-text in ASR has to be expanded to incorporate all related human information "hidden" in speech utterances. Instead of the conventional top-down, network decoding paradigm for ASR, we are establishing a bottom-up, event detection and evidence combination paradigm for speech research to facilitate collaborative Automatic Speech Attribute Transcription (ASAT). The goals of the proposed project are: (1) develop feature detection and knowledge integration modules to demonstrate ASAT and ASR; (2) build an open source, highly shared, plug-'n'-play ASAT cyberinfrastructure for collaborative research to lower entry barriers to ASR; and (3) provide an objective evaluation methodology to monitor technology advances in individual modules and across the entire system.
长期以来,人们一直假设人类根据存在于从声学到语用学的语音知识层次的各个层面上检测到的证据来确定声音的语言身份。事实上,人们并不像自动语音识别(ASR)系统试图做的那样,连续地将语音信号转换成单词。取而代之的是,他们检测声学和听觉证据,对它们进行权衡,并将它们结合起来形成认知假设,然后验证这些假设,直到得出一致的决定。上述基于人类的语音处理模型为开发下一代语音技术提供了一个候选框架,该框架具有超越当前限制的潜力。为了弥合ASR系统和人类之间的性能差距,ASR中狭隘的语音到文本的概念必须扩展到包含所有隐藏在语音话语中的相关人类信息。我们正在建立一种自下而上、事件检测和证据组合的语音研究范式,以促进协同自动语音属性转录(ASAT),而不是传统的自上而下、网络解码的ASR范式。拟议项目的目标是:(1)开发特征检测和知识集成模块,以展示反卫星技术和反卫星技术;(2)建立一个开放源码、高度共享、即插即用的反卫星技术网络基础设施,用于协作研究,以降低进入反卫星技术的门槛;(3)提供客观的评价方法,以监测各个模块和整个系统的技术进步。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Chin-Hui Lee其他文献
A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Network
基于深度神经网络的混响时间感知语音去混响方法
- DOI:
- 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
Wu Bo;Li Kehuang;Yang Minglei;Chin-Hui Lee - 通讯作者:
Chin-Hui Lee
On stochastic feature and model compensation approaches to robust speech recognition
- DOI:
10.1016/s0167-6393(98)00028-4 - 发表时间:
1998-08 - 期刊:
- 影响因子:0
- 作者:
Chin-Hui Lee - 通讯作者:
Chin-Hui Lee
Speech Enhancement Based on Deep Neural Networks
- DOI:
- 发表时间:
2014 - 期刊:
- 影响因子:2.9
- 作者:
Chin-Hui Lee - 通讯作者:
Chin-Hui Lee
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning
通过上下文特征参数化和多任务学习改进基于深度神经网络的语音合成
- DOI:
10.1007/s11265-017-1293-z - 发表时间:
2017-10 - 期刊:
- 影响因子:0
- 作者:
温正棋;Kehuang Li;Zhen Huang;Chin-Hui Lee;陶建华 - 通讯作者:
陶建华
TT+GT at TRECVID 2010 Workshop
TT GT 出席 TRECVID 2010 研讨会
- DOI:
- 发表时间:
2010 - 期刊:
- 影响因子:0
- 作者:
Nakamasa Inoue;Toshiya Wada;Yusuke Kamishima;Koichi Shinoda;Ilseo Kim;Byungki Byun;Chin-Hui Lee - 通讯作者:
Chin-Hui Lee
Chin-Hui Lee的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Chin-Hui Lee', 18)}}的其他基金
SGER: Exploring Universal Acoustic Characterization of Spoken Languages
SGER:探索口语的普遍声学特征
- 批准号:
0639204 - 财政年份:2006
- 资助金额:
-- - 项目类别:
Standard Grant
2003 Symposium on Next Generation Automatic Speech Recognition (ASR)
2003年下一代自动语音识别(ASR)研讨会
- 批准号:
0352730 - 财政年份:2003
- 资助金额:
-- - 项目类别:
Standard Grant
SGER: Exploring New Auditory Perception Based Approaches to ASR
SGER:探索基于听觉的新 ASR 方法
- 批准号:
0350408 - 财政年份:2003
- 资助金额:
-- - 项目类别:
Standard Grant
相似海外基金
ITR: Collaborative Research: -\(NHS+ASE)-\(int+dmc\): Networks of Robots and Sensors for First Responders
ITR:合作研究:-(NHS ASE)-(int dmc):急救人员的机器人和传感器网络
- 批准号:
0426838 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Continuing Grant
SCI: ITR-(NHS+ASE)-(int+dmc): Dependable Grids
SCI:ITR-(NHS ASE)-(int dmc):可靠的电网
- 批准号:
0426972 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Standard Grant
ITR: Collaborative Research: (NHS+ASE)-(dmc+int+soc): A Wireless Local Positioning System for Mobile Remote Monitoring
ITR:协作研究:(NHS ASE)-(dmc int soc):用于移动远程监控的无线本地定位系统
- 批准号:
0426925 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Standard Grant
ITR: COLLABORATIVE RESEARCH: -(NHS+ASE)-(dmc+int): Diagnosis and Assessment of Faults, Misbehavior and Threats in Distributed Systems and Networks
ITR:协作研究:-(NHS ASE)-(dmc int):分布式系统和网络中的故障、不当行为和威胁的诊断和评估
- 批准号:
0426453 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Standard Grant
ITR: Collaborative Research: -\(NHS+ASE)-\(int+dmc\): Networks of Robots and Sensors for First Responders
ITR:合作研究:-(NHS ASE)-(int dmc):急救人员的机器人和传感器网络
- 批准号:
0427313 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Continuing Grant
ITR: Collaborative Research: (NHS+ASE) - (dmc+int+soc): A Wireless Local Positioning System for Mobile Remote Monitoring
ITR:协作研究:(NHS ASE) - (dmc int soc):用于移动远程监控的无线本地定位系统
- 批准号:
0427430 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Standard Grant
ITR-(NHS+ASE)-(Sim): Self-Organization of Complex Network Dynamics for Efficiency and Robustness
ITR-(NHS ASE)-(Sim):复杂网络动态的自组织以提高效率和鲁棒性
- 批准号:
0427538 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Standard Grant
ITR - (NHS+ASE) - (dmc+int): Distributed Communications and Control for Multiple Miniature Unmanned Air Vehicles
ITR - (NHS ASE) - (dmc int):多微型无人机的分布式通信和控制
- 批准号:
0428004 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Continuing Grant
ITR - (NHS+ASE+ECS) - (dmc+sim+int): Loosely Cooperating Micro Air Vehicle Networks for Toxic Plume Characterization
ITR - (NHS ASE ECS) - (dmc sim int):用于有毒羽流表征的松散合作微型飞行器网络
- 批准号:
0427947 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Standard Grant
ITR: Collaborative Research: -\(NHS+ASE)-\(int+dmc\): Networks of Robots and Sensors for First Responders
ITR:合作研究:-(NHS ASE)-(int dmc):急救人员的机器人和传感器网络
- 批准号:
0426945 - 财政年份:2004
- 资助金额:
-- - 项目类别:
Continuing Grant