ITR-Collaborative Research: Development and Evaluation of a Hybrid Concatenative/Rule-Based Visual Speech Synthesis System
ITR 合作研究:混合串联/基于规则的视觉语音合成系统的开发和评估
基本信息
- 批准号:0312810
- 负责人:
- 金额:$ 18.32万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2003
- 资助国家:美国
- 起止时间:2003-07-15 至 2007-06-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
This project's goal is to develop a synthetic talking face. Humans developed sophisticated abilities to perceive and integrate auditory and visual (AV) speech information long before they were required to read printed text presented by computers. Seeing as well as hearing speech reduces the cognitive workload and improves comprehension over only hearing the talker. To realize the advantages of AV speech for human-computer interactions requires synthesizing visual speech, thereby providing an unlimited supply of visual speech images without having to pre-record data. The approach here is to drive optical speech synthesis with speech acoustics. Computational methods obtain models of the transformation from acoustics to optics. The method capitalizes on the speech production coarticulatory information captured by diphones to produce naturalistic visual speech images. The method is applied directly to natural acoustic speech features to obtain coordination between acoustic and optical signals. The synthesized visual speech is based on a texture-mapped wire frame model. A natural speech corpus to base the synthesis is being obtained via simultaneously recorded 3-D optical, audio, and video data. Synthesis development is guided by human perceptual testing. The DVD archived corpus will be disseminated. The project will lead to expanded access to information and improvement in obtaining knowledge by diverse groups of individuals, for example: children still acquiring literacy skills; adults with inadequate literacy; individuals who are using a second language; and individuals with hearing losses who rely on audiovisual speech. Results will be disseminated broadly through professional outlets. Graduate and undergraduate students will participate.
这个项目的目标是开发一个合成的会说话的脸。早在人类被要求阅读由计算机呈现的印刷文本之前,他们就发展出了感知和整合听觉和视觉(AV)语音信息的复杂能力。与只听说话者相比,看和听可以减少认知负荷,提高理解能力。为了实现AV语音在人机交互中的优势,需要合成视觉语音,从而在不需要预录制数据的情况下提供无限的视觉语音图像。这里的方法是用语音声学驱动光学语音合成。计算方法得到了从声学到光学的转换模型。该方法利用由听筒捕获的语音产生协同发音信息来产生自然的视觉语音图像。该方法直接应用于自然声语音特征,获得声光信号之间的协调关系。合成的视觉语音基于纹理映射线框模型。通过同时记录的三维光学、音频和视频数据,可以获得基于合成的自然语音语料库。综合开发以人类感知测试为指导。将分发DVD存档的文集。该项目将扩大获取信息的机会,并改善不同个人群体获取知识的情况,例如:仍在学习识字技能的儿童;识字能力不足的成年人;使用第二语言的个人;以及那些依赖视听语言的听力损失患者。结果将通过专业渠道广泛传播。研究生和本科生将参加。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Abeer Alwan其他文献
Modeling auditory perception to improve robust speech recognition
建立听觉感知模型以提高稳健的语音识别能力
- DOI:
- 发表时间:
1997 - 期刊:
- 影响因子:0
- 作者:
B. Strope;Abeer Alwan - 通讯作者:
Abeer Alwan
Unraveling the associations between voice pitch and major depressive disorder: a multisite genetic study
揭示声音音调与重度抑郁症之间的关联:一项多站点遗传研究
- DOI:
10.1038/s41380-024-02877-y - 发表时间:
2024-12-31 - 期刊:
- 影响因子:10.100
- 作者:
Yazheng Di;Elior Rahmani;Joel Mefford;Jinhan Wang;Vijay Ravi;Aditya Gorla;Abeer Alwan;Kenneth S. Kendler;Tingshao Zhu;Jonathan Flint - 通讯作者:
Jonathan Flint
Optical Phonetics and Visual Percep Stress in Eng
英语中的光学语音和视觉感知压力
- DOI:
- 发表时间:
2003 - 期刊:
- 影响因子:0
- 作者:
P. Keating;Marco Baroni;Sven Matty;E. T. Auer;Rebecca Scarborough;Abeer Alwan;E. Bernstein - 通讯作者:
E. Bernstein
Towards Automatically Assessing Children’s Picture Description Tasks
自动评估儿童图片描述任务
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
Hariram Veeramani;Natarajan Balaji Shankar;Alexander Johnson;Abeer Alwan - 通讯作者:
Abeer Alwan
An Analysis of Large Language Models for African American English Speaking Children’s Oral Language Assessment
非裔美国英语儿童口语评估大语言模型分析
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
Alexander Johnson;Christina Chance;Kaycee Stiemke;Hariram Veeramani;Natarajan Balaji Shankar;Abeer Alwan - 通讯作者:
Abeer Alwan
Abeer Alwan的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Abeer Alwan', 18)}}的其他基金
Collaborative Research: Improving speech technology for better learning outcomes: the case of AAE child speakers
协作研究:改进语音技术以获得更好的学习成果:AAE 儿童扬声器的案例
- 批准号:
2202585 - 财政年份:2022
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
Collaborative Research: RI: Small: From Ultrasound and MRI to articulatory and acoustic models of child speech development
合作研究:RI:小型:从超声和 MRI 到儿童言语发展的发音和声学模型
- 批准号:
2006979 - 财政年份:2020
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
Workshop for Undergraduate and MS Female Students in Speech Science and Technology
语音科学与技术本科生和女硕士讲习班
- 批准号:
1745166 - 财政年份:2017
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
NRI: INT: COLLAB: Development, Deployment and Evaluation of Personalized Learning Companion Robots for Early Literacy and Language Learning
NRI:INT:COLLAB:用于早期识字和语言学习的个性化学习伴侣机器人的开发、部署和评估
- 批准号:
1734380 - 财政年份:2017
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
RI: Medium: Collaborative Research: Variance and Invariance in Voice Quality: Implications for Machine and Human Speaker Identification
RI:媒介:协作研究:语音质量的方差和不变性:对机器和人类说话人识别的影响
- 批准号:
1704167 - 财政年份:2017
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
A Workshop for Junior Female Researchers in Speech Science and Technology
语音科学与技术青年女性研究员研讨会
- 批准号:
1637240 - 财政年份:2016
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
The Role of Speech Science in Developing Robust Speech Technology Applications
语音科学在开发强大的语音技术应用中的作用
- 批准号:
1543522 - 财政年份:2015
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
EAGER: Collaborative Research: Models of Child Speech
EAGER:合作研究:儿童言语模型
- 批准号:
1551113 - 财政年份:2015
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
EAGER: Variance and Invariance in Voice Quality
EAGER:语音质量的方差和不变性
- 批准号:
1450992 - 财政年份:2014
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
EAGER: Collaborative Research: Towards Modeling Human Speech Confusions in Noise
EAGER:协作研究:对噪声中的人类语音混乱进行建模
- 批准号:
1247809 - 财政年份:2012
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
相似海外基金
ITR Collaborative Research: Pervasively Secure Infrastructures (PSI): Integrating Smart Sensing, Data Mining, Pervasive Networking, and Community Computing
ITR 协作研究:普遍安全基础设施 (PSI):集成智能传感、数据挖掘、普遍网络和社区计算
- 批准号:
1404694 - 财政年份:2013
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR-SCOTUS: A Resource for Collaborative Research in Speech Technology, Linguistics, Decision Processes, and the Law
ITR-SCOTUS:语音技术、语言学、决策过程和法律合作研究的资源
- 批准号:
1139735 - 财政年份:2011
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
0963973 - 财政年份:2009
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
1018072 - 财政年份:2009
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR Collaborative Research: A Reusable, Extensible, Optimizing Back End
ITR 协作研究:可重用、可扩展、优化的后端
- 批准号:
0838899 - 财政年份:2008
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR Collaborative Research: Pervasively Secure Infrastructures (PSI): Integrating Smart Sensing, Data Mining, Pervasive Networking, and Community Computing
ITR 协作研究:普遍安全基础设施 (PSI):集成智能传感、数据挖掘、普遍网络和社区计算
- 批准号:
0833849 - 财政年份:2008
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
0808419 - 财政年份:2007
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR: Collaborative Research - ASE - (sim+dmc): Image-based Biophysical Modeling: Scalable Registration and Inversion Algorithms and Distributed Computing
ITR:协作研究 - ASE - (sim dmc):基于图像的生物物理建模:可扩展配准和反演算法以及分布式计算
- 批准号:
0849301 - 财政年份:2007
- 资助金额:
$ 18.32万 - 项目类别:
Continuing Grant
ITR: Collaborative Research: Modeling and Display of Haptic Information for Enhanced Performance of Computer-Integrated Surgery
ITR:协作研究:触觉信息建模和显示,以提高计算机集成手术的性能
- 批准号:
0711040 - 财政年份:2007
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant
Collaborative Research: ITR-(ASE)-(dmc): Overcoming Fractionation Errors in Cancer Treatement Planning
合作研究:ITR-(ASE)-(dmc):克服癌症治疗计划中的分割错误
- 批准号:
0749671 - 财政年份:2006
- 资助金额:
$ 18.32万 - 项目类别:
Standard Grant