Next-Generation Expressive Personalized Voices for Speech-Generating Devices
用于语音生成设备的下一代富有表现力的个性化声音
基本信息
- 批准号:10547241
- 负责人:
- 金额:$ 27.58万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-08-15 至 2024-08-14
- 项目状态:已结题
- 来源:
- 关键词:ALS patientsAdoptionAdultAgeAlgorithmsAmyotrophic Lateral SclerosisAugmentative and Alternative CommunicationCharacteristicsChildChild HealthClientDepressed moodDiseaseDysarthriaEmotionsEncapsulatedEvaluationFemaleGenerationsGoalsGovernmentHumanHybridsIndividualKnowledgeLaboratory ResearchLearningLinguisticsMachine LearningMethodsModelingNetwork-basedNeurodegenerative DisordersOnset of illnessOutcomeOutputPersonsPhaseProcessProductionReadingRecordsRehabilitation therapyRiskRunningServicesSpeechStructureSurveysSystemTechnologyTextTrainingVoiceVoice Qualitybasecommercial applicationcommunication devicedeep neural networkdesignexperienceexperimental studyimprovedknowledge basemachine learning algorithmmalemimeticsnext generationnovelsoundsuccessvirtual vocal tract
项目摘要
Project Summary/Abstract
The creation of personalized synthetic voices has wide application in medical/rehabilitation settings for pa-
tients who rely on a speech-generating device (SGD) for communication. One common application is voice
banking, wherein a person who risks losing their voice, such as somebody with a neurodegenerative disease
like Amyotrophic Lateral Sclerosis (ALS), records their own speech before the onset of disease-related dysar-
thria for later use in an SGD that mimics their natural speech characteristics. While the technology underlying
the creation of such personalized synthetic voices is growing in maturity and adoption by SGD users, it still suf-
fers from two primary limitations: a lack of expressiveness and a burdensome amount of recording needed to
create highly natural-sounding voices. The proposed project aims to remedy this situation by marrying the ma-
chine-learning technology behind ModelTalker, a pioneering voice-banking text-to-speech service developed at
Nemours Children’s Health, with the knowledge-based technology underlying Synfony, a rule-based text-to-
speech system developed by Synfonica LLC, which is capable of generating a variety of speech styles and ex-
pressive modes. The expert knowledge built into Synfonica will be used to design an optimal set of sentences
for voice bankers to record, and its algorithms for the generation of natural-sounding prosody in different
modes and styles will be integrated into ModelTalker’s machine-learning algorithms, creating a hybrid system
that embraces the best qualities of both approaches. The new text-to-speech (TTS) system resulting from this
project will (a) require a minimal amount of recorded speech from the voice banker, (b) accurately capture
their vocal identity, and (c) be structured such that new expressive modes and speech styles can be added easily
without additional recording. The feasibility of the project will be demonstrated by recording the voices of an
adult male, an adult female, and a child, and generating TTS voices that can speak in three expressive modes
(neutral, happy, and sad). Perceptual experiments will be run to evaluate their intelligibility, naturalness, suc-
cess in capturing the vocal identity of the speaker, and the appropriateness of their expressive modes. In gen-
eral, the project will be a major step forward in enabling the users of personalized synthetic voices to express
their emotions and intentions.
项目总结/摘要
个性化合成语音的创建在医疗/康复环境中具有广泛的应用,
依赖语音生成设备(SGD)进行交流的青少年。一个常见的应用是语音
银行业,其中一个人谁的风险失去了他们的声音,如有人与神经退行性疾病
像肌萎缩性侧索硬化症(ALS),在疾病相关的dysar发作之前记录他们自己的语言,
thria,以便以后在模拟其自然语音特征的SGD中使用。虽然背后的技术
这种个性化合成语音的创建正在成熟并被SGD用户采用,但它仍然足够,
这是由于两个主要的限制:缺乏表现力和需要大量的记录,
创造出非常自然的声音。拟议项目旨在通过与马-
ModelTalker是一种开创性的语音银行文本到语音服务,
Nemours儿童健康,与基于知识的技术基础的Synfony,一个基于规则的文本到
语音系统开发的Synfonica有限责任公司,这是能够产生各种语音风格和前,
压力模式Synfonica内置的专家知识将用于设计一组最佳句子
的语音银行家记录,其算法的自然发声韵律的产生,在不同的
模式和风格将被集成到ModelTalker的机器学习算法中,创建一个混合系统
它包含了两种方法的最佳品质。由此产生的新的文本到语音(TTS)系统
项目将(a)需要从语音银行家记录的语音最小量,(B)准确捕捉
他们的声音身份,(c)结构,使新的表达方式和讲话风格可以很容易地添加
没有额外的记录。该项目的可行性将通过记录一个
成年男性、成年女性和儿童,并生成可以以三种表达模式说话的TTS语音
(中性、快乐和悲伤)。将进行知觉实验,以评估其可理解性,自然性,可理解性,
cess在捕捉说话者的声音身份,以及他们的表达模式的适当性。在gen-
总的来说,该项目将是一个重大的一步,使用户的个性化合成语音表达
他们的情绪和意图
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
H TIMOTHY Bunnell其他文献
H TIMOTHY Bunnell的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('H TIMOTHY Bunnell', 18)}}的其他基金
Personalized speech output for communication devices
通信设备的个性化语音输出
- 批准号:
7219783 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
Personalizing Speech Output for Communication Devices
个性化通信设备的语音输出
- 批准号:
6749031 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
Personalizing Speech Output for Communication Devices
个性化通信设备的语音输出
- 批准号:
6646704 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
Personalized speech output for communication devices
通信设备的个性化语音输出
- 批准号:
7337320 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
相似海外基金
Investigating the Adoption, Actual Usage, and Outcomes of Enterprise Collaboration Systems in Remote Work Settings.
调查远程工作环境中企业协作系统的采用、实际使用和结果。
- 批准号:
24K16436 - 财政年份:2024
- 资助金额:
$ 27.58万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
WELL-CALF: optimising accuracy for commercial adoption
WELL-CALF:优化商业采用的准确性
- 批准号:
10093543 - 财政年份:2024
- 资助金额:
$ 27.58万 - 项目类别:
Collaborative R&D
Unraveling the Dynamics of International Accounting: Exploring the Impact of IFRS Adoption on Firms' Financial Reporting and Business Strategies
揭示国际会计的动态:探索采用 IFRS 对公司财务报告和业务战略的影响
- 批准号:
24K16488 - 财政年份:2024
- 资助金额:
$ 27.58万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
ERAMET - Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
ERAMET - 快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10107647 - 财政年份:2024
- 资助金额:
$ 27.58万 - 项目类别:
EU-Funded
Assessing the Coordination of Electric Vehicle Adoption on Urban Energy Transition: A Geospatial Machine Learning Framework
评估电动汽车采用对城市能源转型的协调:地理空间机器学习框架
- 批准号:
24K20973 - 财政年份:2024
- 资助金额:
$ 27.58万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10106221 - 财政年份:2024
- 资助金额:
$ 27.58万 - 项目类别:
EU-Funded
De-Adoption Beta-Blockers in patients with stable ischemic heart disease without REduced LV ejection fraction, ongoing Ischemia, or Arrhythmias: a randomized Trial with blinded Endpoints (ABbreviate)
在没有左心室射血分数降低、持续性缺血或心律失常的稳定型缺血性心脏病患者中停用β受体阻滞剂:一项盲法终点随机试验(ABbreviate)
- 批准号:
481560 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
Operating Grants
Our focus for this project is accelerating the development and adoption of resource efficient solutions like fashion rental through technological advancement, addressing longer in use and reuse
我们该项目的重点是通过技术进步加快时装租赁等资源高效解决方案的开发和采用,解决更长的使用和重复使用问题
- 批准号:
10075502 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
Grant for R&D
Engage2innovate – Enhancing security solution design, adoption and impact through effective engagement and social innovation (E2i)
Engage2innovate — 通过有效参与和社会创新增强安全解决方案的设计、采用和影响 (E2i)
- 批准号:
10089082 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
EU-Funded
Collaborative Research: SCIPE: CyberInfrastructure Professionals InnoVating and brOadening the adoption of advanced Technologies (CI PIVOT)
合作研究:SCIPE:网络基础设施专业人员创新和扩大先进技术的采用 (CI PIVOT)
- 批准号:
2321091 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
Standard Grant