Voice Transformation for Dysarthria - Phase I
构音障碍的语音转换 - 第一阶段
基本信息
- 批准号:7162050
- 负责人:
- 金额:$ 13.56万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2006
- 资助国家:美国
- 起止时间:2006-09-01 至 2008-03-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
DESCRIPTION (provided by applicant): Software will be developed that transforms speech compromised by dysarthria into easier-to-understand and more natural-sounding speech. The software will reside on laptop computers, with microphone input and amplified speaker or line output. Such software and hardware solutions will assist individuals with dysarthria to better communicate by voice, whether face-to-face or by telephone; it will also help these individuals when interacting with voice controlled services and devices, which are increasingly more popular. The system operates in "Interpreter Mode", meaning that output will take place after a brief processing delay once the speaker has completed an utterance. The software is based on a multi-step formant re-synthesis process: (i) Robust extraction of formant, energy, spectral balance, and pitch trajectories from input speech; (ii) Modification of extracted trajectories by imposition of smoothness and shape based constraints, and by bringing these trajectories in closer proximity to trajectories of normal speech; (iii) Conversion of the trajectories into a speech signal by formant synthesis. Results obtained with a prototype, personal computer based system show that this process is robust, enhances intelligibility, and completely eliminates "vocal fry", i.e., distortions caused by irregularities in the temporal pattern of the vocal folds. In Phase I, the core algorithms performing these steps will be improved and extended, and the software will be ported to a pocketable computer; the resulting system will be evaluated on multiple speakers and listeners; and feedback will be obtained from potential users and their partners about desired features, usability, and functionality. In Phase II, acceptable processing delays will be achieved using known methods for optimizing memory and processing speed; further enhancement capabilities will be added, and the system will be evaluated. The currently targeted product will be the first in a family of speech enhancement products with continually expanding functionality, by capitalizing on ongoing algorithmic and hardware improvements. Usage of standard hardware and software platforms, that in turn are compatible with a wide range of headsets and wearable amplified speakers or telephones, puts this software in a strong competitive position. A large percentage of the more than 2.5 million adult Americans with significant disability due to chronic neurological impairment in the United States present with dysarthria or speech impairment as 1 of their disabling conditions. There are no cures for speech impairments. Dysarthric individuals report losses to employment, educational opportunities, social integration, and quality of life. Individuals are taught strategies that compensate for their impairments, but the isolation caused by communication impairment is pervasive. The project goal is to develop a system that uses a wearable computer to transform speech compromised by dysarthria into easier-to-understand and more natural-sounding speech, and will thereby enable dysarthric individuals to communicate more effectively by telephone or in face-to-face contexts.
描述(由申请人提供):将开发软件,将因构音障碍而受损的语音转换为更容易理解和更自然的语音。该软件将安装在笔记本电脑上,带有麦克风输入和扩音器或线路输出。这样的软件和硬件解决方案将帮助构音障碍患者更好地通过语音交流,无论是面对面还是通过电话;它还将帮助这些人与越来越受欢迎的语音控制服务和设备进行交互。该系统以“翻译模式”运行,这意味着一旦说话者完成一句话,将在短暂的处理延迟后进行输出。该软件基于一个多步骤的形成峰重新合成过程:(i)从输入语音中稳健地提取形成峰、能量、频谱平衡和音高轨迹;(ii)通过施加平滑和基于形状的约束来修改提取的轨迹,并使这些轨迹更接近正常言语的轨迹;(iii)通过共振峰合成将轨迹转换成语音信号。通过基于个人计算机的原型系统获得的结果表明,该过程是鲁棒的,提高了可理解性,并完全消除了“声带杂音”,即由声带时间模式的不规则性引起的扭曲。在第一阶段,执行这些步骤的核心算法将得到改进和扩展,软件将被移植到便携式计算机上;最终的系统将在多个演讲者和听众中进行评估;并且将从潜在用户及其合作伙伴那里获得关于期望的特性、可用性和功能的反馈。在第二阶段,可接受的处理延迟将使用已知的方法来优化存储器和处理速度;进一步的增强能力将被增加,系统将被评估。目前的目标产品将是语音增强产品系列中的第一款,通过利用正在进行的算法和硬件改进,不断扩展功能。标准硬件和软件平台的使用,反过来又与各种耳机和可穿戴扩音器或电话兼容,使该软件处于强大的竞争地位。在美国,超过250万成年美国人因慢性神经损伤而严重残疾,其中很大一部分人表现为构音障碍或语言障碍,这是他们的残疾状况之一。语言障碍是无法治愈的。生活障碍患者报告说,他们失去了就业、教育机会、社会融合和生活质量。个人被教导弥补他们的缺陷的策略,但沟通障碍造成的孤立是普遍存在的。该项目的目标是开发一种系统,该系统使用可穿戴计算机将构音障碍患者的语音转化为更容易理解、听起来更自然的语音,从而使构音障碍患者通过电话或面对面的环境更有效地交流。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Jan van Santen其他文献
Jan van Santen的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Jan van Santen', 18)}}的其他基金
相似海外基金
Wearable Sensor and Digital Technologies for Quantitative Assessment and Remote Monitoring of Symptoms in Myasthenia Gravis
用于定量评估和远程监测重症肌无力症状的可穿戴传感器和数字技术
- 批准号:
10757163 - 财政年份:2023
- 资助金额:
$ 13.56万 - 项目类别:
Wearable Sensor and Digital Technologies for Quantitative Assessment and Remote Monitoring of Symptoms in Myasthenia Gravis
用于定量评估和远程监测重症肌无力症状的可穿戴传感器和数字技术
- 批准号:
10470564 - 财政年份:2022
- 资助金额:
$ 13.56万 - 项目类别:
Function and Regulation of Sema3 Genes in Palate Development and Innervation
Sema3 基因在上颚发育和神经支配中的功能和调节
- 批准号:
10380003 - 财政年份:2021
- 资助金额:
$ 13.56万 - 项目类别:
Clear speech in ALS: Effects of feedback from a novel ASR practice paradigm and practice dosage
ALS 中的清晰言语:新型 ASR 练习范式和练习剂量的反馈效果
- 批准号:
10161598 - 财政年份:2020
- 资助金额:
$ 13.56万 - 项目类别:
Clear speech in ALS: Effects of feedback from a novel ASR practice paradigm and practice dosage
ALS 中的清晰言语:新型 ASR 练习范式和练习剂量的反馈效果
- 批准号:
10063254 - 财政年份:2020
- 资助金额:
$ 13.56万 - 项目类别:
The development and validation of a novel tool for the assessment of bulbar dysfunction in ALS
评估 ALS 延髓功能障碍的新工具的开发和验证
- 批准号:
10405152 - 财政年份:2018
- 资助金额:
$ 13.56万 - 项目类别:
Communicative participation outcomes measurement in neurologic communication diso
神经沟通障碍中的沟通参与结果测量
- 批准号:
8500974 - 财政年份:2013
- 资助金额:
$ 13.56万 - 项目类别:
Communicative participation outcomes measurement in neurologic communication diso
神经沟通障碍中的沟通参与结果测量
- 批准号:
9016525 - 财政年份:2013
- 资助金额:
$ 13.56万 - 项目类别:
Disorders of Emotional Communication in Patient with Cerebellar Dysfunction
小脑功能障碍患者的情绪沟通障碍
- 批准号:
8741909 - 财政年份:2013
- 资助金额:
$ 13.56万 - 项目类别:
Patient-provider communication for patients with communication disorders
沟通障碍患者的医患沟通
- 批准号:
8685941 - 财政年份:2013
- 资助金额:
$ 13.56万 - 项目类别:














{{item.name}}会员




