Speaker and session variability in speech processing
语音处理中的说话者和会话变异性
基本信息
- 批准号:105523-2007
- 负责人:
- 金额:$ 1.24万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2007
- 资助国家:加拿大
- 起止时间:2007-01-01 至 2008-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Two of the principal problems in speech technologyare speaker recognition (who is speaking?) and speechrecognition (what is being said?). These problemshave resisted solution because of the numerous typesof variability in speech. For example, the performance ofspeaker recognition systems degrades as a resultof channel and microphone effects and careful modelingis needed to distinguish these nuisance effects from aspectsof the speech signal which are useful in discriminatingbetween speakers. In speech recognition on the other hand,variation betweenspeakers is considered to be a nuisance and the only type of variabilitywhich is of real interest is phonetic variability.This proposal is concerned with developing newstatistical methods of modeling speaker and channel variation using`hidden variables' which I call speaker and channel factors.The techniques that I am developing are purely statistical soit is hard to say exactly what the hidden variables represent butit is reasonable to surmise that they capture features such asa speaker's age and sex, electrical noise in transmission channels andordinary background acoustic noise.
语音技术中的两个主要问题是说话人识别(谁在说话?)和语音识别(正在说什么?)。这些问题一直难以解决,因为语音中有许多类型的可变性。例如,由于通道和麦克风效应,说话人识别系统的性能下降,需要仔细建模以区分这些干扰效应和语音信号中对区分说话人有用的方面。另一方面,在语音识别中,说话人之间的差异被认为是一种滋扰,唯一真正感兴趣的类型的差异是语音差异。这项建议涉及开发新的统计方法,使用我称为说话人和通道因素的隐藏变量来建模说话人和通道变化。我正在开发的技术是纯统计的,所以很难确切地说隐藏变量代表了什么,但可以合理地推测它们反映了说话人的年龄和性别、传输通道中的电噪声和普通的背景声学噪声等特征。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Kenny, Patrick其他文献
Real-time expression of affect through respiration
- DOI:
10.1002/cav.349 - 发表时间:
2010-05-01 - 期刊:
- 影响因子:1.1
- 作者:
de Melo, Celso M.;Kenny, Patrick;Gratch, Jonathan - 通讯作者:
Gratch, Jonathan
Modeling prosodic features with joint factor analysis for speaker verification
- DOI:
10.1109/tasl.2007.902758 - 发表时间:
2007-09-01 - 期刊:
- 影响因子:0
- 作者:
Dehak, Najim;Dumouchel, Pierre;Kenny, Patrick - 通讯作者:
Kenny, Patrick
A study of interspeaker variability in speaker verification
- DOI:
10.1109/tasl.2008.925147 - 发表时间:
2008-07-01 - 期刊:
- 影响因子:0
- 作者:
Kenny, Patrick;Ouellet, Pierre;Dumouchel, Pierre - 通讯作者:
Dumouchel, Pierre
Joint factor analysis versus eigenchannels in speaker recognition
- DOI:
10.1109/tasl.2006.881693 - 发表时间:
2007-05-01 - 期刊:
- 影响因子:0
- 作者:
Kenny, Patrick;Boulianne, Gilles;Dumouchel, Pierre - 通讯作者:
Dumouchel, Pierre
Effect of body mass index on functional outcome in primary total knee arthroplasty - a single institution analysis of 2180 primary total knee replacements
- DOI:
10.5312/wjo.v7.i10.664 - 发表时间:
2016-10-18 - 期刊:
- 影响因子:1.9
- 作者:
O'Neill, Shane C.;Butler, Joseph S.;Kenny, Patrick - 通讯作者:
Kenny, Patrick
Kenny, Patrick的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Kenny, Patrick', 18)}}的其他基金
Representations of Speech Dynamics as Features for Speaker Recognition
语音动力学的表示作为说话人识别的特征
- 批准号:
105523-2012 - 财政年份:2015
- 资助金额:
$ 1.24万 - 项目类别:
Discovery Grants Program - Individual
JFA for text-dependent speaker verification
JFA 用于文本相关的说话人验证
- 批准号:
462115-2013 - 财政年份:2015
- 资助金额:
$ 1.24万 - 项目类别:
Collaborative Research and Development Grants
Representations of Speech Dynamics as Features for Speaker Recognition
语音动力学的表示作为说话人识别的特征
- 批准号:
105523-2012 - 财政年份:2014
- 资助金额:
$ 1.24万 - 项目类别:
Discovery Grants Program - Individual
JFA for text-dependent speaker verification
JFA 用于文本相关的说话人验证
- 批准号:
462115-2013 - 财政年份:2014
- 资助金额:
$ 1.24万 - 项目类别:
Collaborative Research and Development Grants
Representations of Speech Dynamics as Features for Speaker Recognition
语音动力学的表示作为说话人识别的特征
- 批准号:
105523-2012 - 财政年份:2013
- 资助金额:
$ 1.24万 - 项目类别:
Discovery Grants Program - Individual
Representations of Speech Dynamics as Features for Speaker Recognition
语音动力学的表示作为说话人识别的特征
- 批准号:
105523-2012 - 财政年份:2012
- 资助金额:
$ 1.24万 - 项目类别:
Discovery Grants Program - Individual
Speaker and session variability in speech processing
语音处理中的说话者和会话可变性
- 批准号:
105523-2007 - 财政年份:2011
- 资助金额:
$ 1.24万 - 项目类别:
Discovery Grants Program - Individual
Découpage en nouvelles de bulletins télévisés
电视公告中的剪纸
- 批准号:
417255-2011 - 财政年份:2011
- 资助金额:
$ 1.24万 - 项目类别:
Engage Grants Program
Speaker and session variability in speech processing
语音处理中的说话者和会话可变性
- 批准号:
105523-2007 - 财政年份:2010
- 资助金额:
$ 1.24万 - 项目类别:
Discovery Grants Program - Individual
Speaker and session variability in speech processing
语音处理中的说话者和会话可变性
- 批准号:
105523-2007 - 财政年份:2009
- 资助金额:
$ 1.24万 - 项目类别:
Discovery Grants Program - Individual
相似国自然基金
下一代全IP无线网络移动性管理策略研究
- 批准号:60902023
- 批准年份:2009
- 资助金额:16.0 万元
- 项目类别:青年科学基金项目
相似海外基金
Conference: Reinforcing the future of BPE research through a poster session for BPE NSF Awardees
会议:通过 BPE NSF 获奖者海报会议加强 BPE 研究的未来
- 批准号:
2338936 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Standard Grant
Development of a Dedicated Fluidjet Technology for Single-session Debridement of Necrotizing Pancreatitis
开发用于坏死性胰腺炎单次清创的专用流体喷射技术
- 批准号:
10699626 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
How might an unmet patient need in metal health concerning memory be satisfied using transcribed session summaries
如何使用转录的会议摘要来满足患者在金属健康方面与记忆相关的未满足需求
- 批准号:
10045783 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Collaborative R&D
Plans4Care: Personalized Dementia Care On-Demand
Plans4Care:按需个性化痴呆症护理
- 批准号:
10758864 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Novel, On-demand VR for Accessible, Practical, and Engaging therapy (NO VAPE)
新颖的按需 VR,可实现无障碍、实用且引人入胜的治疗(无 VAPE)
- 批准号:
10740956 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Conference: Special Session on Language Technology at the 2024 Algonquian Conference
会议:2024 年阿尔冈昆会议语言技术特别会议
- 批准号:
2333919 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Standard Grant
Development and Initial Evaluation of a Personalized Feedback Intervention based on Ecological Momentary Assessment Data in People with Eating Disorders
基于生态瞬时评估数据的饮食失调患者个性化反馈干预的开发和初步评估
- 批准号:
489142 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Operating Grants
Addressing Rural Health Disparities by Optimizing "High Touch" Intervention Components in Digital Obesity Treatment
通过优化数字肥胖治疗中的“高接触”干预措施来解决农村健康差异
- 批准号:
10601655 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Wellness Achieved Through Changing Habits (WATCH): An Acceptance-Based Healthy Lifestyle Intervention for Diverse Adolescents
通过改变习惯实现健康(WATCH):针对不同青少年的基于接受的健康生活方式干预
- 批准号:
10738846 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别:
Using Re-inforcement Learning to Automatically Adapt a Remote Therapy Intervention (RTI) for Reducing Adolescent Violence Involvement
使用强化学习自动调整远程治疗干预 (RTI),以减少青少年暴力参与
- 批准号:
10834339 - 财政年份:2023
- 资助金额:
$ 1.24万 - 项目类别: