Computational approaches to human spoken word recognition
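Basic information
- Award number: 1754284
- Principal investigator:
- Amount: $602,300
- Host institution:
- Host institution country: United States
- Project type: Continuing Grant
- Fiscal year: 2018
- Funding country: United States
- Period: 2018-03-15 to 2023-02-28
- Project status: Completed
- Source:
- Keywords:
Project abstract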
This project addresses one of the grand challenges facing cognitive science -- how humans understand speech. People recognize words far more easily than even the best computer speech recognition systems, even though the actual sounds we hear as consonants and vowels vary greatly depending on context (what sounds come before or after), who is talking, and the setting (a quiet room versus a crowded airport). Most current models of speech recognition cannot handle the huge variability in real speech because they do not operate on the actual speech signal. Also, they do not learn, so they cannot model how people acquire language. This project addresses these challenges by comparing current models of speech recognition to each other and to human capabilities, with the goal of understanding how human speech processing is so robust and flexible. In addition, simplified "deep learning" networks will be developed and evaluated as models of human speech recognition. Deep learning networks are similar to cognitive models in that they learn abstract representations of the data, not task-specific rules or algorithms. These networks have been used to create accurate commercial speech recognition systems. By comparing them to human performance, the investigators may provide new insights into why human speech recognition is so robust. The results of this project will have technical implications (better understanding of human flexibility may aid in improving computer speech recognition) and health implications (better understanding of human speech recognition will aid in developing better interventions for language disorders). The project will also support the training of a postdoctoral researcher and a PhD student, both of whom will develop skills that can be used to contribute to research and development in academia or industry. 
This project focuses on the development of a "shallow deep network" model called "DeepListener" that will be compared with the behavior of human listeners. A close match between the millisecond-level behavior of the network (for example, in which words are temporarily confusable with each other) and human performance would suggest that human speech processing may emerge from principles similar to those in the model. In preliminary work, DeepListener learned to recognize 93% of 2000 real words (200 words produced by 10 talkers). DeepListener will be evaluated by detailed comparison to standard neural network models of cognitive theories and to human performance. The ways in which DeepListener is similar and dissimilar to human performance and competing models will help to advance scientific theories of human speech recognition. This project will follow emerging standards for open science: experiments will be pre-registered and data and computer code will be made freely and publicly available. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project outcomes
Journal articles (18)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Does predictive processing imply predictive coding in models of spoken word recognition?
- DOI:
- Year published: 2020
- Journal:
- Impact factor: 0
- Authors: Magnuson, J. S.; Li, M.; Luthra, S.; You, H.; Steiner, R.
- Corresponding author: Steiner, R.
LexFindR: A fast, simple, and extensible R package for finding similar words in a lexicon
- DOI: 10.3758/s13428-021-01667-6
- Year published: 2021
- Journal:
- Impact factor: 5.4
- Authors: Li, ZhaoBin; Crinnion, Anne Marie; Magnuson, James S.
- Corresponding author: Magnuson, James S.
EARSHOT: A minimal network model of human speech recognition that operates on real speech
- DOI:
- Year published: 2019
- Journal:
- Impact factor: 0
- Authors: J. Magnuson; Heejo You; J. Rueckl; Paul D. Allopenna; Monica Li; Sahil Luthra; Rachael Steiner; Hosung Nam; M. Escabí; K. Brown; Rachel M. Theodore; Nicholas Monto
- Corresponding authors: J. Magnuson; Heejo You; J. Rueckl; Paul D. Allopenna; Monica Li; Sahil Luthra; Rachael Steiner; Hosung Nam; M. Escabí; K. Brown; Rachel M. Theodore; Nicholas Monto
Word length, proportion of overlap, and phonological competition in spoken word recognition
- DOI:
- Year published: 2018
- Journal:
- Impact factor: 2.5
- Authors: Elizabeth Simmons; J. Magnuson
- Corresponding authors: Elizabeth Simmons; J. Magnuson
Boosting lexical support does not enhance lexically guided perceptual learning.
- DOI: 10.1037/xlm0000945
- Published: 2021-04
- Journal:
- Impact factor: 0
- Authors: Luthra S; Magnuson JS; Myers EB
- Corresponding author: Myers EB
Other grants by James Magnuson
CRCNS US-Spain Research Proposal: Collaborative Research: Tracking and modeling the neurobiology of multilingual speech recognition
- Award number: 2207770
- Fiscal year: 2022
- Funding amount: $602,300
- Project type: Continuing Grant
Collaborative Research: CompCog: Psychological, Computational, and Neural Adequacy in a Deep Learning Model of Human Speech Recognition
- Award number: 2043903
- Fiscal year: 2021
- Funding amount: $602,300
- Project type: Standard Grant
NRT-UtB: Science of learning, from neurobiology to real-world application: a problem-based approach
- Award number: 1735225
- Fiscal year: 2017
- Funding amount: $602,300
- Project type: Standard Grant
Real-world language: Future directions in the science of communication and the communication of science
- Award number: 1747486
- Fiscal year: 2017
- Funding amount: $602,300
- Project type: Standard Grant
IGERT: Language plasticity - Genes, Brain, Cognition and Computation
- Award number: 1144399
- Fiscal year: 2012
- Funding amount: $602,300
- Project type: Continuing Grant
CAREER: The Time Course of Bottom-up and Top-down Integration in Language Understanding
- Award number: 0748684
- Fiscal year: 2008
- Funding amount: $602,300
- Project type: Continuing Grant
Compensation for Coarticulation: Implications for the Basis and Architecture of Speech Perception
- Award number: 0642300
- Fiscal year: 2007
- Funding amount: $602,300
- Project type: Standard Grant
Special Foreign Currency Travel Support (In Indian Currency) To Participate in the Int'l Symposium on Lectins As Tools In Biology and Medicine; Calcutta, India; January 1981
- Award number: 8022021
- Fiscal year: 1981
- Funding amount: $602,300
- Project type: Standard Grant
Similar NSFC (National Natural Science Foundation of China) grants
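Developing computational methods to predict human virus receptors
- Award number: 32170651
- Year approved: 2021
- Funding amount: CNY 580,000
- Project type: General Program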
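Mathematical models and methods for identifying lncRNAs associated with complex human diseases based on single-sample networks
- Award number: 11701379
- Year approved: 2017
- Funding amount: CNY 210,000
- Project type: Young Scientists Fund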
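Sensing and computational methods for externally acquired diagnostic information for human health
- Award number: 61332011
- Year approved: 2013
- Funding amount: CNY 3,000,000
- Project type: Key Program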
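Cell coalescent theory, computer simulation, and statistical analysis methods for Drosophila and human germlines
- Award number: 91231120
- Year approved: 2012
- Funding amount: CNY 1,000,000
- Project type: Major Research Plan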
Similar overseas grants
Integration of Immunologic Phenotyping with Computational Approaches to Predict Clinical Trajectory in Septic Patients
- Award number: 10708534
- Fiscal year: 2023
- Funding amount: $602,300
- Project type:
Using Single Cell Biological Approaches to Understand CNS TB
- Award number: 10739081
- Fiscal year: 2023
- Funding amount: $602,300
- Project type:
Bottom-up and top-down computational modeling approaches to study CMV retinitis
- Award number: 10748709
- Fiscal year: 2023
- Funding amount: $602,300
- Project type:
Integrative computational-experimental approaches to stratify monogenic disease risk
- Award number: 10889297
- Fiscal year: 2023
- Funding amount: $602,300
- Project type:
Integrative approaches defining the ontogeny, maintenance, and immune response dynamics of marginal-zone B cells
- Award number: 10660534
- Fiscal year: 2023
- Funding amount: $602,300
- Project type: