Embodied Decision Making: The Influence of Action Errors on Reinforcement Learning
具身决策:行动错误对强化学习的影响
基本信息
- 批准号:9263024
- 负责人:
- 金额:$ 34.03万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2015
- 资助国家:美国
- 起止时间:2015-04-01 至 2020-03-31
- 项目状态:已结题
- 来源:
- 关键词:AddressArchitectureAttenuatedBasal GangliaBehaviorBehavioralBiologicalBrainCerebellar DiseasesCerebellumCerebral cortexChoice BehaviorCoffeeCognitionComplementComplexComputer SimulationDataDecision MakingDegenerative DisorderEmployee StrikesEnvironmentEventFailureFeedbackFunctional ImagingFunctional Magnetic Resonance ImagingFunctional disorderFutureGlassGoalsHumanInferiorIntuitionLawsLearningLesionLiteratureLoveMethodsModelingMotorMovementNeurobiologyNeuropsychologyOutcomeParietal LobeParkinson DiseaseParticipantPatientsPersonsPrefrontal CortexProcessPropertyPsychological reinforcementResearchRewardsRoleSeminalSensorySeriesSignal TransductionSourceStructureSystemTennisTestingUpdateVariantWaterWineWorkarmbasecognitive processexperimental studyfrontal lobeimprovedinterestlearning networkmotor controlmotor disordernervous system disorderneuroimagingneuromechanismnovelpsychologicpublic health relevancerelating to nervous systemresponsesensorimotor systemskills
项目摘要
DESCRIPTION (provided by applicant): This proposal explores the interaction of processes involved in action selection and action execution. This interaction is essential for understanding how people learn to make optimal decisions and develop complex skills, as well as for explicating how disorders of the motor system may impact cognition, a question that has been of central interest in studies of degenerative diseases of the cerebellum and basal ganglia. The interaction can be appreciated by considering that successful decision making requires (at least) two fundamental abilities. First, an agent must be able to evaluate the value of different options in the environment, using that information to choose the option that will maximize reward. Second, the agent must be able to execute a response to indicate the selected option. Traditionally, models of decision making have focused on the former and ignored the latter. However, in many real-world situations, errors in execution are the primary impediment to successful outcomes. A tennis player may correctly opt to use a backhand swing instead of a forehand to return a serve, but fail to execute the action properly. Or in a more mundane example, a person might choose to take a sip from the wine glass rather than the water glass, but fail to reap the expected reward because she knocks over the glass by reaching in a clumsy manner. In this example, the issue is whether a person values wine less (due to the failure to obtain the expected reinforcement) after the clumsy reach, or whether the error is attributed to the execution system, with the outcome precluded from influencing future choice behavior (assuming we are equally competent in reaching for water or wine). The proposed work will examine the psychological processes and neural systems through which action execution and action selection interact. To this end, two specific aims will be addressed. 1) Our pilot work demonstrates a striking difference in choice behavior depending on whether failed outcomes are attributed to a property of the object or a limitation of the execution system. A series of computational models will be developed that have the potential to account for this difference. In a symbiotic manner, behavioral data from a series of experiments will be used to evaluate the models, and the models will be used to generate and test specific predictions. 2) Identify the neural regions involved in the interaction of action errors and selection processes. Of special note here is the idea that cerebellar-based representations of action execution errors might serve a dual-purpose, improving future action execution and providing a gating signal to constrain learning processes that underlie action selection dynamics. The experimental plan for this aim entails the integrated use of functional imaging and neuropsychological studies. At the completion of this project, the studies will help provide an integrated picture of how action selection and action execution processes interact in the human brain to optimize behavior.
描述(由申请人提供):本提案探讨了行动选择和行动执行过程中的相互作用。这种相互作用对于理解人们如何学习做出最佳决策和发展复杂技能以及解释运动系统疾病如何影响认知至关重要,这是小脑和基底神经节退行性疾病研究的核心问题。这种相互作用可以通过考虑成功的决策需要(至少)两种基本能力来理解。首先,智能体必须能够评估环境中不同选项的价值,利用这些信息来选择将最大化回报的选项。其次,代理必须能够执行响应以指示所选选项。传统上,决策模型关注前者而忽视后者。然而,在许多现实世界的情况下,执行中的错误是成功结果的主要障碍。一个网球运动员可以正确地选择使用反手挥杆而不是反手击球来回击发球,但不能正确地执行这个动作。或者在一个更平凡的例子中,一个人可能会选择从酒杯而不是水杯中啜饮一口,但却没有获得预期的奖励,因为她笨拙地伸手打翻了杯子。在这个例子中,问题是一个人在笨拙地伸手之后是否更不重视葡萄酒(由于未能获得预期的强化),或者错误是否归因于执行系统,结果被排除在影响未来选择行为之外(假设我们同样有能力伸手去拿水或葡萄酒)。拟议的工作将检查的心理过程和神经系统,通过行动执行和行动选择相互作用。为此,将探讨两个具体目标。1)我们的试点工作表明,选择行为的显着差异取决于失败的结果是否归因于对象的属性或执行系统的限制。将开发一系列计算模型,这些模型有可能解释这种差异。以共生的方式,来自一系列实验的行为数据将用于评估模型,模型将用于生成和测试特定的预测。2)识别参与动作错误和选择过程相互作用的神经区域。这里特别值得注意的是,基于小脑的动作执行错误表示可能具有双重目的,改善未来的动作执行,并提供门控信号来约束动作选择动态基础的学习过程。为此目的的实验计划需要综合使用功能成像和神经心理学研究。在这个项目完成后,这些研究将有助于提供一个综合的图片,如何行动选择和行动执行过程在人脑中相互作用,以优化行为。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
RICHARD IVRY其他文献
RICHARD IVRY的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('RICHARD IVRY', 18)}}的其他基金
Human Cerebellar Function in Multiple Task Domains
多任务域中的人类小脑功能
- 批准号:
10624778 - 财政年份:2020
- 资助金额:
$ 34.03万 - 项目类别:
Human Cerebellar Function in Multiple Task Domains
多任务域中的人类小脑功能
- 批准号:
10842004 - 财政年份:2020
- 资助金额:
$ 34.03万 - 项目类别:
Human Cerebellar Function in Multiple Task Domains
多任务域中的人类小脑功能
- 批准号:
10385723 - 财政年份:2020
- 资助金额:
$ 34.03万 - 项目类别:
A Kilohertz-Frequency, Continuous-Wave Transcranial Magnetic Stimulator to Increase the Dynamic Range of Subthreshold Neuromodulation
千赫兹频率连续波经颅磁刺激器可增加阈下神经调节的动态范围
- 批准号:
9896268 - 财政年份:2019
- 资助金额:
$ 34.03万 - 项目类别:
A Kilohertz-Frequency, Continuous-Wave Transcranial Magnetic Stimulator to Increase the Dynamic Range of Subthreshold Neuromodulation
千赫兹频率连续波经颅磁刺激器可增加阈下神经调节的动态范围
- 批准号:
10021664 - 财政年份:2019
- 资助金额:
$ 34.03万 - 项目类别:
Embodied Decision Making: The Influence of Action Errors on Reinforcement Learning
具身决策:行动错误对强化学习的影响
- 批准号:
8888798 - 财政年份:2015
- 资助金额:
$ 34.03万 - 项目类别:
Relationship of Prefrontal GABA to Inhibitory Mechanisms for Response Preparation
前额叶 GABA 与反应准备抑制机制的关系
- 批准号:
8624150 - 财政年份:2013
- 资助金额:
$ 34.03万 - 项目类别:
Relationship of Prefrontal GABA to Inhibitory Mechanisms for Response Preparation
前额叶 GABA 与反应准备抑制机制的关系
- 批准号:
8730249 - 财政年份:2013
- 资助金额:
$ 34.03万 - 项目类别:
Neural Mechanisms Underlying Hand Choice for Unimanual Action
单手动作手部选择背后的神经机制
- 批准号:
8606267 - 财政年份:2012
- 资助金额:
$ 34.03万 - 项目类别:
Neural Mechanisms Underlying Hand Choice for Unimanual Action
单手动作手部选择背后的神经机制
- 批准号:
8401745 - 财政年份:2012
- 资助金额:
$ 34.03万 - 项目类别:
相似海外基金
CAREER: Efficient Algorithms for Modern Computer Architecture
职业:现代计算机架构的高效算法
- 批准号:
2339310 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Continuing Grant
Hardware-aware Network Architecture Search under ML Training workloads
ML 训练工作负载下的硬件感知网络架构搜索
- 批准号:
2904511 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Studentship
CAREER: Creating Tough, Sustainable Materials Using Fracture Size-Effects and Architecture
职业:利用断裂尺寸效应和架构创造坚韧、可持续的材料
- 批准号:
2339197 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Standard Grant
Travel: Student Travel Support for the 51st International Symposium on Computer Architecture (ISCA)
旅行:第 51 届计算机体系结构国际研讨会 (ISCA) 的学生旅行支持
- 批准号:
2409279 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Standard Grant
Understanding Architecture Hierarchy of Polymer Networks to Control Mechanical Responses
了解聚合物网络的架构层次结构以控制机械响应
- 批准号:
2419386 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Standard Grant
I-Corps: Highly Scalable Differential Power Processing Architecture
I-Corps:高度可扩展的差分电源处理架构
- 批准号:
2348571 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Standard Grant
Collaborative Research: Merging Human Creativity with Computational Intelligence for the Design of Next Generation Responsive Architecture
协作研究:将人类创造力与计算智能相结合,设计下一代响应式架构
- 批准号:
2329759 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Standard Grant
The architecture and evolution of host control in a microbial symbiosis
微生物共生中宿主控制的结构和进化
- 批准号:
BB/X014657/1 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Research Grant
RACCTURK: Rock-cut Architecture and Christian Communities in Turkey, from Antiquity to 1923
RACCTURK:土耳其的岩石建筑和基督教社区,从古代到 1923 年
- 批准号:
EP/Y028120/1 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Fellowship
NSF Convergence Accelerator Track M: Bio-Inspired Surface Design for High Performance Mechanical Tracking Solar Collection Skins in Architecture
NSF Convergence Accelerator Track M:建筑中高性能机械跟踪太阳能收集表皮的仿生表面设计
- 批准号:
2344424 - 财政年份:2024
- 资助金额:
$ 34.03万 - 项目类别:
Standard Grant