A theoretical framework for probabilistic reinforcement learning in the basal ganglia
基底神经节概率强化学习的理论框架
基本信息
- 批准号:10226986
- 负责人:
- 金额:$ 52.13万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-08-15 至 2024-07-31
- 项目状态:已结题
- 来源:
- 关键词:Adaptive BehaviorsAddressAnimalsArchitectureBasal GangliaBehaviorBeliefCellsCorpus striatum structureDataData AnalysesDopamineDorsalExperimental DesignsFutureGoalsLearningLinkLocationModelingNeuronsOutputPathway interactionsPatternPlayPoliciesProbabilityPsychological reinforcementRampRattusRewardsRodentRoleSignal TransductionSpecific qualifier valueSynapsesTestingTimeUncertaintyUpdateWeightWidthWorkbasedesignflexibilityinnovationinsightmathematical modelmotor behaviorneurobiological mechanismpredictive modelingsuccesstheories
项目摘要
Project abstract
According to the standard reinforcement learning framework, the basal ganglia implements estimation of long-
term future reward and the control of actions to maximize future reward. Dopamine (DA) plays a central role by
providing the learning signal (reward prediction error, or RPE) that guides updating of reward predictions and
the action policy. Despite its success, the reinforcement learning framework has been challenged from a
number of directions. Some studies have suggested that DA encodes reward predictions themselves, rather
than reward prediction errors, and other studies have suggested that DA may play a role in invigorating action
selection independently from its contribution to learning. A major goal of this project is to develop a
reinforcement learning theory of basal ganglia function that addresses these challenges, and more broadly
presents a unifying view of how learning, probabilistic inference, and action selection work together to produce
adaptive behavior. Our theoretical innovation can be divided into three components. First, we argue that
cortical inputs to the striatum encode a probability distribution over hidden states, known as the belief state.
Second, we argue that striatal projection neurons transform this input through a set of basis functions, whose
purpose is to facilitate reward prediction. The synaptic weights that parametrize these predictions are updated
based on the DA RPE signal. Third, we argue that action selection circuits in the dorsal striatum use
probabilistic information about rewards to implement uncertainty-guided exploration.
项目摘要
根据标准的强化学习框架,基底神经节实现长-
长期未来奖励和控制行动以最大化未来奖励。多巴胺(DA)通过以下途径发挥核心作用:
- 提供引导奖励预测的更新的学习信号(奖励预测误差,或RPE),以及
行动政策。尽管取得了成功,但强化学习框架受到了来自
方向的数量。一些研究表明,DA编码奖励预测本身,而不是
而不是奖励预测错误,其他研究表明,DA可能在激励行动中发挥作用,
选择独立于其对学习的贡献。该项目的一个主要目标是开发一个
基底神经节功能的强化学习理论解决了这些挑战,
提出了一个统一的观点,学习,概率推理和行动选择如何协同工作,以产生
适应行为我们的理论创新可以分为三个组成部分。首先,我们认为,
对纹状体的皮层输入编码了被称为信念状态的隐藏状态的概率分布。
其次,我们认为纹状体投射神经元通过一组基函数来转换这种输入,
目的是为了便于奖励预测。更新参数化这些预测的突触权重
基于DA RPE信号。第三,我们认为,动作选择回路在背侧纹状体使用,
关于奖励的概率信息,以实现不确定性引导的探索。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Samuel J Gershman其他文献
Samuel J Gershman的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Samuel J Gershman', 18)}}的其他基金
A theoretical framework for probabilistic reinforcement learning in the basal ganglia
基底神经节概率强化学习的理论框架
- 批准号:
10460155 - 财政年份:2019
- 资助金额:
$ 52.13万 - 项目类别:
A theoretical framework for probabilistic reinforcement learning in the basal ganglia
基底神经节概率强化学习的理论框架
- 批准号:
10687830 - 财政年份:2019
- 资助金额:
$ 52.13万 - 项目类别:
相似海外基金
Rational design of rapidly translatable, highly antigenic and novel recombinant immunogens to address deficiencies of current snakebite treatments
合理设计可快速翻译、高抗原性和新型重组免疫原,以解决当前蛇咬伤治疗的缺陷
- 批准号:
MR/S03398X/2 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Fellowship
Re-thinking drug nanocrystals as highly loaded vectors to address key unmet therapeutic challenges
重新思考药物纳米晶体作为高负载载体以解决关键的未满足的治疗挑战
- 批准号:
EP/Y001486/1 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Research Grant
CAREER: FEAST (Food Ecosystems And circularity for Sustainable Transformation) framework to address Hidden Hunger
职业:FEAST(食品生态系统和可持续转型循环)框架解决隐性饥饿
- 批准号:
2338423 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Continuing Grant
Metrology to address ion suppression in multimodal mass spectrometry imaging with application in oncology
计量学解决多模态质谱成像中的离子抑制问题及其在肿瘤学中的应用
- 批准号:
MR/X03657X/1 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Fellowship
CRII: SHF: A Novel Address Translation Architecture for Virtualized Clouds
CRII:SHF:一种用于虚拟化云的新型地址转换架构
- 批准号:
2348066 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Standard Grant
BIORETS: Convergence Research Experiences for Teachers in Synthetic and Systems Biology to Address Challenges in Food, Health, Energy, and Environment
BIORETS:合成和系统生物学教师的融合研究经验,以应对食品、健康、能源和环境方面的挑战
- 批准号:
2341402 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Standard Grant
The Abundance Project: Enhancing Cultural & Green Inclusion in Social Prescribing in Southwest London to Address Ethnic Inequalities in Mental Health
丰富项目:增强文化
- 批准号:
AH/Z505481/1 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Research Grant
ERAMET - Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
ERAMET - 快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10107647 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
EU-Funded
Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10106221 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
EU-Funded
Recite: Building Research by Communities to Address Inequities through Expression
背诵:社区开展研究,通过表达解决不平等问题
- 批准号:
AH/Z505341/1 - 财政年份:2024
- 资助金额:
$ 52.13万 - 项目类别:
Research Grant