Sparse representations for reinforcement learning
强化学习的稀疏表示
基本信息
- 批准号:RGPIN-2018-05721
- 负责人:
- 金额:$ 2.84万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2020
- 资助国家:加拿大
- 起止时间:2020-01-01 至 2021-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
A key component of an artificial intelligence system is the ability to process and learn from a high-dimensional, high-volume sensory stream of information. For example, an agent controlling the pumps in an industrial plant continually receives sensory information about temperatures and energy consumption, to continually adjust the motor speed in real-time to optimize performance. To make such decision, the agents needs to be able to predict the long-term outcomes of their behaviour. For example, if the industrial agent can predict the long-term temperature of the motor, given the current state of the system, they can use these predictions to improve their decisions and ensure motors are not damaged. 
Such predictions, however, can be difficult to learn accurately from raw sensory information. Predictions are typically learned as functions of inputted sensory information. For example, the prediction of motor temperature in five minutes could be approximated as a polynomial function of the last ten recorded temperature and motor speeds. Polynomials, however, are only one possible functional form, and not necessarily the best one. Further, to obtain general learning agents, the functional forms should be effective across multiple settings or tasks. This is the goal of representation learning in reinforcement learning: identifying a general mapping from a sequence of raw sensory information to a set of features, that facilitates accurate predictions. 
The goal in my research is to understandboth theoretically and empiricallythe properties of effective representations for a reinforcement learning agent learning on a continual stream of sensory information. A part of this challenge is to identify simpler representations for which we can provide optimization guarantees, but that are nonetheless sufficiently powerful to facilitate learning. Continuing preliminary research, I will explore prototype-based (kernel) representations and a sparse supervised auto-encoder representation. We have already found that, within this class of simpler representations, we can find computational models that provide highly accurate predictions, but are more amenable to theoretical analysis. A core component of this research direction will be to investigate sparsity as a generally useful property of representations, and how we can encode that property into our representation learning algorithms. 
If successful, this research will have important scientific and societal benefits. This research will contribute to a core endeavour in artificial intelligence: understanding how to develop intelligent agents that can learn in complex environments. This understanding, in turn, will contribute to improving the robustness of automated decision-making systems, which are becoming ubiquitous in our world, including in industrial systems and factories, in self-driving vehicles and even in our homes.
人工智能系统的一个关键组成部分是处理和学习高维、大容量感官信息流的能力。例如,控制工厂泵的代理不断接收有关温度和能耗的感官信息,从而不断实时调整电机速度以优化性能。为了做出这样的决定,代理人需要能够预测其行为的长期结果。例如,如果工业代理可以预测电机的长期温度,给定系统的当前状态,他们可以使用这些预测来改进他们的决策,并确保电机不损坏。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
                item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ patent.updateTime }}
White, Martha其他文献
White, Martha的其他文献
{{
              item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
{{ truncateString('White, Martha', 18)}}的其他基金
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:RGPIN-2018-05721 
- 财政年份:2022
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Individual 
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:RGPIN-2018-05721 
- 财政年份:2021
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Individual 
Optimizing the treatment of drinking water using reinforcement learning
使用强化学习优化饮用水处理
- 批准号:520966-2017 
- 财政年份:2020
- 资助金额:$ 2.84万 
- 项目类别:Collaborative Research and Development Grants 
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:522586-2018 
- 财政年份:2019
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Accelerator Supplements 
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:RGPIN-2018-05721 
- 财政年份:2019
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Individual 
Optimizing the treatment of drinking water using reinforcement learning
使用强化学习优化饮用水处理
- 批准号:520966-2017 
- 财政年份:2019
- 资助金额:$ 2.84万 
- 项目类别:Collaborative Research and Development Grants 
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:RGPIN-2018-05721 
- 财政年份:2018
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Individual 
Optimizing the treatment of drinking water using reinforcement learning
使用强化学习优化饮用水处理
- 批准号:520966-2017 
- 财政年份:2018
- 资助金额:$ 2.84万 
- 项目类别:Collaborative Research and Development Grants 
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:DGECR-2018-00161 
- 财政年份:2018
- 资助金额:$ 2.84万 
- 项目类别:Discovery Launch Supplement 
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:522586-2018 
- 财政年份:2018
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Accelerator Supplements 
相似海外基金
Task Representations in Ventral Tegmental Area Dopamine Neurons across Shifts in Behavioral Strategy and Reward Expectation
腹侧被盖区多巴胺神经元的任务表征跨越行为策略和奖励期望的转变
- 批准号:10679825 
- 财政年份:2023
- 资助金额:$ 2.84万 
- 项目类别:
P2: Geometry of Neural Representations and Dynamics
P2:神经表征和动力学的几何
- 批准号:10705964 
- 财政年份:2023
- 资助金额:$ 2.84万 
- 项目类别:
Learning causal representations for fair reinforcement learning
学习公平强化学习的因果表示
- 批准号:547940-2020 
- 财政年份:2022
- 资助金额:$ 2.84万 
- 项目类别:Postgraduate Scholarships - Doctoral 
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:RGPIN-2018-05721 
- 财政年份:2022
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Individual 
Learning good representations for and with reinforcement learning
通过强化学习学习良好的表征
- 批准号:RGPIN-2017-06788 
- 财政年份:2021
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Individual 
Experimental and modeling investigations into microcircuit, cellular and subcellular determinants of hippocampal ensemble recruitment to contextual representations
对海马体集合招募到情境表征的微电路、细胞和亚细胞决定因素的实验和建模研究
- 批准号:10535439 
- 财政年份:2021
- 资助金额:$ 2.84万 
- 项目类别:
Exploring Disentangled Representations for Model-Based Reinforcement Learning
探索基于模型的强化学习的解缠结表示
- 批准号:568619-2021 
- 财政年份:2021
- 资助金额:$ 2.84万 
- 项目类别:Canadian Graduate Scholarships Foreign Study Supplements 
Experimental and modeling investigations into microcircuit, cellular and subcellular determinants of hippocampal ensemble recruitment to contextual representations
对海马体集合招募到情境表征的微电路、细胞和亚细胞决定因素的实验和建模研究
- 批准号:10321652 
- 财政年份:2021
- 资助金额:$ 2.84万 
- 项目类别:
Experimental and modeling investigations into microcircuit, cellular and subcellular determinants of hippocampal ensemble recruitment to contextual representations
对海马体集合招募到情境表征的微电路、细胞和亚细胞决定因素的实验和建模研究
- 批准号:10097137 
- 财政年份:2021
- 资助金额:$ 2.84万 
- 项目类别:
Sparse representations for reinforcement learning
强化学习的稀疏表示
- 批准号:RGPIN-2018-05721 
- 财政年份:2021
- 资助金额:$ 2.84万 
- 项目类别:Discovery Grants Program - Individual 

 刷新
              刷新
            
















 {{item.name}}会员
              {{item.name}}会员
            



