权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Transfer learning for continual learning in non-stationary environments

用于非静态环境中持续学习的迁移学习

基本信息

批准号：
553522-2020
负责人：
Lee, ChiGuhn
金额：
$ 11.98万
依托单位：
University of Toronto
依托单位国家：
加拿大
项目类别：
Alliance Grants
财政年份：
2020
资助国家：
加拿大
起止时间：
2020-01-01 至 2021-12-31
项目状态：
已结题

来源：
https://www.nserc-crsng.gc.ca/ase-oro/Details-Detailles_eng.asp?id=704454
关键词：
Transfer learning continual non stationary

项目摘要

Currently available artificial intelligence algorithms require re-training whenever their application environment undergoes fundamental changes and a new task is given. The ability to build an automated algorithm that can learn reliably under changing situations will therefore set new standards in the field. The main goal of this project is to develop new frameworks in which reinforcement learning algorithms can perform more stably and achieve higher sample and computational efficiency among a set of similar tasks in stochastic environment. That is, we intend to use transfer learning to address continual learning in non-stationary environments. To do so, we propose the following three objectives: 1) We will build adaptive neural network-based reinforcement learning models that will construct a knowledge representation that fits the non-stationary environmental complexity. This can be understood as a particular type of continual learning. 2) We will evaluate the performance of the developed frameworks/models in real-world applications which will be identified in collaboration with LG Electronics Canada. The identified problems may involve vision inspection data as inputs. 3) We will investigate how these frameworks/models perform when transferred to new environments in terms of dynamics, rewards and goals. The transfer learning frameworks/models will be combined with the developed continual learning frameworks/models. To sum up, the proposed project is to advance the current state-of-the-art of reinforcement learning so as to improve the sample and computational efficiency across similarly configured tasks and to robustify its performance in non-stationary environment.

目前可用的人工智能算法需要重新训练时，他们的应用环境发生了根本性的变化，并给出了一个新的任务。因此，构建一种能够在不断变化的情况下可靠学习的自动化算法的能力将在该领域树立新的标准。该项目的主要目标是开发新的框架，使强化学习算法能够在随机环境中的一组相似任务中更稳定地执行，并实现更高的样本和计算效率。也就是说，我们打算使用迁移学习来解决非静态环境中的持续学习。为此，我们提出以下三个目标： 1)我们将构建基于自适应神经网络的强化学习模型，该模型将构建适合非平稳环境复杂性的知识表示。这可以理解为一种特殊的持续学习。 2)我们将评估开发的框架/模型在实际应用中的性能，这些应用将与LG Electronics Canada合作确定。所识别的问题可能涉及视觉检查数据作为输入。 3)我们将研究这些框架/模型在动态、奖励和目标方面转移到新环境时的表现。迁移学习框架/模型将与已开发的持续学习框架/模型相结合。总而言之，该项目旨在推进强化学习的当前发展水平，以提高类似配置任务的样本和计算效率，并增强其在非平稳环境中的性能。