登录/注册

{{ userInfo.is_pay ? '专属管家' : '联系客服' }}

调研领500喵币

开通猫会员

用户头像

{{ userInfo.nickname }}

个人中心

ID: {{ userInfo.uid }}

复制

会员有效期至{{dayjs(userInfo?.membership_time * 1000).format('YYYY.MM.DD')}}

开通会员尊享 16+ 权益

{{isVip ? '立即续费' : '立即开通'}}

智能选题

智能选题

课程8折

智能标书

文献分析

文献分析

更多特权

更多特权

剩余喵币

{{userInfo.mew_coin_count}}

{{userInfo.over_mew_coin || 0}}喵币将在本周失效

专属邀请码

复制

{{userInfo.share?.code}}

邀好友注册得200喵币/人任务中心

任务中心

退出账号

{{loginType === 2 ? '微信扫码注册' : '欢迎来到猫眼课题宝'}}

登录二维码

刷新登录二维码

刷新

登录即代表您同意并遵守《隐私协议》

为了保证账户安全，请在
微信「猫眼课题宝」内点击授权

重新扫码

刷新登录二维码

刷新

登录即代表您同意并遵守《隐私协议》

账号注册

您好~为了给您提供更精准的分析体验，需完善基础信息！所有信息100%保密，请放心填写！

立即使用

切换微信登录

*注：建议或bug反馈被采纳后获得{{feedback_mew_coin}}喵币奖励，请关注公众号模版消息通知

取消

提交

已收到您的反馈，我们会尽快处理。若内容被采纳你将获得{{feedback_mew_coin}}喵币奖励。请关注《猫眼课题宝》消息通知。

{{ChannelMewCoin}}

喵币已到账！

*喵币用于产品体验解锁使用，有效期 30 天

在猫眼课题宝您可以：

{{item.title}}

{{item.desc}}

微信扫码添加小助理，回复“调研”
领取调研问卷

首次添加还可额外获得
{{customer_mew_coin}}喵币奖励哦！

完成问卷填写，立得{{question_mew_coin}}喵币奖励

永久回看权已生效！

直播主题

《{{latestCourse?.name}}》

立即去查看

7天猫会员

有效期至：{{dayjs(userInfo.membership_time * 1000).format('YYYY-MM-DD HH:mm')}}

已送您“7天会员体验卡+500喵币”

次数升级

享智能标书等多功能月解锁次数1次

10次

优享折扣

获会员期内充值喵币 8折等3大折扣

开心收下

永久回看权已生效！

课程

《{{giftRes?.img}}》

立即去查看

永久回看权已生效！

课程

《{{receiveTrainingCourseInfo?.name}}》

立即去查看

{{userInfo?.nickname}}

猫会员

{{vipStr}}

后失效

·查看权益对比·

猫会员

（全方位提升课题决策能力）

会员专属

升级猫会员：购买喵币享 8 折优惠

免费领最高 6W 喵币

二维码

{{qrCodeError}}

请先阅读
服务协议并同意

扫码添加「专属客服」
了解团购优惠方案

客服在线时间：工作日9:00-18:00

￥

{{currentInfo?.price}}

已优惠{{ _.floor(_.toNumber(currentInfo.original_price) - _.toNumber(currentInfo.price), 0) }}元

倒计时

支持：

支付宝

支付宝/

微信

*信息服务类购买后不支持退款

请阅读并同意《猫眼课题宝服务协议》

常见问题

会员权益说明

会员权益对比

权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

- 微信扫一扫 -

请添加您的「专属会员管家」
提供专属会员服务

Deep Bayesian Reinforcement Learning in Changing Environments

不断变化的环境中的深度贝叶斯强化学习

基本信息

批准号：
2724208
负责人：
金额：
--
依托单位：
University College London
依托单位国家：
英国
项目类别：
Studentship
财政年份：
2022
资助国家：
英国
起止时间：
2022 至无数据
项目状态：
未结题

来源：
https://gtr.ukri.org/projects?ref=studentship-2724208
关键词：
Deep Bayesian Reinforcement Learning Changing

项目摘要

Deep Reinforcement Learning (DRL) worked well in a wide range of games with a fixed environment, such as Go and Starcraft. But most real-world environments change over time and are influenced by random factors such as weather. So, the nonstationarity of the environment in DRL requires more attention. This PhD research sheds light on Reinforcement Learning in changing environments with Bayesian approach since Bayesian DRL can deal with environments with uncertainty. All we need is to treat the environmental changes as uncertainty. In general, I plan to pursue four research directions to help fast learning in changing environments with Bayesian RL. I first investigate utilising prior experience to identify current environment status in a changing environment setting and then learn strategies. Then I suggest using the meta-learning tricks to facilitate adaptation to new environments in Bayesian RL. Finally, I propose less conservative Robust RL and more efficient Safe RL in changing environments with a Bayesian approach. These Bayesian RL directions can contribute to an efficient, safe, and robust deep Bayesian RL in changing environments.

深度强化学习（DRL）在具有固定环境的各种游戏中运行良好，例如围棋和星际争霸。但大多数现实世界的环境会随着时间的推移而变化，并受到天气等随机因素的影响。因此，日间行车灯环境的非平稳性需要引起更多的关注。这项博士研究揭示了贝叶斯方法在不断变化的环境中的强化学习，因为贝叶斯DRL可以处理具有不确定性的环境。我们所需要的是把环境变化当作不确定性来对待。总的来说，我计划追求四个研究方向，以帮助贝叶斯RL在不断变化的环境中快速学习。我首先调查利用以前的经验，以确定当前的环境状况在不断变化的环境设置，然后学习策略。然后，我建议使用元学习技巧来促进贝叶斯RL对新环境的适应。最后，我提出了不太保守的鲁棒强化学习和更有效的安全强化学习在不断变化的环境中的贝叶斯方法。这些贝叶斯RL方向可以在不断变化的环境中实现高效，安全和鲁棒的深度贝叶斯RL。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

{{ item.title }}

{{ item.translation_title }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

其他文献

Internet-administered, low-intensity cognitive behavioral therapy for parents of children treated for cancer: A feasibility trial (ENGAGE).

针对癌症儿童父母的互联网管理、低强度认知行为疗法：可行性试验 (ENGAGE)。

DOI：
10.1002/cam4.5377
发表时间：
2023-03
期刊：
Cancer medicine
影响因子：
4
作者：
通讯作者：

Differences in child and adolescent exposure to unhealthy food and beverage advertising on television in a self-regulatory environment.

在自我监管的环境中，儿童和青少年在电视上接触不健康食品和饮料广告的情况存在差异。

DOI：
10.1186/s12889-023-15027-w
发表时间：
2023-03-23
期刊：
BMC public health
影响因子：
4.5
作者：
通讯作者：

The association between rheumatoid arthritis and reduced estimated cardiorespiratory fitness is mediated by physical symptoms and negative emotions: a cross-sectional study.

类风湿性关节炎与估计心肺健康降低之间的关联是由身体症状和负面情绪介导的：一项横断面研究。

DOI：
10.1007/s10067-023-06584-x
发表时间：
2023-07
期刊：
Clinical rheumatology
影响因子：
3.4
作者：
通讯作者：

ElasticBLAST: accelerating sequence search via cloud computing.

ElasticBLAST：通过云计算加速序列搜索。

DOI：
10.1186/s12859-023-05245-9
发表时间：
2023-03-26
期刊：
BMC bioinformatics
影响因子：
3
作者：
通讯作者：

Amplified EQCM-D detection of extracellular vesicles using 2D gold nanostructured arrays fabricated by block copolymer self-assembly.

使用通过嵌段共聚物自组装制造的 2D 金纳米结构阵列放大 EQCM-D 检测细胞外囊泡。

DOI：
10.1039/d2nh00424k
发表时间：
2023-03-27
期刊：
Nanoscale horizons
影响因子：
9.7
作者：
通讯作者：

的其他文献

{{ item.title }}

{{ item.translation_title }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

{{ truncateString('', 18)}}的其他基金

An implantable biosensor microsystem for real-time measurement of circulating biomarkers

用于实时测量循环生物标志物的植入式生物传感器微系统

批准号：
2901954
财政年份：
2028
资助金额：
--
项目类别：
Studentship

Exploiting the polysaccharide breakdown capacity of the human gut microbiome to develop environmentally sustainable dishwashing solutions

利用人类肠道微生物群的多糖分解能力来开发环境可持续的洗碗解决方案

批准号：
2896097
财政年份：
2027
资助金额：
--
项目类别：
Studentship

A Robot that Swims Through Granular Materials

可以在颗粒材料中游动的机器人

批准号：
2780268
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Likelihood and impact of severe space weather events on the resilience of nuclear power and safeguards monitoring.

严重空间天气事件对核电和保障监督的恢复力的可能性和影响。

批准号：
2908918
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Proton, alpha and gamma irradiation assisted stress corrosion cracking: understanding the fuel-stainless steel interface

质子、α 和 γ 辐照辅助应力腐蚀开裂：了解燃料-不锈钢界面

批准号：
2908693
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Field Assisted Sintering of Nuclear Fuel Simulants

核燃料模拟物的现场辅助烧结

批准号：
2908917
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Assessment of new fatigue capable titanium alloys for aerospace applications

评估用于航空航天应用的新型抗疲劳钛合金

批准号：
2879438
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Developing a 3D printed skin model using a Dextran - Collagen hydrogel to analyse the cellular and epigenetic effects of interleukin-17 inhibitors in

使用右旋糖酐-胶原蛋白水凝胶开发 3D 打印皮肤模型，以分析白细胞介素 17 抑制剂的细胞和表观遗传效应

批准号：
2890513
财政年份：
2027
资助金额：
--
项目类别：
Studentship

CDT year 1 so TBC in Oct 2024

CDT 第 1 年，预计 2024 年 10 月

批准号：
2879865
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Understanding the interplay between the gut microbiome, behavior and urbanisation in wild birds

了解野生鸟类肠道微生物组、行为和城市化之间的相互作用

批准号：
2876993
财政年份：
2027
资助金额：
--
项目类别：
Studentship

相似国自然基金

多元纵向数据与复发事件和终止事件的Bayesian联合模型研究

批准号：
82173628
批准年份：
2021
资助金额：
52 万元
项目类别：
面上项目

三维地质模型约束下地球化学场的Bayesian-MCMC推断

批准号：
42072326
批准年份：
2020
资助金额：
63 万元
项目类别：
面上项目

基于Bayesian Kriging模型的压射机构稳健优化设计基础研究

批准号：
51875209
批准年份：
2018
资助金额：
59.0 万元
项目类别：
面上项目

X射线图像分析中的MCMC-Bayesian理论与计算方法研究

批准号：
U1830105
批准年份：
2018
资助金额：
62.0 万元
项目类别：
联合基金项目

基于Bayesian位移场的SAR图像精确配准方法研究

批准号：
41601345
批准年份：
2016
资助金额：
19.0 万元
项目类别：
青年科学基金项目

多结局Bayesian联合生存模型及糖尿病并发症预测研究

批准号：
81673274
批准年份：
2016
资助金额：
50.0 万元
项目类别：
面上项目

基于Meta流行病学和Bayesian方法构建针刺干预无偏倚风险效果评价体系研究

批准号：
81403276
批准年份：
2014
资助金额：
23.0 万元
项目类别：
青年科学基金项目

BtoC电子商务中基于分层Bayesian网络的信任与声誉计算理论研究

批准号：
71302080
批准年份：
2013
资助金额：
20.0 万元
项目类别：
青年科学基金项目

基于Bayesian网络的坚硬顶板条件下煤与瓦斯突出预警控制机理研究

批准号：
51274089
批准年份：
2012
资助金额：
80.0 万元
项目类别：
面上项目

Bayesian实物期权及在信用风险决策中的应用

批准号：
71071027
批准年份：
2010
资助金额：
23.0 万元
项目类别：
面上项目

相似海外基金

Scalable Bayesian Reinforcement Learning in the Games Industry

游戏行业中的可扩展贝叶斯强化学习

批准号：
2890029
财政年份：
2023
资助金额：
--
项目类别：
Studentship

Affordable High Quality Control Using Structured Bayesian Reinforcement Learning for Articulated Robot in Biomedical Applications

使用结构化贝叶斯强化学习对生物医学应用中的铰接式机器人进行经济实惠的高质量控制

批准号：
23K16976
财政年份：
2023
资助金额：
--
项目类别：
Grant-in-Aid for Early-Career Scientists

Fully Bayesian Reinforcement Learning for Control of Continuous Industrial Processes

用于控制连续工业过程的完全贝叶斯强化学习

批准号：
2640133
财政年份：
2021
资助金额：
--
项目类别：
Studentship

Safe Artificial Intelligence with Bayesian Reinforcement Learning

通过贝叶斯强化学习实现安全人工智能

批准号：
534795-2019
财政年份：
2021
资助金额：
--
项目类别：
Alexander Graham Bell Canada Graduate Scholarships - Doctoral

Safe Artificial Intelligence with Bayesian Reinforcement Learning

通过贝叶斯强化学习实现安全人工智能

批准号：
534795-2019
财政年份：
2020
资助金额：
--
项目类别：
Alexander Graham Bell Canada Graduate Scholarships - Doctoral

Safe Artificial Intelligence with Bayesian Reinforcement Learning

通过贝叶斯强化学习实现安全人工智能

批准号：
534795-2019
财政年份：
2019
资助金额：
--
项目类别：
Alexander Graham Bell Canada Graduate Scholarships - Doctoral

RTML: Small: Real-Time Model-Based Bayesian Reinforcement Learning

RTML：小型：基于实时模型的贝叶斯强化学习

批准号：
1937396
财政年份：
2019
资助金额：
--
项目类别：
Standard Grant

Bayesian Deep Reinforcement Learning

贝叶斯深度强化学习

批准号：
2243850
财政年份：
2019
资助金额：
--
项目类别：
Studentship

Deep Bayesian Reinforcement Learning

深度贝叶斯强化学习

批准号：
522237-2018
财政年份：
2018
资助金额：
--
项目类别：
Engage Plus Grants Program

RI: SMALL: Robust Reinforcement Learning Using Bayesian Models

RI：小：使用贝叶斯模型的鲁棒强化学习

批准号：
1815275
财政年份：
2018
资助金额：
--
项目类别：
Standard Grant

{{ showInfoDetail.title }}

成果类型：
{{ showInfoTypeEnum[showInfoType] }}

学术检索：
百度学术

作者：{{ showInfoDetail.author }}

知道了