登录/注册

调研领500喵币

开通猫会员

用户头像

{{ userInfo.nickname }}

个人中心

ID: {{ userInfo.uid }}

复制

会员有效期至{{dayjs(userInfo?.membership_time * 1000).format('YYYY.MM.DD')}}

开通会员尊享 16+ 权益

{{isVip ? '立即续费' : '立即开通'}}

智能选题

智能选题

课程8折

智能标书

文献分析

文献分析

更多特权

更多特权

剩余喵币

{{userInfo.mew_coin_count}}

{{userInfo.over_mew_coin || 0}}喵币将在本周失效

专属邀请码

复制

{{userInfo.share?.code}}

邀好友注册得200喵币/人任务中心

任务中心

退出账号

{{loginType === 2 ? '微信扫码注册' : '欢迎来到猫眼课题宝'}}

登录二维码

刷新登录二维码

刷新

登录即代表您同意并遵守《隐私协议》

为了保证账户安全，请在
微信「猫眼课题宝」内点击授权

重新扫码

刷新登录二维码

刷新

登录即代表您同意并遵守《隐私协议》

账号注册

您好~为了给您提供更精准的分析体验，需完善基础信息！所有信息100%保密，请放心填写！

立即使用

切换微信登录

*注：建议或bug反馈被采纳后获得{{feedback_mew_coin}}喵币奖励，请关注公众号模版消息通知

取消

提交

已收到您的反馈，我们会尽快处理。若内容被采纳你将获得{{feedback_mew_coin}}喵币奖励。请关注《猫眼课题宝》消息通知。

{{ChannelMewCoin}}

喵币已到账！

*喵币用于产品体验解锁使用，有效期 30 天

在猫眼课题宝您可以：

{{item.title}}

{{item.desc}}

微信扫码添加小助理，回复“调研”
领取调研问卷

首次添加还可额外获得
{{customer_mew_coin}}喵币奖励哦！

完成问卷填写，立得{{question_mew_coin}}喵币奖励

永久回看权已生效！

直播主题

《{{latestCourse?.name}}》

立即去查看

7天猫会员

有效期至：{{dayjs(userInfo.membership_time * 1000).format('YYYY-MM-DD HH:mm')}}

已送您“7天会员体验卡+500喵币”

次数升级

享智能标书等多功能月解锁次数1次

10次

优享折扣

获会员期内充值喵币 8折等3大折扣

开心收下

永久回看权已生效！

课程

《{{giftRes?.img}}》

立即去查看

永久回看权已生效！

课程

《{{receiveTrainingCourseInfo?.name}}》

立即去查看

{{userInfo?.nickname}}

猫会员

{{vipStr}}

后失效

·查看权益对比·

猫会员

（全方位提升课题决策能力）

会员专属

升级猫会员：购买喵币享 8 折优惠

免费领最高 6W 喵币

二维码

{{qrCodeError}}

请先阅读
服务协议并同意

扫码添加「专属客服」
了解团购优惠方案

客服在线时间：工作日9:00-18:00

￥

{{currentInfo?.price}}

已优惠{{ _.floor(_.toNumber(currentInfo.original_price) - _.toNumber(currentInfo.price), 0) }}元

倒计时

支持：

支付宝

支付宝/

微信

*信息服务类购买后不支持退款

请阅读并同意《猫眼课题宝服务协议》

常见问题

会员权益说明

会员权益对比

权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

- 微信扫一扫 -

请添加您的「专属会员管家」
提供专属会员服务

Sequential Decision making in probabilistic models

概率模型中的顺序决策

基本信息

批准号：
2744311
负责人：
金额：
--
依托单位：
University of Cambridge
依托单位国家：
英国
项目类别：
Studentship
财政年份：
2020
资助国家：
英国
起止时间：
2020 至无数据
项目状态：
已结题

来源：
https://gtr.ukri.org/projects?ref=studentship-2744311
关键词：
Sequential Decision making probabilistic models

项目摘要

This proposal considers the problem of robust sequential decision making in non-linear environments. Reinforcementlearning has demonstrated high potential for solving complex problems in non-linear environments but has lackedefficiency and robustness. We argue that in order to deploy reinforcement learning agents in the real world, it is essential todevelop similar efficiency and robustness properties that have been developed in control theory. We propose to leveragethe extensive control and probabilistic reasoning literature to improve RL algorithms and present two interesting researchdirections. The first one considers using Sequential Monte-Carlo methods to improve planning for non-linearenvironments. The second direction focuses on designing robust controllers by exploring the connections betweenadversarial learning, robust control theory, and uncertainty modelling.

该建议考虑了非线性环境下的鲁棒序贯决策问题。强化学习在解决非线性环境中的复杂问题方面表现出很高的潜力，但缺乏效率和鲁棒性。我们认为，为了在真实的世界中部署强化学习代理，必须开发类似的效率和鲁棒性，已经在控制理论中开发。我们提出了广泛的控制和概率推理文献，以改善RL算法，并提出了两个有趣的研究方向。第一个考虑使用顺序蒙特-卡罗方法来改善非线性规划。第二个方向侧重于通过探索对抗学习，鲁棒控制理论和不确定性建模之间的联系来设计鲁棒控制器。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

{{ item.title }}

{{ item.translation_title }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

{{ item.title }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

其他文献

吉治仁志他: "トランスジェニックマウスによるTIMP-1の線維化促進機序"最新医学. 55. 1781-1787 (2000)

Hitoshi Yoshiji 等：“转基因小鼠中 TIMP-1 的促纤维化机制”现代医学 55. 1781-1787 (2000)。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

LiDAR Implementations for Autonomous Vehicle Applications

DOI：
发表时间：
2021
期刊：
影响因子：
0
作者：
通讯作者：

生命分子工学・海洋生命工学研究室

生物分子工程/海洋生物技术实验室

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

吉治仁志他: "イラスト医学&サイエンスシリーズ血管の分子医学"羊土社(渋谷正史編). 125 (2000)

Hitoshi Yoshiji 等人：“血管医学与科学系列分子医学图解”Yodosha（涉谷正志编辑）125（2000）。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

Effect of manidipine hydrochloride,a calcium antagonist,on isoproterenol-induced left ventricular hypertrophy: "Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,K.,Teragaki,M.,Iwao,H.and Yoshikawa,J." Jpn Circ J. 62(1). 47-52 (1998)

钙拮抗剂盐酸马尼地平对异丙肾上腺素引起的左心室肥厚的影响：“Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

的其他文献

{{ item.title }}

{{ item.translation_title }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

{{ truncateString('', 18)}}的其他基金

An implantable biosensor microsystem for real-time measurement of circulating biomarkers

用于实时测量循环生物标志物的植入式生物传感器微系统

批准号：
2901954
财政年份：
2028
资助金额：
--
项目类别：
Studentship

Exploiting the polysaccharide breakdown capacity of the human gut microbiome to develop environmentally sustainable dishwashing solutions

利用人类肠道微生物群的多糖分解能力来开发环境可持续的洗碗解决方案

批准号：
2896097
财政年份：
2027
资助金额：
--
项目类别：
Studentship

A Robot that Swims Through Granular Materials

可以在颗粒材料中游动的机器人

批准号：
2780268
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Likelihood and impact of severe space weather events on the resilience of nuclear power and safeguards monitoring.

严重空间天气事件对核电和保障监督的恢复力的可能性和影响。

批准号：
2908918
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Proton, alpha and gamma irradiation assisted stress corrosion cracking: understanding the fuel-stainless steel interface

质子、α 和 γ 辐照辅助应力腐蚀开裂：了解燃料-不锈钢界面

批准号：
2908693
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Field Assisted Sintering of Nuclear Fuel Simulants

核燃料模拟物的现场辅助烧结

批准号：
2908917
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Assessment of new fatigue capable titanium alloys for aerospace applications

评估用于航空航天应用的新型抗疲劳钛合金

批准号：
2879438
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Developing a 3D printed skin model using a Dextran - Collagen hydrogel to analyse the cellular and epigenetic effects of interleukin-17 inhibitors in

使用右旋糖酐-胶原蛋白水凝胶开发 3D 打印皮肤模型，以分析白细胞介素 17 抑制剂的细胞和表观遗传效应

批准号：
2890513
财政年份：
2027
资助金额：
--
项目类别：
Studentship

CDT year 1 so TBC in Oct 2024

CDT 第 1 年，预计 2024 年 10 月

批准号：
2879865
财政年份：
2027
资助金额：
--
项目类别：
Studentship

Understanding the interplay between the gut microbiome, behavior and urbanisation in wild birds

了解野生鸟类肠道微生物组、行为和城市化之间的相互作用

批准号：
2876993
财政年份：
2027
资助金额：
--
项目类别：
Studentship

相似国自然基金

Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis

批准号：
批准年份：
2024
资助金额：
万元
项目类别：
合作创新研究团队

相似海外基金

CRII: CIF: Sequential Decision-Making Algorithms for Efficient Subset Selection in Multi-Armed Bandits and Optimization of Black-Box Functions

CRII：CIF：多臂老虎机中高效子集选择和黑盒函数优化的顺序决策算法

批准号：
2246187
财政年份：
2023
资助金额：
--
项目类别：
Standard Grant

Sequential Decision Making with Imperfect Information: Machine Learning and Information Theory

不完美信息的顺序决策：机器学习和信息论

批准号：
23K17547
财政年份：
2023
资助金额：
--
项目类别：
Grant-in-Aid for Challenging Research (Exploratory)

Construction of Large-Scale Sequential Decision-Making Methods Leveraging Structures

利用结构构建大规模顺序决策方法

批准号：
23K19986
财政年份：
2023
资助金额：
--
项目类别：
Grant-in-Aid for Research Activity Start-up

Collaborative Research: NSF-CSIRO: Fair Sequential Collective Decision-Making

合作研究：NSF-CSIRO：公平顺序集体决策

批准号：
2303000
财政年份：
2023
资助金额：
--
项目类别：
Standard Grant

Collaborative Research: NSF-CSIRO: Fair Sequential Collective Decision-Making

合作研究：NSF-CSIRO：公平顺序集体决策

批准号：
2302999
财政年份：
2023
资助金额：
--
项目类别：
Standard Grant

Collaborative Research: Towards the Foundation of Approximate Sampling-Based Exploration in Sequential Decision Making

协作研究：为顺序决策中基于近似采样的探索奠定基础

批准号：
2323113
财政年份：
2023
资助金额：
--
项目类别：
Standard Grant

Collaborative Research: Towards the Foundation of Approximate Sampling-Based Exploration in Sequential Decision Making

协作研究：为顺序决策中基于近似采样的探索奠定基础

批准号：
2323112
财政年份：
2023
资助金额：
--
项目类别：
Standard Grant

Sequential decision making under uncertainty: fundamental limits and applications

不确定性下的序贯决策：基本限制和应用

批准号：
RGPIN-2020-04256
财政年份：
2022
资助金额：
--
项目类别：
Discovery Grants Program - Individual

Sequential Decision Making in Real-time Digital Advertising

实时数字广告中的顺序决策

批准号：
2749396
财政年份：
2022
资助金额：
--
项目类别：
Studentship

Reinforcement Learning for Sequential Decision Making in FPGA CAD

FPGA CAD 中顺序决策的强化学习

批准号：
571824-2022
财政年份：
2022
资助金额：
--
项目类别：
University Undergraduate Student Research Awards

{{ showInfoDetail.title }}

成果类型：
{{ showInfoTypeEnum[showInfoType] }}

学术检索：
百度学术

作者：{{ showInfoDetail.author }}

知道了