New Simulation-Based Approaches to Solving Markov Decision Processes
解决马尔可夫决策过程的基于仿真的新方法
基本信息
- 批准号:9988867
- 负责人:
- 金额:$ 44.07万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2000
- 资助国家:美国
- 起止时间:2000-09-01 至 2004-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The research to be performed will develop simulation-based algorithms for numerical solution of Markov Decision Processes (MDPs), which can be used to model complex systems in manufacturing, telecommunications, and finance. Two new approaches that offer potential benefits not found in currently available methods will be explored. The first approach will use ordinal optimization (OO) for choosing actions in the backwards induction step for finite horizon problems, or in the policy iteration or value iteration step for infinite horizon problems. The second approach will use simultaneous perturbation stochastic approximation (SPSA) for optimizing high-dimensional parameterized MDPs.If successful, the results of the research will lead to dramatically more efficient algorithms for solving MDPs of practical interest in a number of application areas, from financial engineering to production systems. The impact of successfully applying high-dimensional solution methodologies to these problems would represent a major advance in developing computationally tractable methods for solving complex problems of sequential decision making under uncertainty. Furthermore, theoretical results are envisioned that would rigorously establish faster rates of convergence for the new algorithms over convergence rates from usual Monte Carlo simulation.
研究将开发基于模拟的算法,用于马尔可夫决策过程(MDP)的数值解,可用于模拟制造,电信和金融领域的复杂系统。 将探讨两种新的方法,这些方法提供了现有方法所没有的潜在好处。 第一种方法将使用顺序优化(OO)选择有限时间问题的向后归纳步骤中的动作,或在无限时间问题的策略迭代或值迭代步骤中的动作。 第二种方法将使用同步扰动随机逼近(SPSA)优化高维参数化MDP,如果成功的话,研究的结果将导致显着更有效的算法,解决MDP的实际利益在一些应用领域,从金融工程到生产系统。 成功地应用高维的解决方案的方法,这些问题的影响将代表一个重大的进步,在发展计算上易于处理的方法来解决复杂的问题,顺序决策的不确定性。 此外,理论结果设想,将严格建立更快的收敛速度的新算法的收敛速度从通常的蒙特卡罗模拟。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Michael Fu其他文献
Association between Body Mass Index and Risk of Aortic Stenosis in Women
女性体重指数与主动脉瓣狭窄风险之间的关系
- DOI:
10.1101/2023.09.26.23296191 - 发表时间:
2023 - 期刊:
- 影响因子:2.5
- 作者:
S. Kontogeorgos;Annika Rosengren;T. Z. Sandström;Michael Fu;Martin Lindgren;C. Md;M. Md;MD PhD Demir Djekic;E. Thunström - 通讯作者:
E. Thunström
Cartilage-Preserving Arthroscopic-Assisted Radiofrequency Ablation of Periacetabular Osteoid Osteoma in a Young Adult Hip
- DOI:
10.1016/j.eats.2020.03.024 - 发表时间:
2020-07-01 - 期刊:
- 影响因子:
- 作者:
Alexander C. Newhouse;Daniel M. Wichman;Michael Fu;Shane J. Nho - 通讯作者:
Shane J. Nho
A Formal Explainer for Just-In-Time Defect Predictions
即时缺陷预测的正式解释器
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:4.4
- 作者:
Jinqiang Yu;Michael Fu;Alexey Ignatiev;C. Tantithamthavorn;Peter J. Stuckey - 通讯作者:
Peter J. Stuckey
Impact-based forecasting for improving the capacity of typhoon-related disaster risk reduction in typhoon committee region
- DOI:
10.1016/j.tcrr.2022.09.003 - 发表时间:
2022-09-01 - 期刊:
- 影响因子:
- 作者:
Jixin Yu;Jinping Liu;Ji-Won Baek;Clarence Fong;Michael Fu - 通讯作者:
Michael Fu
Proactive resource provisioning
主动资源配置
- DOI:
10.1016/j.comcom.2004.02.019 - 发表时间:
2004 - 期刊:
- 影响因子:0
- 作者:
E. Chi;Michael Fu;J. Walrand - 通讯作者:
J. Walrand
Michael Fu的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Michael Fu', 18)}}的其他基金
Collaborative Research: SCH: Optimal Desensitization Protocol in Support of a Kidney Paired Donation (KPD) System
合作研究:SCH:支持肾脏配对捐赠 (KPD) 系统的最佳脱敏方案
- 批准号:
2123684 - 财政年份:2021
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant
CAREER: Maintaining volitional effort during electrical stimulation-assisted stroke rehabilitation
职业:在电刺激辅助中风康复期间保持意志力
- 批准号:
1942402 - 财政年份:2020
- 资助金额:
$ 44.07万 - 项目类别:
Continuing Grant
New Approaches for Simulation-Based Optimal Decision Making
基于仿真的最优决策的新方法
- 批准号:
1434419 - 财政年份:2015
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant
New Computational Approaches for Markov Decision Processes
马尔可夫决策过程的新计算方法
- 批准号:
0323220 - 财政年份:2004
- 资助金额:
$ 44.07万 - 项目类别:
Continuing Grant
U. S. - France (INRIA) Cooperative Research Improving the Efficiency of Manufacturing Systems by Integrating Production Control into Maintenance Policies
美国-法国 (INRIA) 合作研究通过将生产控制纳入维护策略来提高制造系统的效率
- 批准号:
0070866 - 财政年份:2000
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant
U.S.-France Cooperative Research (INRIA): Perturbation Analysis and Parallel Computing for Production Management
美法合作研究(INRIA):生产管理的扰动分析和并行计算
- 批准号:
9402580 - 财政年份:1995
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant
相似国自然基金
Simulation and certification of the ground state of many-body systems on quantum simulators
- 批准号:
- 批准年份:2020
- 资助金额:40 万元
- 项目类别:
相似海外基金
Development of a new asset-management approach using a fast simulation technique based upon probability measure transformation
使用基于概率测度转换的快速模拟技术开发新的资产管理方法
- 批准号:
23K11000 - 财政年份:2023
- 资助金额:
$ 44.07万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Development of a new swallowing assessment based on musculoskeletal simulation
基于肌肉骨骼模拟的新型吞咽评估的开发
- 批准号:
23K09290 - 财政年份:2023
- 资助金额:
$ 44.07万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Elucidation of new supportive environmental factors to improve eating habits among students: Verification based on simulation analysis
阐明改善学生饮食习惯的新支持环境因素:基于模拟分析的验证
- 批准号:
20K02384 - 财政年份:2020
- 资助金额:
$ 44.07万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Enabling the Design of Failure-Tolerant Complex Engineered Systems using New Network-Based Modeling, Analysis, and Simulation Formalisms
使用新的基于网络的建模、分析和仿真形式实现容错复杂工程系统的设计
- 批准号:
1562027 - 财政年份:2016
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant
New Approaches for Simulation-Based Optimal Decision Making
基于仿真的最优决策的新方法
- 批准号:
1434419 - 财政年份:2015
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant
Shape aware Computer Aided Tolerancing: A new methodical and computational framework for the assembly and mobility simulation based on Skin Model Shapes (ShapeCAN)
形状感知计算机辅助公差:基于蒙皮模型形状 (ShapeCAN) 的装配和移动模拟的新方法和计算框架
- 批准号:
278389853 - 财政年份:2015
- 资助金额:
$ 44.07万 - 项目类别:
Research Grants
Development of a new simulation method for wildfire based on fluid and combustion interaction analysis
基于流体和燃烧相互作用分析的野火模拟新方法的开发
- 批准号:
26420463 - 财政年份:2014
- 资助金额:
$ 44.07万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Development of a new human simulation model for preventing health problems due to severe thermal environments based on detailed measurements of cardiovascular system
基于心血管系统的详细测量,开发一种新的人体模拟模型,用于预防恶劣热环境引起的健康问题
- 批准号:
26820245 - 财政年份:2014
- 资助金额:
$ 44.07万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
Collaborative Research: A New Paradigm for Simulation Optimization: Marriage between Expectation-Maximization and Model-Based Optimization
协作研究:仿真优化的新范式:期望最大化与基于模型的优化的结合
- 批准号:
1413790 - 财政年份:2013
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant
New Methods of Fault Simulation and Location for Smart Grids Based on Synchronized Measurements
基于同步测量的智能电网故障模拟与定位新方法
- 批准号:
1128383 - 财政年份:2012
- 资助金额:
$ 44.07万 - 项目类别:
Standard Grant














{{item.name}}会员




