CAREER: Structural Estimation and Optimization for Partially Observable Markov Decision Processes and Markov Games
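Basic information
- Award number: 2236477
- Principal investigator:
- Amount: $525,000
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2023
- Funding country: United States
- Project period: 2023-02-01 to 2028-01-31
- Project status: Ongoing
- Source:
- Keywords:
Project abstract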
This Faculty Early Career Development Program (CAREER) grant will contribute to the Nation's economic prosperity by developing analytical methods to enhance security and risk management of supply chains. As a particular use case, the project will focus on sustainable seafood supply chain operations. The US is the second largest consumer of seafood in the world, and fisheries agencies are seeking substantial reforms in management practices to better manage fishery population dynamics. The effectiveness of management strategies hinges upon fish stock assessment, which is subject to many sources of uncertainty and noisy information and is further compounded by illegal, unreported and unregulated (IUU) fishing, which steals natural resources, threatens ocean ecosystems and seafood supply, and undermines port and maritime security. This award supports development of a new framework, analytics, and algorithms that can learn preferences and behavior of fishermen and fishing adversaries from imperfect data, identify ways to modulate their behavior, and search for effective strategies to promote sustainable operations and to combat IUU fishing. The educational plan will utilize similar methods to elicit students' needs and preferences from observable data and to design effective education strategies that promote inclusion, equity, diversity, and accessibility. The research will be informed through collaboration with the US Coast Guard Academy and the North Carolina Division of Marine Fisheries. In addition, tutorials and workshops on data analytics involving both students and supply chain practitioners will provide important recruitment and outreach opportunities.
This research will investigate a methodological framework comprising structural estimation, optimization, and integrated analysis for dynamic decision making under imperfect information. The current literature on learning and optimizing dynamic decisions mainly assumes that the system is perfectly observable. While there is an extensive literature on the analysis and optimization of partially observable Markov decision processes (POMDPs), this project will focus on the inverse estimation of the primitives of a POMDP model based upon observable histories, an understudied area. Both optimization and inverse estimation for multi-party decision processes are also considered through partially observable Markov games (POMGs). This research addresses a knowledge gap by developing (i) new estimation methods to learn model parameters of POMDPs and POMGs from their corresponding data trajectories; (ii) efficient solution procedures for leader-follower POMGs with imprecise reward; and (iii) an integrated methodology of Estimation-Optimization-Analysis based on POMDPs and POMGs to improve an agent's performance by learning, modulating in a targeted way, and adapting to other agents' decision behaviors. The methodologies will be applied to improve fish stock rebuilding efforts, support defense agencies in combating IUU fishing, and identify best practices of course delivery strategies in education.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project outcomes
Journal articles (1)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Structural Estimation of Partially Observable Markov Decision Processes
- DOI: 10.1109/tac.2022.3217908
- Published: 2020-08
- Journal:
- Impact factor: 6.8
- Authors: Yanling Chang; Alfredo Garcia; Zhide Wang; Lu Sun
- Corresponding authors: Yanling Chang; Alfredo Garcia; Zhide Wang; Lu Sun
Other publications by Yanling Chang
Temporal-spectral imaging of optical pulses using time lens
- DOI: 10.1117/12.802749
- Published: 2008
- Journal:
- Impact factor: 0
- Authors: Zhongwei Tan; Yanling Chang; Wenhua Ren; Jihong Cao; S. Jian
- Corresponding author: S. Jian
The Value of Misinformation and Disinformation.
- DOI:
- Published: 2019
- Journal:
- Impact factor: 0
- Authors: Yanling Chang; Matthew F. Keblis; Ran Li; E. Iakovou; C. White
- Corresponding author: C. White
ZnO nanocones and nanoplatelets: synthesis and characterization
- DOI:
- Published: 2010-08
- Journal:
- Impact factor: 0
- Authors: Yanling Chang
- Corresponding author: Yanling Chang
Laboratory study on the evolution of waves parameters due to wave breaking in deep water
- DOI: 10.1016/j.wavemoti.2016.08.010
- Published: 2017
- Journal:
- Impact factor: 2.4
- Authors: Shuxiu Liang; Yihui Zhang; Zhaochen Sun; Yanling Chang
- Corresponding author: Yanling Chang
Dysregulation of β-catenin by hepatitis B virus X protein in HBV-infected human hepatocellular carcinomas
- DOI:
- Published: 2010
- Journal:
- Impact factor: 0
- Authors: Lei Chen; Liang Hu; Liang Li; Yuan Liu; Qian; Yanling Chang; Hexin Yan; Meng; Hong
- Corresponding author: Hong
Other grants held by Yanling Chang
Dynamic Discrete Choice Estimation with Partially Observable States and Hidden Dynamics
- Award number: 2048395
- Fiscal year: 2021
- Amount: $525,000
- Project type: Standard Grant
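Similar grants from the National Natural Science Foundation of China
Understanding structural evolution of galaxies with machine learning
- Award number:
- Year approved: 2022
- Amount: CNY 100,000
- Project type: Provincial/municipal-level project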
Similar overseas grants
Structural Estimation and Impact Decomposition in Centralized School Choice
- Award number: 22K13412
- Fiscal year: 2022
- Amount: $525,000
- Project type: Grant-in-Aid for Early-Career Scientists
Machine learning approach to the estimation of spillover effects and their structural changes
- Award number: 22K20154
- Fiscal year: 2022
- Amount: $525,000
- Project type: Grant-in-Aid for Research Activity Start-up
Structural estimation on the statistical learning model: Application on the competition and social policy for the digital economy
- Award number: 22K01485
- Fiscal year: 2022
- Amount: $525,000
- Project type: Grant-in-Aid for Scientific Research (C)
Estimation of amount of air-borne salt adhered to structural surface by multiscale airflow analysis
- Award number: 22K20453
- Fiscal year: 2022
- Amount: $525,000
- Project type: Grant-in-Aid for Research Activity Start-up
Structural Dynamics Identification Using Motion Estimation and Video Magnification
- Award number: 2230218
- Fiscal year: 2022
- Amount: $525,000
- Project type: Standard Grant
A preview-type structural control based on the earthquake waveform estimation with a seismic observation network: Performance enhancement of the waveform estimation system
- Award number: 22K03995
- Fiscal year: 2022
- Amount: $525,000
- Project type: Grant-in-Aid for Scientific Research (C)
Realization of highly accurate estimation of structural vibration characteristics by blind deconvolution and Bayesian inference
- Award number: 22K18830
- Fiscal year: 2022
- Amount: $525,000
- Project type: Grant-in-Aid for Challenging Research (Exploratory)
Lighting Environment Estimation based on Structural Coloration
- Award number: 21K11962
- Fiscal year: 2021
- Amount: $525,000
- Project type: Grant-in-Aid for Scientific Research (C)
Estimation of post-harvest vegetation recovery based on forest structural attributes using time series of satellite data
- Award number: 21K14883
- Fiscal year: 2021
- Amount: $525,000
- Project type: Grant-in-Aid for Early-Career Scientists
Estimation and inferences of structural changes in panel data models
- Award number: 21K20121
- Fiscal year: 2021
- Amount: $525,000
- Project type: Grant-in-Aid for Research Activity Start-up













