CIF: AF: Small: A Perturbed Markov Chains Approach to Studying Centrality, Mixing and Reinforcement Learning
CIF:AF:小:研究中心性、混合和强化学习的扰动马尔可夫链方法
基本信息
- 批准号:2008130
- 负责人:
- 金额:$ 35.06万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-07-01 至 2024-06-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
By their key role in facilitating many modern innovations such as Internet search via the PageRank algorithm or enabling robot movement using reinforcement learning, Markov chains are an important and versatile modeling plus analysis tool. Further examples of applications of Markov chains include algorithms in recommendation engines, simulation of complex systems using Monte-Carlo methods, inference such as community detection in social networks using random walks, and in analyzing configurations for complex systems, such as extent of opinion spread in social networks. The goal of this project is to develop new foundational results on Markov chains using perturbations of them that are easier to analyze and to simulate, with the end result being both a better understanding of the original Markov chain and the development of novel and efficient algorithms for applications, such as in reinforcement learning and other artificial-intelligence paradigms. The project activities center around the development of mathematical tools to analyze key properties such as convergence to the stationary distribution and mixing of Markov chains using their perturbations, and the use these theoretical advances to develop novel estimation algorithms with provable performance guarantees for PageRank estimation and for reinforcement learning. The specific goals are divided into three thrusts. The first will study properties that are preserved in the perturbed chain from the original chain, and any accompanying implications on inference and optimization problems that Markov chains are used for. The second will study the implications of the general results from the first thrust on the PageRank Markov chain along with Personalized PageRank Markov chains, with the emphasis on accurate but low-complexity estimation. Drawing connections between PageRank estimation and reinforcement learning, the third thrust will develop efficient policy-evaluation and policy-iteration methods for general discounted-cost problems.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
通过它们在促进许多现代创新中的关键作用,例如通过PageRank算法进行互联网搜索或使用强化学习实现机器人运动,马尔可夫链是一种重要的通用建模和分析工具。马尔可夫链的应用的其他示例包括推荐引擎中的算法、使用蒙特-卡罗方法的复杂系统的模拟、诸如使用随机游走的社交网络中的社区检测的推理、以及在分析复杂系统的配置中的算法,诸如社交网络中的意见传播的程度。该项目的目标是使用更容易分析和模拟的扰动来开发马尔可夫链的新基础结果,最终结果是更好地理解原始马尔可夫链,并开发用于应用的新颖有效的算法,例如强化学习和其他人工智能范例。该项目的活动围绕数学工具的开发,以分析关键属性,如收敛到平稳分布和混合马尔可夫链使用其扰动,并使用这些理论的进步,开发新的估计算法与可证明的性能保证PageRank估计和强化学习。具体目标分为三个方面。第一个将研究从原始链中保留在扰动链中的属性,以及马尔可夫链用于推理和优化问题的任何附带影响。第二部分将研究PageRank马尔可夫链沿着个性化PageRank马尔可夫链的第一个推力的一般结果的影响,重点是准确但低复杂度的估计。第三个重点是将PageRank估计和强化学习联系起来,为一般的折扣成本问题开发有效的政策评估和政策迭代方法。该奖项反映了NSF的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(18)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure
- DOI:
- 发表时间:2021-11
- 期刊:
- 影响因子:0
- 作者:Hsu Kao;Chen-Yu Wei;V. Subramanian
- 通讯作者:Hsu Kao;Chen-Yu Wei;V. Subramanian
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space
可数无限状态空间马尔可夫决策过程中最优策略的贝叶斯学习
- DOI:
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Saghar Adler;Vijay Subramanian
- 通讯作者:Vijay Subramanian
Private Information Compression in Dynamic Games among Teams
团队动态博弈中的私有信息压缩
- DOI:10.1109/cdc45484.2021.9683479
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Tang, Dengwang;Tavafoghi, Hamidreza;Subramanian, Vijay;Nayyar, Ashutosh;Teneketzis, Demosthenis
- 通讯作者:Teneketzis, Demosthenis
A Strong Duality Result for Cooperative Decentralized Constrained POMDPs
- DOI:10.1109/cdc49753.2023.10383989
- 发表时间:2023-12
- 期刊:
- 影响因子:0
- 作者:Nouman Khan;Vijay G. Subramanian
- 通讯作者:Nouman Khan;Vijay G. Subramanian
Empirical Policy Evaluation With Supergraphs
- DOI:10.1109/jsait.2021.3073257
- 发表时间:2020-02
- 期刊:
- 影响因子:0
- 作者:Daniel Vial;V. Subramanian
- 通讯作者:Daniel Vial;V. Subramanian
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Vijay Subramanian其他文献
Using Lactate Clearance at 6 hours and Glucose Metabolism as a Marker for Usability of Liver following Normothermic Machine Perfusion
以 6 小时时的乳酸清除率和葡萄糖代谢作为常温机械灌注后肝脏可用性的标志物
- DOI:
10.1016/j.ajt.2024.12.259 - 发表时间:
2025-01-01 - 期刊:
- 影响因子:8.200
- 作者:
Vijay Subramanian;Philopateer Messeha;Olivia Walter;Arni Kumar;Emma Kotelnicki;Milana Mudra;Kaidyn White;Ashish Singhal;Kiran Dhanireddy - 通讯作者:
Kiran Dhanireddy
578: INTEGRATED ALCOHOL USE DISORDER CLINIC AS A STRATEGY TO REDUCE ALCOHOL RELAPSE AFTER EARLY LIVER TRANSPLANTATION IN PATIENTS WITH ALCOHOL RELATED LIVER DISEASE
- DOI:
10.1016/s0016-5085(22)63401-2 - 发表时间:
2022-05-01 - 期刊:
- 影响因子:
- 作者:
Rashid Z. Syed;Saurabh Agrawal;Kawtar Al Khalloufi;Christopher Albers;Basem Alkurdi;Kristina Barber;Kiran Dhanireddy;Brenna J. Evans;Rachel Hogen;Nyingi Kemmer;Miguel Malespin;Marian Porubsky;Diego Reino;Vijay Subramanian;Christine Machado-Denis - 通讯作者:
Christine Machado-Denis
Factors Associated with Liver Cradle Compression Effect Following Normothermic Machine Perfusion
常温机械灌注后与肝脏摇篮压缩效应相关的因素
- DOI:
10.1016/j.ajt.2024.12.048 - 发表时间:
2025-01-01 - 期刊:
- 影响因子:8.200
- 作者:
Vijay Subramanian;Grant Weiderman;Venkata Yeddula;Emma Kotelnicki;Milana Mudra;Kaidyn White;Kiran Dhanireddy - 通讯作者:
Kiran Dhanireddy
Combined cardiac procedures and orthotopic liver transplant in the era of machine perfusion
机器灌注时代的心脏联合手术和原位肝移植
- DOI:
10.1016/j.ajt.2024.12.212 - 发表时间:
2025-01-01 - 期刊:
- 影响因子:8.200
- 作者:
Tara Barry;Vijay Subramanian;Rachel Hogen;Diego Reino;Lucian Lozonschi;Kiran Dhanireddy;Ashish Singhal - 通讯作者:
Ashish Singhal
Controlled Hypothermic Preservation of Donor Livers with Back- to-Base Normothermic Machine Perfusion Improves Clinical Outcomes and Facilitates Donor Pool Expansion
供肝的低温保存结合回基地常温机械灌注可改善临床结局并促进供肝库的扩展
- DOI:
10.1016/j.ajt.2024.12.258 - 发表时间:
2025-01-01 - 期刊:
- 影响因子:8.200
- 作者:
Vijay Subramanian;Rachel Hogen;Ashish Singhal;Diego Reino;Kiran Dhanireddy - 通讯作者:
Kiran Dhanireddy
Vijay Subramanian的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Vijay Subramanian', 18)}}的其他基金
CPS: Medium: Collaborative Research: Developing Data-driven Robustness and Safety from Single Agent Settings to Stochastic Dynamic Teams: Theory and Applications
CPS:中:协作研究:从单代理设置到随机动态团队开发数据驱动的鲁棒性和安全性:理论与应用
- 批准号:
2240981 - 财政年份:2023
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
Collaborative Research: CPS: Medium: Empowering prosumers in electricity markets through market design and learning
合作研究:CPS:中:通过市场设计和学习为电力市场中的产消者赋权
- 批准号:
2038416 - 财政年份:2020
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Medium: Learning to Cache and Caching to Learn in High Performance Caching Systems
合作研究:CNS 核心:中:学习缓存以及在高性能缓存系统中学习缓存
- 批准号:
1955777 - 财政年份:2020
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
The 6th Midwest Workshop on Control and Game Theory; Ann Arbor, Michigan
第六届中西部控制与博弈论研讨会;
- 批准号:
1738207 - 财政年份:2017
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
Collaborative Research: EARS: Creating an Ecosystem for Enhanced Spectrum Utilization Through Dynamic Market Mechanisms
合作研究:EARS:通过动态市场机制创建增强频谱利用率的生态系统
- 批准号:
1516075 - 财政年份:2014
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
III: Small: Inferring first movers in large-scale socio-technical networks
III:小型:推断大规模社会技术网络中的先行者
- 批准号:
1538827 - 财政年份:2014
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
Collaborative Research: EARS: Creating an Ecosystem for Enhanced Spectrum Utilization Through Dynamic Market Mechanisms
合作研究:EARS:通过动态市场机制创建增强频谱利用率的生态系统
- 批准号:
1443972 - 财政年份:2014
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
III: Small: Inferring first movers in large-scale socio-technical networks
III:小型:推断大规模社会技术网络中的先行者
- 批准号:
1219071 - 财政年份:2012
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
相似国自然基金
基于前瞻性队列的双酚AF联合果糖加重代谢损伤的靶向代谢组学研究
- 批准号:2025JJ30049
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
U2AF2-circMMP1信号轴促进结直肠癌进展的分子机制研究
- 批准号:2025JJ80723
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
U2AF2精氯酸甲基化调控RNA转录合成在MTAP缺失骨肉瘤T细胞耗竭中的机制研究
- 批准号:
- 批准年份:2024
- 资助金额:0 万元
- 项目类别:青年科学基金项目
BDA-366通过MYD88/NF-κB/PGC1β通路杀伤 KMT2A/AF9 AML细胞的机制研究
- 批准号:
- 批准年份:2024
- 资助金额:15.0 万元
- 项目类别:省市级项目
Lu AF21934减少缺血性脑卒中导致的神经损伤的机制研究
- 批准号:
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
H2S介导剪接因子BraU2AF65a的S-巯基化修饰促进大白菜开花的分子机制
- 批准号:32372727
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
AF9通过ARRB2-MRGPRB2介导肠固有肥大细胞活化促进重症急性胰腺炎发生MOF的研究
- 批准号:82300739
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
剪接因子U2AF1突变在急性髓系白血病原发耐药中的机制研究
- 批准号:82370157
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
线粒体活性氧介导的胎盘早衰在孕期双酚AF暴露致婴幼儿神经发育迟缓中的作用
- 批准号:82304160
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
U2AF2-circMMP1调控能量代谢促进结直肠癌肝转移的分子机制
- 批准号:82303789
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
Collaborative Research: U.S.-Ireland R&D Partnership: CIF: AF: Small: Enabling Beyond-5G Wireless Access Networks with Robust and Scalable Cell-Free Massive MIMO
合作研究:美国-爱尔兰 R
- 批准号:
2322191 - 财政年份:2023
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
Collaborative Research: U.S.-Ireland R&D Partnership: CIF: AF: Small: Enabling Beyond-5G Wireless Access Networks with Robust and Scalable Cell-Free Massive MIMO
合作研究:美国-爱尔兰 R
- 批准号:
2322190 - 财政年份:2023
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
Collaborative Research: NSF-AoF: CIF: AF: Small: Energy-Efficient THz Communications Across Massive Dimensions
合作研究:NSF-AoF:CIF:AF:小型:大尺寸的节能太赫兹通信
- 批准号:
2225576 - 财政年份:2022
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
Collaborative Research: NSF-AoF: CIF: AF: Small: Energy-Efficient THz Communications Across Massive Dimensions
合作研究:NSF-AoF:CIF:AF:小型:大尺寸的节能太赫兹通信
- 批准号:
2225575 - 财政年份:2022
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
CIF: AF: Small: Data Processing Against Synchronization Errors
CIF:AF:小:针对同步错误的数据处理
- 批准号:
2006455 - 财政年份:2020
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
AF: CIF: Small: Communication complexity techniques beyond classical information theory
AF:CIF:小:超越经典信息论的通信复杂性技术
- 批准号:
2006589 - 财政年份:2020
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
CCF-BSF: AF: CIF: Small: Low Complexity Error Correction
CCF-BSF:AF:CIF:小:低复杂性纠错
- 批准号:
1814629 - 财政年份:2018
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
CIF: AF: Small: Foundations of Multimodal Information Integration
CIF:AF:小型:多模式信息集成的基础
- 批准号:
1712867 - 财政年份:2017
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
CIF/AF: Small: Some fundamental complexity-inspired coding theory challenges
CIF/AF:小:一些由复杂性引发的基本编码理论挑战
- 批准号:
1422045 - 财政年份:2014
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant
AF: CIF: Small: Theoretical Problems in Quantum Cmputation and Cmmunication
AF:CIF:小:量子计算和通信中的理论问题
- 批准号:
1216729 - 财政年份:2012
- 资助金额:
$ 35.06万 - 项目类别:
Standard Grant