CPS: Medium: Collaborative Research: Developing Data-driven Robustness and Safety from Single Agent Settings to Stochastic Dynamic Teams: Theory and Applications

Basic Information

Project Abstract

This Cyber-Physical Systems (CPS) project will make foundational methodological advances that enable safe and robust reinforcement learning (RL)-based control algorithms, motivated by problems in smart traffic signal control systems. Recent advances in computation, communication, storage, and sensing have led to a demand for data-driven, learning-based decision-making and control in modern cyber-physical systems, such as smart transportation systems. In such systems, decision-making agents need to operate safely and robustly in complex environments while respecting constraints. This project will develop foundational advances in robust RL, and in safe and constrained RL with provable guarantees, taking traffic signal control within smart transportation systems as the motivating CPS application and evaluation platform.

This work will additionally focus on advancing curriculum development, recruitment of students from under-represented groups, involvement of undergraduate students in research, K-12 outreach, and research community outreach via workshops, conference sessions, and seminars. The researchers will interface with companies and other stakeholders to communicate the results of the research and to provide them with educational material on methodology.

The technical approaches include: 1. Robust RL solutions incorporating model class knowledge, use of future predictions and robustness characterizations, and off-policy methods to address distributional shifts and data paucity arising from the use of a simulator/emulator or offline data; and 2. Efficient, safe, and constrained RL algorithms using model-free approaches and function-approximated methods, as well as methods for partially observed systems.

To close the loop with the motivating CPS application, the RL algorithms will be evaluated in the context of traffic signal control via a comprehensive simulation-based evaluation using models of two instrumented sites.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Project Outcomes

Journal articles (6)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Learning-based Optimal Admission Control in a Single Server Queuing System
  • DOI:
    10.1109/allerton49937.2022.9929406
  • Published:
    2022
  • Journal:
  • Impact factor:
    0
  • Authors:
    Zhang, Yili; Cohen, Asaf; Subramanian, Vijay G.
  • Corresponding author:
    Subramanian, Vijay G.
A Strong Duality Result for Cooperative Decentralized Constrained POMDPs
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space
Rarest-First with Probabilistic-Mode-Suppression (RFwPMS)
A Multi-Agent View of Wireless Video Streaming with Delayed Client-Feedback

Other publications by Vijay Subramanian

Using Lactate Clearance at 6 hours and Glucose Metabolism as a Marker for Usability of Liver following Normothermic Machine Perfusion
  • DOI:
    10.1016/j.ajt.2024.12.259
  • Published:
    2025-01-01
  • Journal:
  • Impact factor:
    8.200
  • Authors:
    Vijay Subramanian; Philopateer Messeha; Olivia Walter; Arni Kumar; Emma Kotelnicki; Milana Mudra; Kaidyn White; Ashish Singhal; Kiran Dhanireddy
  • Corresponding author:
    Kiran Dhanireddy
578: INTEGRATED ALCOHOL USE DISORDER CLINIC AS A STRATEGY TO REDUCE ALCOHOL RELAPSE AFTER EARLY LIVER TRANSPLANTATION IN PATIENTS WITH ALCOHOL RELATED LIVER DISEASE
  • DOI:
    10.1016/s0016-5085(22)63401-2
  • Published:
    2022-05-01
  • Journal:
  • Impact factor:
  • Authors:
    Rashid Z. Syed; Saurabh Agrawal; Kawtar Al Khalloufi; Christopher Albers; Basem Alkurdi; Kristina Barber; Kiran Dhanireddy; Brenna J. Evans; Rachel Hogen; Nyingi Kemmer; Miguel Malespin; Marian Porubsky; Diego Reino; Vijay Subramanian; Christine Machado-Denis
  • Corresponding author:
    Christine Machado-Denis
Factors Associated with Liver Cradle Compression Effect Following Normothermic Machine Perfusion
  • DOI:
    10.1016/j.ajt.2024.12.048
  • Published:
    2025-01-01
  • Journal:
  • Impact factor:
    8.200
  • Authors:
    Vijay Subramanian; Grant Weiderman; Venkata Yeddula; Emma Kotelnicki; Milana Mudra; Kaidyn White; Kiran Dhanireddy
  • Corresponding author:
    Kiran Dhanireddy
Combined cardiac procedures and orthotopic liver transplant in the era of machine perfusion
  • DOI:
    10.1016/j.ajt.2024.12.212
  • Published:
    2025-01-01
  • Journal:
  • Impact factor:
    8.200
  • Authors:
    Tara Barry; Vijay Subramanian; Rachel Hogen; Diego Reino; Lucian Lozonschi; Kiran Dhanireddy; Ashish Singhal
  • Corresponding author:
    Ashish Singhal
Controlled Hypothermic Preservation of Donor Livers with Back-to-Base Normothermic Machine Perfusion Improves Clinical Outcomes and Facilitates Donor Pool Expansion
  • DOI:
    10.1016/j.ajt.2024.12.258
  • Published:
    2025-01-01
  • Journal:
  • Impact factor:
    8.200
  • Authors:
    Vijay Subramanian; Rachel Hogen; Ashish Singhal; Diego Reino; Kiran Dhanireddy
  • Corresponding author:
    Kiran Dhanireddy

Other grants held by Vijay Subramanian

CIF: AF: Small: A Perturbed Markov Chains Approach to Studying Centrality, Mixing and Reinforcement Learning
  • Award number:
    2008130
  • Fiscal year:
    2020
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CPS: Medium: Empowering prosumers in electricity markets through market design and learning
  • Award number:
    2038416
  • Fiscal year:
    2020
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CNS Core: Medium: Learning to Cache and Caching to Learn in High Performance Caching Systems
  • Award number:
    1955777
  • Fiscal year:
    2020
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
The 6th Midwest Workshop on Control and Game Theory; Ann Arbor, Michigan
  • Award number:
    1738207
  • Fiscal year:
    2017
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: EARS: Creating an Ecosystem for Enhanced Spectrum Utilization Through Dynamic Market Mechanisms
  • Award number:
    1516075
  • Fiscal year:
    2014
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
III: Small: Inferring first movers in large-scale socio-technical networks
  • Award number:
    1538827
  • Fiscal year:
    2014
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: EARS: Creating an Ecosystem for Enhanced Spectrum Utilization Through Dynamic Market Mechanisms
  • Award number:
    1443972
  • Fiscal year:
    2014
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
III: Small: Inferring first movers in large-scale socio-technical networks
  • Award number:
    1219071
  • Fiscal year:
    2012
  • Award amount:
    $480,000
  • Project type:
    Standard Grant

Similar overseas grants

Collaborative Research: CPS: Medium: Automating Complex Therapeutic Loops with Conflicts in Medical Cyber-Physical Systems
  • Award number:
    2322534
  • Fiscal year:
    2024
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CPS: Medium: Automating Complex Therapeutic Loops with Conflicts in Medical Cyber-Physical Systems
  • Award number:
    2322533
  • Fiscal year:
    2024
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CPS: Medium: Physics-Model-Based Neural Networks Redesign for CPS Learning and Control
  • Award number:
    2311084
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
CPS: Medium: Collaborative Research: Provably Safe and Robust Multi-Agent Reinforcement Learning with Applications in Urban Air Mobility
  • Award number:
    2312092
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CPS: Medium: Enabling Data-Driven Security and Safety Analyses for Cyber-Physical Systems
  • Award number:
    2414176
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CPS: Medium: An Online Learning Framework for Socially Emerging Mixed Mobility
  • Award number:
    2401007
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CPS: Medium: Mutualistic Cyber-Physical Interaction for Self-Adaptive Multi-Damage Monitoring of Civil Infrastructure
  • Award number:
    2305882
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
CPS: Medium: Collaborative Research: Robust Sensing and Learning for Autonomous Driving Against Perceptual Illusion
  • Award number:
    2235231
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
Collaborative Research: CPS: Medium: Sensor Attack Detection and Recovery in Cyber-Physical Systems
  • Award number:
    2333980
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant
CPS Medium: Collaborative Research: Physics-Informed Learning and Control of Passive and Hybrid Conditioning Systems in Buildings
  • Award number:
    2241796
  • Fiscal year:
    2023
  • Award amount:
    $480,000
  • Project type:
    Standard Grant