STCI: Middleware for Monitoring and Troubleshooting of Large-Scale Applications on National Cyberinfrastructure

STCI:用于国家网络基础设施大规模应用监控和故障排除的中间件

基本信息

  • 批准号:
    0943705
  • 负责人:
  • 金额:
    $ 187.58万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2009
  • 资助国家:
    美国
  • 起止时间:
    2009-09-01 至 2013-08-31
  • 项目状态:
    已结题

项目摘要

This proposal will be awarded using funds made available by the American Recovery and Reinvestment Act of 2009 (Public Law 111-5), and meets the requirements established in Section 2 of the White House Memorandum entitled, Ensuring Responsible Spending of Recovery Act Funds, dated March 20, 2009. The STCI: Middleware for Monitoring and Troubleshooting of Large-Scale Applications on National Cyberinfrastructure project aims to provide robust and scalable workflow monitoring services that can be used to track the progress of workflow-based applications as they are executing on the distributed cyberinfrastructure. New anomaly detection and troubleshooting services will also be developed to alert users to problems with the application and cyberinfrastructure services and allow them to quickly navigate and mine the application's execution records. The foundation of this work is the development of a robust and scalable infrastructure for performance information gathering and distribution. Information flowing through this infrastructure will be stored in high-performance archives and distributed to interested entities through subscription interfaces. Three main services will be developed: 1) an online monitoring service, 2) an anomaly detection service based on dynamic mining of application and cyberinfrastructure logs and 3) a troubleshooting service that will help trace the source of a failure.Intellectual Merit This work will potentially increase scientists' productivity by allowing them to quickly identify problems in an application, thus reducing the time it takes to generate scientifically meaningful results. This work will also make the performance of complex scientific workflows more transparent, which will enable the generation of accurate estimates of overall time to completion, more efficient use of resources, and easier resolution of end-to-end performance problems in collaboration with network and resource providers. Broader Impact Scientific communities in astronomy, biology, earthquake science, physics, and others will immediately benefit from the proposed system. Because the approach relies on simple, well-defined logging formats, this work is applicable to a range of workflow management systems as well as sub-components of those systems such as job managers and data transfer tools.
该提案将使用2009年《美国复苏和再投资法案》(公共法111-5)提供的资金进行授予,并满足2009年3月20日发布的题为“确保负责任地使用复苏法案资金”的白宫备忘录第2节中规定的要求。STCI:国家网络基础设施大规模应用程序监控和故障排除中间件项目旨在提供强大且可扩展的工作流监控服务,可用于跟踪基于工作流的应用程序在分布式网络基础设施上执行的进度。还将开发新的异常检测和故障排除服务,以提醒用户注意应用程序和网络基础设施服务的问题,并使他们能够快速导航和挖掘应用程序的执行记录。这项工作的基础是为性能信息收集和分发开发一个健壮的、可扩展的基础设施。流经这一基础设施的信息将存储在高性能档案中,并通过订阅接口分发给感兴趣的实体。将开发三项主要服务:1)在线监控服务,2)基于应用和网络基础设施日志动态挖掘的异常检测服务,3)帮助追踪故障来源的故障排除服务。智力优势这项工作将通过允许科学家快速识别应用中的问题,从而潜在地提高科学家的生产力,从而减少产生有科学意义的结果所需的时间。 这项工作还将使复杂的科学工作流程的性能更加透明,这将有助于生成对完成总时间的准确估计,更有效地利用资源,并与网络和资源提供商合作更容易解决端到端性能问题。天文学、生物学、地震科学、物理学等领域的科学界将立即从拟议的系统中受益。由于该方法依赖于简单,定义良好的日志格式,这项工作适用于一系列的工作流管理系统以及这些系统的子组件,如作业管理器和数据传输工具。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Ewa Deelman其他文献

Mapping Abstract Complex Workflows onto Grid Environments
  • DOI:
    10.1023/a:1024000426962
  • 发表时间:
    2003-01-01
  • 期刊:
  • 影响因子:
    2.900
  • 作者:
    Ewa Deelman;James Blythe;Yolanda Gil;Carl Kesselman;Gaurang Mehta;Karan Vahi;Kent Blackburn;Albert Lazzarini;Adam Arbree;Richard Cavanaugh;Scott Koranda
  • 通讯作者:
    Scott Koranda
Advancing Anomaly Detection in Computational Workflows with Active Learning
通过主动学习推进计算工作流程中的异常检测
  • DOI:
    10.48550/arxiv.2405.06133
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Krishnan Raghavan;George Papadimitriou;Hongwei Jin;A. Mandal;Mariam Kiran;Prasanna Balaprakash;Ewa Deelman
  • 通讯作者:
    Ewa Deelman
A terminology for scientific workflow systems
科学工作流系统的术语
  • DOI:
    10.1016/j.future.2025.107974
  • 发表时间:
    2026-01-01
  • 期刊:
  • 影响因子:
    6.100
  • 作者:
    Frédéric Suter;Tainã Coleman;İlkay Altintaş;Rosa M. Badia;Bartosz Balis;Kyle Chard;Iacopo Colonnelli;Ewa Deelman;Paolo Di Tommaso;Thomas Fahringer;Carole Goble;Shantenu Jha;Daniel S. Katz;Johannes Köster;Ulf Leser;Kshitij Mehta;Hilary Oliver;J.-Luc Peterson;Giovanni Pizzi;Loïc Pottier;Rafael Ferreira da Silva
  • 通讯作者:
    Rafael Ferreira da Silva
Broadening Student Engagement To Build the Next Generation of Cyberinfrastructure Professionals
扩大学生参与度,培养下一代网络基础设施专业人员
  • DOI:
    10.1145/3569951.3597567
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Angela Murillo;Don Brower;Sarowar Hossain;K. Kee;A. Mandal;J. Nabrzyski;Erik Scott;Nicole K. Virdone;Rodney Ewing;Ewa Deelman
  • 通讯作者:
    Ewa Deelman
How is Artificial Intelligence Changing Science?
人工智能如何改变科学?

Ewa Deelman的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Ewa Deelman', 18)}}的其他基金

Collaborative Research: CyberTraining: Implementation: Medium: CyberInfrastructure Training and Education for Synchrotron X-Ray Science (X-CITE)
合作研究:网络培训:实施:媒介:同步加速器 X 射线科学网络基础设施培训和教育 (X-CITE)
  • 批准号:
    2320375
  • 财政年份:
    2023
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Small: Model-driven Design and Optimization of Dataflows for Scientific Applications
协作研究:SHF:小型:科学应用数据流的模型驱动设计和优化
  • 批准号:
    2331153
  • 财政年份:
    2023
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
CI CoE: CI Compass: An NSF Cyberinfrastructure (CI) Center of Excellence for Navigating the Major Facilities Data Lifecycle
CI CoE:CI Compass:用于导航主要设施数据生命周期的 NSF 网络基础设施 (CI) 卓越中心
  • 批准号:
    2127548
  • 财政年份:
    2021
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Collaborative Research: OAC Core: Simulation-driven runtime resource management for distributed workflow applications
协作研究:OAC Core:分布式工作流应用程序的模拟驱动的运行时资源管理
  • 批准号:
    2106147
  • 财政年份:
    2021
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Collaborative Research: Elements: Simulation-driven Evaluation of Cyberinfrastructure Systems
协作研究:要素:网络基础设施系统的仿真驱动评估
  • 批准号:
    2103508
  • 财政年份:
    2021
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: VisDict - Visual Dictionaries for Enhancing the Communication between Domain Scientists and Scientific Workflow Providers
协作研究:EAGER:VisDict - 用于增强领域科学家和科学工作流程提供商之间沟通的视觉词典
  • 批准号:
    2100636
  • 财政年份:
    2021
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Advancing Reproducibility in Multi-Messenger Astrophysics
合作研究:EAGER:提高多信使天体物理学的可重复性
  • 批准号:
    2041901
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Leveraging Advanced Cyberinfrastructure and Developing Organizational Resilience for NSF Large Facilities in the Pandemic Era
合作研究:EAGER:在大流行时代利用先进的网络基础设施并提高 NSF 大型设施的组织弹性
  • 批准号:
    2042054
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Collaborative Research: PPoSS: Planning: Performance Scalability, Trust, and Reproducibility: A Community Roadmap to Robust Science in High-throughput Applications
协作研究:PPoSS:规划:性能可扩展性、信任和可重复性:高通量应用中稳健科学的社区路线图
  • 批准号:
    2028930
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
2019 NSF Workshop on Connecting Large Facilities and Cyberinfrastructure
2019 年 NSF 连接大型设施和网络基础设施研讨会
  • 批准号:
    1933353
  • 财政年份:
    2019
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant

相似海外基金

Blockchain-Based Middleware for Distributed Management IoT Resources and Services
用于分布式管理物联网资源和服务的基于区块链的中间件
  • 批准号:
    RGPIN-2020-05415
  • 财政年份:
    2022
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Discovery Grants Program - Individual
EAGER: CNS: RobSenCom: A Middleware to Improve the Connectivity between Heterogeneous Robots and IoT
EAGER:CNS:RobSenCom:改善异构机器人和物联网之间连接的中间件
  • 批准号:
    2233879
  • 财政年份:
    2022
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Blockchain-Based Middleware for Distributed Management IoT Resources and Services
用于分布式管理物联网资源和服务的基于区块链的中间件
  • 批准号:
    RGPIN-2020-05415
  • 财政年份:
    2021
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Discovery Grants Program - Individual
Next Generation of IoT Backend Middleware for Edge and Cloud
适用于边缘和云的下一代物联网后端中间件
  • 批准号:
    543885-2019
  • 财政年份:
    2021
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Collaborative Research and Development Grants
A Framework for AI-Enabled Middleware Components for Securing IoT Systems
用于保护物联网系统的人工智能中间件组件框架
  • 批准号:
    DDG-2020-00032
  • 财政年份:
    2021
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Discovery Development Grant
Hard-Middleware: Facilitating Reliable Machine Learning Deployment for Automotive Applications
硬件中间件:促进汽车应用的可靠机器学习部署
  • 批准号:
    2481244
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Studentship
Blockchain-Based Middleware for Distributed Management IoT Resources and Services
用于分布式管理物联网资源和服务的基于区块链的中间件
  • 批准号:
    RGPIN-2020-05415
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Discovery Grants Program - Individual
A Framework for AI-Enabled Middleware Components for Securing IoT Systems
用于保护物联网系统的人工智能中间件组件框架
  • 批准号:
    DDG-2020-00032
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Discovery Development Grant
OAC Core: Small: Next-Generation Communication and I/O Middleware for HPC and Deep Learning with Smart NICs
OAC 核心:小型:使用智能 NIC 实现 HPC 和深度学习的下一代通信和 I/O 中间件
  • 批准号:
    2007991
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
Elements: RADICAL-Cybertools: Middleware Building Blocks for NSF's Cyberinfrastructure Ecosystem.
元素: RADICAL-Cyber​​tools:NSF 网络基础设施生态系统的中间件构建块。
  • 批准号:
    1931512
  • 财政年份:
    2020
  • 资助金额:
    $ 187.58万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了