SI2-SSI: Pegasus: Automating Compute and Data Intensive Science

SI2-SSI:Pegasus:自动化计算和数据密集型科学

基本信息

  • 批准号:
    1664162
  • 负责人:
  • 金额:
    $ 250万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2017
  • 资助国家:
    美国
  • 起止时间:
    2017-05-15 至 2023-09-30
  • 项目状态:
    已结题

项目摘要

This project addresses the ever-growing gap between the capabilities offered by on-campus and off-campus cyberinfrastructures (CI) and the ability of researchers to effectively harness these capabilities to advance scientific discovery. Faculty and students on campuses struggle to extract knowledge from data that does not fit on their laptops or cannot be processed by an Excel spreadsheet and they find it difficult to efficiently manage their computations. The project sustains and enhances the Pegasus Workflow Management System, which enables scientist to orchestrate and run data- and compute-intensive computations on diverse distributed computational resources. Enhancements focus on the automation capabilities provided by Pegasus to support workflows handling large data sets, as well as usability of Pegasus that lowers the barrier of its adoption. This effort expands the reach of the advanced capabilities provided by Pegasus to researchers from a broader spectrum of disciplines that range from gravitational-wave physics to bioinformatics, and from earth science to material science.For more than 15 years the Pegasus Workflow Management System has been designed, implemented and supported to provide abstractions that enable scientists to focus on structuring their computations without worrying about the details of the target CI. To support these workflow abstractions Pegasus provides automation capabilities that seamlessly map workflows onto target resources, sparing scientists the overhead of managing the data flow, job scheduling, fault recovery and adaptation of their applications. Automation enables the delivery of services that consider criteria such as time-to-solution, as well as takes into account efficient use of resources, managing the throughput of tasks, and data transfer requests. The power of these abstractions was demonstrated in 2015 when Pegasus was used by an international collaboration to harness a diverse set of resources and to manage compute- and data- intensive workflows that confirmed the existence of gravitational waves, as predicted by Einstein's theory of relativity. Experience from working with diverse scientific domains - astronomy, bioinformatics, climate modeling, earthquake science, gravitational and material science - uncover opportunities for further automation of scientific workflows. This project addresses these opportunities through innovation in the following areas: automation methods to include resource provisioning ahead of and during workflow execution, data-aware job scheduling algorithms, and data sharing mechanisms in high-throughput environments. To support a broader group of "long-tail" scientists, effort is devoted to usability improvements as well as outreach, education, and training activities. The proposed work includes the implementation and evaluation of advanced frameworks, algorithms, and methods that enhance the power of automation in support of data-intensive science. These enhancements are delivers as dependable software tools integrated with Pegasus so that they can be evaluated in the context of real-life applications and computing environments. The data-aware focus targets new classes of applications executing in high-throughput and high-performance environments. Pegasus has been adopted by researchers from a broad spectrum of disciplines that range from gravitational-wave physics to bioinformatics, and from earth science to material science. It provides and enhances access to national CI such as OSG and XSEDE, and as part of this work it will be deployed within Chameleon and Jetstream to provide broader access to NSF's CI investments. Through usability improvements, engagement with CI and community platform providers such as HubZero and Cyverse, combined with educational, training, and tutorial activities, this project broadens the set of researchers that leverage automation for their work. Collaboration with the Gateways Institute assures that Pegasus interfaces are suitable for vertical integration within science gateways and seamlessly supports new scientific communities.
该项目解决了校内和校外网络基础设施(CI)提供的能力与研究人员有效利用这些能力来推进科学发现的能力之间不断扩大的差距。校园里的教师和学生努力从不适合他们的笔记本电脑或无法通过Excel电子表格处理的数据中提取知识,他们发现很难有效地管理他们的计算。该项目维持和增强了Pegasus工作流管理系统,使科学家能够在不同的分布式计算资源上协调和运行数据和计算密集型计算。增强功能主要集中在Pegasus提供的自动化功能,以支持处理大型数据集的工作流程,以及Pegasus的可用性,从而降低了采用的障碍。这一努力扩大了Pegasus提供的先进功能的范围,使研究人员能够从更广泛的学科,从引力波物理学到生物信息学,从地球科学到材料科学。实现并支持提供抽象,使科学家能够专注于构建他们的计算,而无需担心细节目标线人的身份为了支持这些工作流抽象,Pegasus提供了自动化功能,可以将工作流无缝映射到目标资源上,从而使科学家不必管理数据流、作业调度、故障恢复和应用程序调整。自动化使服务的交付能够考虑解决问题的时间等标准,并考虑资源的有效使用,管理任务的吞吐量和数据传输请求。这些抽象的力量在2015年得到了证明,当时Pegasus被一个国际合作组织用于利用各种资源并管理计算和数据密集型工作流程,这些工作流程证实了引力波的存在,正如爱因斯坦相对论所预测的那样。在天文学、生物信息学、气候建模、地震科学、引力和材料科学等不同科学领域的工作经验,为科学工作流程的进一步自动化提供了机会。该项目通过以下领域的创新来解决这些机会:自动化方法,包括工作流执行之前和期间的资源配置,数据感知作业调度算法以及高吞吐量环境中的数据共享机制。为了支持更广泛的“长尾”科学家群体,努力致力于可用性改进以及推广,教育和培训活动。拟议的工作包括实施和评估先进的框架、算法和方法,以增强自动化的能力,支持数据密集型科学。这些增强功能是作为与Pegasus集成的可靠软件工具提供的,因此可以在实际应用程序和计算环境中对其进行评估。数据感知的重点是在高吞吐量和高性能环境中执行的新型应用程序。Pegasus已经被来自广泛学科的研究人员采用,从引力波物理学到生物信息学,从地球科学到材料科学。它提供并增强了对OSG和XSEDE等国家CI的访问,作为这项工作的一部分,它将部署在Chameleon和Jetstream中,以提供对NSF CI投资的更广泛访问。通过可用性改进,与CI和社区平台提供商(如HubZero和Cyverse)的合作,结合教育,培训和教程活动,该项目扩大了利用自动化进行工作的研究人员的范围。与网关研究所的合作确保了Pegasus接口适合科学网关内的垂直整合,并无缝支持新的科学社区。

项目成果

期刊论文数量(24)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
FlyNet: Drones on the Horizon
FlyNet:地平线上的无人机
  • DOI:
    10.1109/mic.2023.3260440
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    3.2
  • 作者:
    Morel, Alicia Esquivel;Qu, Chengyi;Calyam, Prasad;Wang, Cong;Thareja, Komal;Mandal, Anirban;Lyons, Eric;Zink, Michael;Papadimitriou, George;Deelman, Ewa
  • 通讯作者:
    Deelman, Ewa
Emerging Frameworks for Advancing Scientific Workflows Research, Development, and Education
推进科学工作流程研究、开发和教育的新兴框架
Mining Literature-Based Knowledge Graph for Predicting Combination Therapeutics: A COVID-19 Use Case
  • DOI:
    10.1109/ickg55886.2022.00018
  • 发表时间:
    2022-11
  • 期刊:
  • 影响因子:
    0
  • 作者:
    A. Hamed;Jakub Jończyk;Mohammad Zaiyan Alam;E. Deelman;Byung Suk Lee
  • 通讯作者:
    A. Hamed;Jakub Jończyk;Mohammad Zaiyan Alam;E. Deelman;Byung Suk Lee
The Evolution of the Pegasus Workflow Management Software
  • DOI:
    10.1109/mcse.2019.2919690
  • 发表时间:
    2019-07-01
  • 期刊:
  • 影响因子:
    2.1
  • 作者:
    Deelman, Ewa;Vahi, Karan;Livny, Miron
  • 通讯作者:
    Livny, Miron
Performance assessment of ensembles of in situ workflows under resource constraints
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Ewa Deelman其他文献

Mapping Abstract Complex Workflows onto Grid Environments
  • DOI:
    10.1023/a:1024000426962
  • 发表时间:
    2003-01-01
  • 期刊:
  • 影响因子:
    2.900
  • 作者:
    Ewa Deelman;James Blythe;Yolanda Gil;Carl Kesselman;Gaurang Mehta;Karan Vahi;Kent Blackburn;Albert Lazzarini;Adam Arbree;Richard Cavanaugh;Scott Koranda
  • 通讯作者:
    Scott Koranda
Advancing Anomaly Detection in Computational Workflows with Active Learning
通过主动学习推进计算工作流程中的异常检测
  • DOI:
    10.48550/arxiv.2405.06133
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Krishnan Raghavan;George Papadimitriou;Hongwei Jin;A. Mandal;Mariam Kiran;Prasanna Balaprakash;Ewa Deelman
  • 通讯作者:
    Ewa Deelman
A terminology for scientific workflow systems
科学工作流系统的术语
  • DOI:
    10.1016/j.future.2025.107974
  • 发表时间:
    2026-01-01
  • 期刊:
  • 影响因子:
    6.100
  • 作者:
    Frédéric Suter;Tainã Coleman;İlkay Altintaş;Rosa M. Badia;Bartosz Balis;Kyle Chard;Iacopo Colonnelli;Ewa Deelman;Paolo Di Tommaso;Thomas Fahringer;Carole Goble;Shantenu Jha;Daniel S. Katz;Johannes Köster;Ulf Leser;Kshitij Mehta;Hilary Oliver;J.-Luc Peterson;Giovanni Pizzi;Loïc Pottier;Rafael Ferreira da Silva
  • 通讯作者:
    Rafael Ferreira da Silva
Broadening Student Engagement To Build the Next Generation of Cyberinfrastructure Professionals
扩大学生参与度,培养下一代网络基础设施专业人员
  • DOI:
    10.1145/3569951.3597567
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Angela Murillo;Don Brower;Sarowar Hossain;K. Kee;A. Mandal;J. Nabrzyski;Erik Scott;Nicole K. Virdone;Rodney Ewing;Ewa Deelman
  • 通讯作者:
    Ewa Deelman
How is Artificial Intelligence Changing Science?
人工智能如何改变科学?

Ewa Deelman的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Ewa Deelman', 18)}}的其他基金

Collaborative Research: CyberTraining: Implementation: Medium: CyberInfrastructure Training and Education for Synchrotron X-Ray Science (X-CITE)
合作研究:网络培训:实施:媒介:同步加速器 X 射线科学网络基础设施培训和教育 (X-CITE)
  • 批准号:
    2320375
  • 财政年份:
    2023
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Small: Model-driven Design and Optimization of Dataflows for Scientific Applications
协作研究:SHF:小型:科学应用数据流的模型驱动设计和优化
  • 批准号:
    2331153
  • 财政年份:
    2023
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
CI CoE: CI Compass: An NSF Cyberinfrastructure (CI) Center of Excellence for Navigating the Major Facilities Data Lifecycle
CI CoE:CI Compass:用于导航主要设施数据生命周期的 NSF 网络基础设施 (CI) 卓越中心
  • 批准号:
    2127548
  • 财政年份:
    2021
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
Collaborative Research: OAC Core: Simulation-driven runtime resource management for distributed workflow applications
协作研究:OAC Core:分布式工作流应用程序的模拟驱动的运行时资源管理
  • 批准号:
    2106147
  • 财政年份:
    2021
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
Collaborative Research: Elements: Simulation-driven Evaluation of Cyberinfrastructure Systems
协作研究:要素:网络基础设施系统的仿真驱动评估
  • 批准号:
    2103508
  • 财政年份:
    2021
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: VisDict - Visual Dictionaries for Enhancing the Communication between Domain Scientists and Scientific Workflow Providers
协作研究:EAGER:VisDict - 用于增强领域科学家和科学工作流程提供商之间沟通的视觉词典
  • 批准号:
    2100636
  • 财政年份:
    2021
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Advancing Reproducibility in Multi-Messenger Astrophysics
合作研究:EAGER:提高多信使天体物理学的可重复性
  • 批准号:
    2041901
  • 财政年份:
    2020
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Leveraging Advanced Cyberinfrastructure and Developing Organizational Resilience for NSF Large Facilities in the Pandemic Era
合作研究:EAGER:在大流行时代利用先进的网络基础设施并提高 NSF 大型设施的组织弹性
  • 批准号:
    2042054
  • 财政年份:
    2020
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
Collaborative Research: PPoSS: Planning: Performance Scalability, Trust, and Reproducibility: A Community Roadmap to Robust Science in High-throughput Applications
协作研究:PPoSS:规划:性能可扩展性、信任和可重复性:高通量应用中稳健科学的社区路线图
  • 批准号:
    2028930
  • 财政年份:
    2020
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
2019 NSF Workshop on Connecting Large Facilities and Cyberinfrastructure
2019 年 NSF 连接大型设施和网络基础设施研讨会
  • 批准号:
    1933353
  • 财政年份:
    2019
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant

相似国自然基金

考虑SSI效应的导管架式海洋平台抗震性能研究
  • 批准号:
  • 批准年份:
    2022
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
考虑SSI的层间隔震高层建筑结构在三维地震下的响应研究
  • 批准号:
    52168072
  • 批准年份:
    2021
  • 资助金额:
    35 万元
  • 项目类别:
    地区科学基金项目
考虑SSI效应的大型储罐动力学特性及其隔板减晃研究
  • 批准号:
    51978336
  • 批准年份:
    2019
  • 资助金额:
    61.0 万元
  • 项目类别:
    面上项目
考虑SSI效应的摇摆墙-框架结构抗震机理及性能评估方法研究
  • 批准号:
    51978524
  • 批准年份:
    2019
  • 资助金额:
    60.0 万元
  • 项目类别:
    面上项目
考虑能量需求和SSI效应的RC梁式桥基于性能的抗震设计方法
  • 批准号:
    50908014
  • 批准年份:
    2009
  • 资助金额:
    20.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

小学校理科における合意形成能力の育成 ーSSIが関わる内容を対象としてー
培养小学科学中建立共识的技能 - 针对与 SSI 相关的内容 -
  • 批准号:
    24H02435
  • 财政年份:
    2024
  • 资助金额:
    $ 250万
  • 项目类别:
    Grant-in-Aid for Encouragement of Scientists
The DECREASE SSI Trial (Decolonization to Reduce After-Surgery Events of Surgical Site Infection)
DECREASE SSI 试验(去殖民化以减少手术部位感染的术后事件)
  • 批准号:
    10670860
  • 财政年份:
    2022
  • 资助金额:
    $ 250万
  • 项目类别:
The DECREASE SSI Trial (Decolonization to Reduce After-Surgery Events of Surgical Site Infection)
DECREASE SSI 试验(去殖民化以减少手术部位感染的术后事件)
  • 批准号:
    10501944
  • 财政年份:
    2022
  • 资助金额:
    $ 250万
  • 项目类别:
Significance of nonlinear soil-structure interaction (SSI) on the seismic performance of micropiles-retrofitted pile foundations
非线性土-结构相互作用(SSI)对微型桩加固桩基抗震性能的意义
  • 批准号:
    22K04280
  • 财政年份:
    2022
  • 资助金额:
    $ 250万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
SSIサーベイランス、薬剤感受性と医療費を統合した耐性菌と患者予後の関連の研究
整合SSI监测、药敏和医疗费用研究耐药菌与患者预后的关系
  • 批准号:
    21K17309
  • 财政年份:
    2021
  • 资助金额:
    $ 250万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Development of monitoring system for rapid detection of orthopaedic SSI using the portable DNA sequencer MinION
使用便携式 DNA 测序仪 MinION 开发用于快速检测骨科 SSI 的监测系统
  • 批准号:
    20K09419
  • 财政年份:
    2020
  • 资助金额:
    $ 250万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
  • 批准号:
    2039142
  • 财政年份:
    2020
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
SI2-SSI: Collaborative Research: Einstein Toolkit Community Integration and Data Exploration
SI2-SSI:协作研究:Einstein Toolkit 社区集成和数据探索
  • 批准号:
    2114580
  • 财政年份:
    2020
  • 资助金额:
    $ 250万
  • 项目类别:
    Continuing Grant
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
  • 批准号:
    2001752
  • 财政年份:
    2019
  • 资助金额:
    $ 250万
  • 项目类别:
    Standard Grant
SSI efficacy in cancer and inflammatory bowel disease animal models
SSI 在癌症和炎症性肠病动物模型中的功效
  • 批准号:
    523831-2018
  • 财政年份:
    2018
  • 资助金额:
    $ 250万
  • 项目类别:
    Experience Awards (previously Industrial Undergraduate Student Research Awards)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了