Frameworks: Re-Engineering Galaxy for Performance, Scalability and Energy Efficiency

框架:重新设计 Galaxy 以提高性能、可扩展性和能源效率

基本信息

  • 批准号:
    1931531
  • 负责人:
  • 金额:
    $ 350万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2019
  • 资助国家:
    美国
  • 起止时间:
    2019-10-01 至 2024-09-30
  • 项目状态:
    已结题

项目摘要

Biomedical research is an important branch of science that deals with the problem of studying biological processes and identifying, preventing and curing diseases. This research forms the pathway to the discovery of new medicines as well as new therapies. As such, biomedical research is crucial to advance the national health and prosperity. Given the geographically distributed research groups and biomedical labs, collaborative science plays a very important role in biomedical research. Galaxy is an open source, web-based framework that is extensively used by more than 20,000 researchers world-wide for conducting research in many application domains, the most prominent of which is biomedical research. It provides a web-based environment using which scientists perform various computational analyses on their data, exchange results from these analyses, explore new research concepts, facilitate student training, and preserve their results for future use. Galaxy currently runs on a large variety of high-performance computing (HPC) platforms including local clusters, supercomputers in national labs, public datacenters and Cloud. Unfortunately, while most of these systems supplement conventional CPUs with significant accelerator capabilities (in the form of Graphical Processing Units (GPUs) and/or Field-Programmable Gate Arrays (FPGAs)), the current Galaxy implementation does not take advantage of these powerful accelerators. This project enhances the Galaxy framework so that it can take full advantage of the tremendous computational capabilities offered by GPUs and FPGAs. By doing so, the important applications running under Galaxy experiences significant speedups, thereby accelerating scientific discoveries. This project consists of four complementary tasks, which follow a logistic progression as follows: Task-I focuses on redesigning existing Galaxy tools with GPU/FPGA support and integrate them to Galaxy tool-chains; Task-II provides containerization support for the tools and accelerator-aware orchestration for running Galaxy on cloud platforms; Task-III implements specific policy driven scheduling schemes for Task-I and Task-II; and finally, Task-IV redesigns Galaxy storage to speed up execution and reduce bottlenecks related to data transfer. The proposed enhancements to Galaxy enables the integration of innovation with discovery by providing a state-of-the art experimental platform to a larger community of researchers across several disciplines. On the broader impact and outreach/educational front, this project impacts the performance and energy efficiency of Galaxy tools and applications and improves the productivity of a typical Galaxy user tremendously; that is, the main beneficiaries of this project are thousands of members of existing Galaxy Community. However, this project also (i) helps existing GPU and FPGA based (non-Galaxy) applications start using Galaxy, thereby taking full advantage of all existing toolsets within the framework, (ii) enables Galaxy tools to take better advantage of emerging cluster scheduling capabilities, and (iii) creates a synergy with concurrent Galaxy related efforts and existing infrastructure efforts the PIs are involved with, to further expedite scientific discoveries. As such, this proposed system support will have a broad societal impact via the enhanced Galaxy system support. On the education side, the project involves under-represented groups in computer science as well as in bio-informatics, outreach to undergraduates, various K-12 related activities (Science-U, CSATS, VIEW), and engagement with researchers in other disciplines (e.g., natural language processing, image processing, drug discovery and cosmology) via a workshop open to the Galaxy community.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
生物医学研究是一门重要的科学分支,它研究生物过程,识别、预防和治疗疾病。这项研究形成了发现新药和新疗法的途径。因此,生物医学研究对促进国家健康和繁荣至关重要。由于研究小组和生物医学实验室的地理分布,合作科学在生物医学研究中发挥着非常重要的作用。Galaxy是一个基于Web的开源框架,被全球20,000多名研究人员广泛用于在许多应用领域进行研究,其中最突出的是生物医学研究。 它提供了一个基于网络的环境,科学家可以使用它对数据进行各种计算分析,交换这些分析的结果,探索新的研究概念,促进学生培训,并保存他们的结果供将来使用。Galaxy目前在各种高性能计算(HPC)平台上运行,包括本地集群,国家实验室的超级计算机,公共数据中心和云。不幸的是,虽然这些系统中的大多数都为传统CPU提供了重要的加速器功能(以图形处理单元(GPU)和/或现场可编程门阵列(FPGA)的形式),但当前的Galaxy实现并没有利用这些强大的加速器。该项目增强了Galaxy框架,以便它可以充分利用GPU和FPGA提供的巨大计算能力。通过这样做,在Galaxy下运行的重要应用程序将获得显著的加速,从而加速科学发现。 该项目由四个互补的任务组成,遵循以下逻辑进展:任务I侧重于重新设计具有GPU/FPGA支持的现有Galaxy工具并将其集成到Galaxy工具链中;任务II为工具提供容器化支持,并为在云平台上运行Galaxy提供加速器感知的编排;任务III为任务I和任务II实现特定的策略驱动调度方案;最后,Task-IV重新设计了Galaxy存储,以加快执行速度并减少与数据传输相关的瓶颈。拟议的Galaxy增强功能通过为多个学科的更大研究人员社区提供最先进的实验平台,实现了创新与发现的整合。在更广泛的影响和外联/教育方面,该项目影响到银河工具和应用程序的性能和能效,并极大地提高了银河系统典型用户的生产力;也就是说,该项目的主要受益者是银河系统社区现有的数千名成员。然而,该项目还(i)帮助现有的GPU和FPGA的基础上,(非银河系统)应用程序开始使用银河系统,从而充分利用框架内的所有现有工具集,㈡使银河系统工具能够更好地利用新出现的集群调度能力,㈢与同时进行的银河系统相关工作和主要参与人参与的现有基础设施工作产生协同作用,以进一步加速科学发现。因此,拟议的系统支助将通过加强银河系统支助产生广泛的社会影响。 在教育方面,该项目涉及计算机科学和生物信息学的代表性不足的群体,对本科生的宣传,各种K-12相关活动(Science-U,CSATS,VIEW),以及与其他学科的研究人员的接触(例如,该奖项反映了NSF的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(13)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Kube-Knots: Resource Harvesting through Dynamic Container Orchestration in GPU-based Datacenters
Compression Algorithm for Colored de Bruijn Graphs
彩色 de Bruijn 图的压缩算法
Multiverse: Dynamic VM Provisioning for Virtualized High Performance Computing Clusters
Cocktail: A Multidimensional Optimization for Model Serving in Cloud
Cocktail:云中模型服务的多维优化
Efficient mapping of accurate long reads in minimizer space with mapquik.
  • DOI:
    10.1101/gr.277679.123
  • 发表时间:
    2023-07
  • 期刊:
  • 影响因子:
    7
  • 作者:
    Ekim, Baris;Sahlin, Kristoffer;Medvedev, Paul;Berger, Bonnie;Chikhi, Rayan
  • 通讯作者:
    Chikhi, Rayan
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Mahmut Kandemir其他文献

Particle simulation on the Cell BE architecture
A case for core-assisted bottleneck acceleration in GPUs
GPU 中核心辅助瓶颈加速的案例
  • DOI:
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Nandita Vijaykumar;Gennady Pekhimenko;Adwait Jog;A. Bhowmick;Rachata Ausavarungnirun;Chita R. Das;Mahmut Kandemir;T. Mowry;O. Mutlu
  • 通讯作者:
    O. Mutlu
Optimizing Leakage Energy Consumption in Cache Bitlines
  • DOI:
    10.1007/s10617-005-5345-4
  • 发表时间:
    2004-03-01
  • 期刊:
  • 影响因子:
    0.900
  • 作者:
    Soontae Kim;Narayanan Vijaykrishnan;Mahmut Kandemir;Mary Jane Irwin
  • 通讯作者:
    Mary Jane Irwin
Time-constrained optimization of multi-AUV cooperative mine detection
多AUV协同探雷的时间约束优化
  • DOI:
    10.1109/oceans.2008.5151971
  • 发表时间:
    2008
  • 期刊:
  • 影响因子:
    0
  • 作者:
    R. Prins;Mahmut Kandemir
  • 通讯作者:
    Mahmut Kandemir
An I/O-Conscious Tiling Strategy for Disk-Resident Data Sets
  • DOI:
    10.1023/a:1014156327748
  • 发表时间:
    2002-01-01
  • 期刊:
  • 影响因子:
    2.700
  • 作者:
    Mahmut Kandemir;Alok Choudhary;J. Ramanujam
  • 通讯作者:
    J. Ramanujam

Mahmut Kandemir的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Mahmut Kandemir', 18)}}的其他基金

Collaborative Research: CNS Core: Small: Resource-efficient, Strongly Consistent Replication for the Cloud
合作研究:CNS 核心:小型:资源高效、强一致性的云复制
  • 批准号:
    2149389
  • 财政年份:
    2022
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant
PPoSS: Planning: Cross-Layer Design for Cost-Effective HPC in the Cloud
PPoSS:规划:云中经济高效 HPC 的跨层设计
  • 批准号:
    2028929
  • 财政年份:
    2020
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant
SaTC: CORE: Small: Automatic Software Patching against Microarchitectual Attacks
SaTC:核心:小型:针对微架构攻击的自动软件修补
  • 批准号:
    1956032
  • 财政年份:
    2020
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant
SHF: Small: Characterizing and Optimizing 3D NAND Flash
SHF:小型:表征和优化 3D NAND 闪存
  • 批准号:
    1908793
  • 财政年份:
    2019
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant
XPS: FULL: A Fresh Look at Near Data Computing: Coordinated Data and Computation Government
XPS:完整:近数据计算的新视角:协调数据和计算政府
  • 批准号:
    1629129
  • 财政年份:
    2016
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant
CSR: Medium: Collaborative Research: Enabling GPUs as First-Class Computing Engines
CSR:媒介:协作研究:使 GPU 成为一流的计算引擎
  • 批准号:
    1409095
  • 财政年份:
    2014
  • 资助金额:
    $ 350万
  • 项目类别:
    Continuing Grant
XPS: FULL:CCA: Extracting Scalable Parallelism by Relaxing the Contracts across the System Stack
XPS:FULL:CCA:通过放松整个系统堆栈的契约来提取可扩展的并行性
  • 批准号:
    1439021
  • 财政年份:
    2014
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant
SHF: Medium: Breaking the Physical Divide between Computation and NAND-Flash Storage
SHF:媒介:打破计算和 NAND 闪存存储之间的物理鸿沟
  • 批准号:
    1302557
  • 财政年份:
    2013
  • 资助金额:
    $ 350万
  • 项目类别:
    Continuing Grant
SHF: Medium: Automatic Control Driven Resource Management in Chip Multiprocessors
SHF:中:芯片多处理器中自动控制驱动的资源管理
  • 批准号:
    0963839
  • 财政年份:
    2010
  • 资助金额:
    $ 350万
  • 项目类别:
    Continuing Grant
Collaborative Research: Adaptive Techniques for Achieving End-to-End QoS in the I/O Stack on Petascale Multiprocessors
协作研究:在千万级多处理器上的 I/O 堆栈中实现端到端 QoS 的自适应技术
  • 批准号:
    0937949
  • 财政年份:
    2009
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant

相似国自然基金

RE-BOA联合VA-ECMO对难治性心跳骤停患者脑复苏效果的评估
  • 批准号:
    2025JJ80720
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Sr/RE复合变质下Al-Si-Mg-Cu合金的强韧化机制研究
  • 批准号:
    2025JJ80374
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
基于实施科学理论和PRISM/RE-AIM框架的基层疾控体系韧性研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
基于光生电荷调控的CeO2/RE-MOF异质结设计及光催化性能增强机 理研究
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
高效近红外发光Cr3+-RE3+共掺双钙钛矿的设计合成及pc-LED器件的制备与应用
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    10.0 万元
  • 项目类别:
    省市级项目
莺歌海盆地油气藏系统Re-Os年代学和Os同位素指纹研究
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    15.0 万元
  • 项目类别:
    省市级项目
S100-Anx复合体及RE-出胞复合体介导胎盘滋养细胞内B族链球菌出胞的分子机制
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
磁铁矿Re-Os同位素定年体系的关键控制因素研究
  • 批准号:
    42373027
  • 批准年份:
    2023
  • 资助金额:
    55 万元
  • 项目类别:
    面上项目

相似海外基金

ART: Re-engineering Technology Transfer at the University of Wyoming
ART:怀俄明大学的技术转让再造
  • 批准号:
    2331477
  • 财政年份:
    2024
  • 资助金额:
    $ 350万
  • 项目类别:
    Cooperative Agreement
Re-engineering metallic-based nanostructures for carbon dioxide conversion
重新设计用于二氧化碳转化的金属基纳米结构
  • 批准号:
    DE230101617
  • 财政年份:
    2024
  • 资助金额:
    $ 350万
  • 项目类别:
    Discovery Early Career Researcher Award
The shape of sustainable crop production - re-engineering plant architecture for sustainable food production
可持续作物生产的形态——为可持续粮食生产重新设计植物结构
  • 批准号:
    2878037
  • 财政年份:
    2023
  • 资助金额:
    $ 350万
  • 项目类别:
    Studentship
3D Bioprinting of a Bioelectric Cell Bridge for Re-engineering Cardiac Conduction
用于重新设计心脏传导的生物电细胞桥的 3D 生物打印
  • 批准号:
    10753836
  • 财政年份:
    2023
  • 资助金额:
    $ 350万
  • 项目类别:
REST-PSLL: Re-engineering Surgical Recovery and Transitions Using Technology Patient Safety Learning Laboratory
REST-PSLL:利用技术重新设计手术恢复和过渡患者安全学习实验室
  • 批准号:
    10768806
  • 财政年份:
    2023
  • 资助金额:
    $ 350万
  • 项目类别:
CAREER: Re-engineering Agricultural Drainage Infrastructure: Upscaling In-field Conservations for Regional Sustainability
职业:重新设计农业排水基础设施:扩大现场保护以实现区域可持续发展
  • 批准号:
    2239621
  • 财政年份:
    2023
  • 资助金额:
    $ 350万
  • 项目类别:
    Continuing Grant
Re-engineering differential regulation of ferroptosis in melanoma microenvironment
重新设计黑色素瘤微环境中铁死亡的差异调节
  • 批准号:
    10735217
  • 财政年份:
    2023
  • 资助金额:
    $ 350万
  • 项目类别:
Colour4CRAFTS: Colour for Combining, Re-engineering, Applying, Futuring, Transforming, Stretching
Colour4CRAFTS:用于组合、重新设计、应用、未来、转变、拉伸的颜色
  • 批准号:
    10076278
  • 财政年份:
    2023
  • 资助金额:
    $ 350万
  • 项目类别:
    EU-Funded
RUI: Re-engineering Ring-Cleaving Dioxygenases for Activity Towards Novel Substrates
RUI:重新设计开环双加氧酶以实现新型底物的活性
  • 批准号:
    2203928
  • 财政年份:
    2022
  • 资助金额:
    $ 350万
  • 项目类别:
    Standard Grant
[Re]Engineering water treatment systems for small communities
[Re]为小型社区设计水处理系统
  • 批准号:
    RGPIN-2018-04135
  • 财政年份:
    2022
  • 资助金额:
    $ 350万
  • 项目类别:
    Discovery Grants Program - Individual
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了