ABI Development: bioKepler: A Comprehensive Bioinformatics Scientific Workflow Module for Distributed Analysis of Large-Scale Biological Data
ABI 开发:bioKepler:用于大规模生物数据分布式分析的综合生物信息学科学工作流程模块
基本信息
- 批准号:1062565
- 负责人:
- 金额:$ 140.92万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2011
- 资助国家:美国
- 起止时间:2011-08-15 至 2015-07-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The University of California at San Diego is awarded a grant to create a Kepler Scientific Workflow System (http://kepler-project.org) module that facilitates the development of Kepler workflows for integrated execution of bioinformatics applications in distributed environments. Next-generation DNA sequencing generates a very large amount of sequence data that can be used in numerous applications addressing many scientific challenges. This places unprecedented demands on traditional single-processor bioinformatics algorithms. In addition, enabling bioinformaticians and computational biologists to conduct efficient analysis requires higher-level abstractions on top of scientific workflow systems and distributed computing methods. To develop such an environment, the bioKepler project will create scientific workflow components to execute a set of bioinformatics tools using distributed execution patterns. Once customized, these scientific workflow components will be executed on multiple distributed platforms including various Cloud and Grid computing platforms. The initial set of bioinformatics tools will be selected based on an evaluation and integration of a wide range of community tools and workflows to meet the diverse needs of researchers, organized into eight groups covering most aspect of bioinformatics applications: 1) Sequence database searches; 2) Mapping; 3) Sequence assembly; 4) Gene prediction; 5) Clustering; 6) Multiple sequence alignment, phylogeny and taxonomy; 7) Protein annotation; 8) Other miscellaneous utilities including data format transformation and parsing. The project will also study how these distributed execution patterns will affect or improve workflow scheduling and execution in distributed environments. In addition, the project will deliver virtual machines that include a Kepler engine and all the bioKepler components for bioinformatics tools and applications. The developed tools will be applicable to a wide range of bioinformatics and computational biology problems. The central rationale for the planned education and outreach efforts is the importance of training next generation scientists. This rationale also aligns with the primary goal of the project to provide tools to further bridge the gap between bioinformatics and technology. The impact of such an approach is multifold, including facilitating bioinformaticians (and potentially scientists from other disciplines) to conduct efficient, comprehensive and parallelized analyses using domain-specific distributed execution components without writing a single line of code. In addition to the project workshops, usage scenarios will be solicited via surveys with follow-up phone discussions, and representation at major domain conferences will solicit input on priorities and raise awareness of the products in later years. The bioKepler team is committed to diversity as demonstrated by the involvement of three females (including PI Altintas) in the group of seven funded personnel and the broad range of efforts to include underrepresented students. All the resource, materials and the open-source software products produced by the bioKepler ABI Development project will be integrated with the CAMERA (http://camera.calit2.net/) project for a community of nearly 4000 devoted users in over 75 countries worldwide, and will be made publicly available to a larger audience through the Kepler project website (http://kepler-project.org).
加州大学圣地亚哥分校获得了创建开普勒科学工作流系统(http://kepler-project.org)模块)的资助,该模块促进了开普勒工作流的开发,以便在分布式环境中集成执行生物信息学应用程序。下一代DNA测序产生了非常大量的序列数据,这些数据可以用于许多应用程序,解决许多科学挑战。这对传统的单处理器生物信息学算法提出了前所未有的要求。此外,使生物信息学家和计算生物学家能够进行有效的分析,需要在科学工作流系统和分布式计算方法的基础上进行更高级别的抽象。为了开发这样的环境,生物开普勒项目将创建科学的工作流程组件,以使用分布式执行模式执行一组生物信息学工具。一旦定制,这些科学工作流组件将在多个分布式平台上执行,包括各种云和网格计算平台。最初的一套生物信息学工具将在评估和整合各种社区工具和工作流程的基础上选出,以满足研究人员的不同需求,分为八组,涵盖生物信息学应用的大部分方面:1)序列数据库搜索;2)测绘;3)序列组装;4)基因预测;5)聚类;6)多序列比对、系统发育和分类学;7)蛋白质注释;8)其他各种实用工具,包括数据格式转换和解析。该项目还将研究这些分布式执行模式将如何影响或改进分布式环境中的工作流调度和执行。此外,该项目将交付包括开普勒引擎和用于生物信息学工具和应用的所有生物开普勒组件的虚拟机。所开发的工具将适用于广泛的生物信息学和计算生物学问题。计划中的教育和外展工作的主要理由是培训下一代科学家的重要性。这一理论基础也符合该项目的主要目标,即提供进一步弥合生物信息学和技术之间差距的工具。这种方法的影响是多方面的,包括促进生物信息学家(以及可能来自其他学科的科学家)使用特定于领域的分布式执行组件进行高效、全面和并行的分析,而无需编写一行代码。除了项目研讨会外,还将通过调查和后续电话讨论征求使用方案,在主要领域会议上的代表将征求对优先事项的意见,并在以后几年提高对产品的认识。生物开普勒团队致力于多样性,这一点体现在七名受资助人员中有三名女性(包括Pi Altintas),以及为吸纳代表性不足的学生所作的广泛努力。生物开普勒ABI开发项目生产的所有资源、材料和开源软件产品将与照相机(http://camera.calit2.net/)项目)整合在一起,该项目面向全球75个以上国家和地区的近4,000名忠实用户,并将通过开普勒项目网站(http://kepler-project.org).)向更多的受众公开
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Ilkay Altintas其他文献
Sex Differences in the Variability of Physical Activity Measurements Across Multiple Timescales Recorded by a Wearable Device: Observational Retrospective Cohort Study
可穿戴设备记录的多时间尺度下身体活动测量变异性的性别差异:观察性回顾性队列研究
- DOI:
10.2196/66231 - 发表时间:
2025-01-01 - 期刊:
- 影响因子:6.000
- 作者:
Kristin J Varner;Lauryn Keeler Bruce;Severine Soltani;Wendy Hartogensis;Stephan Dilchert;Frederick M Hecht;Anoushka Chowdhary;Leena Pandya;Subhasis Dasgupta;Ilkay Altintas;Amarnath Gupta;Ashley E Mason;Benjamin L Smarr - 通讯作者:
Benjamin L Smarr
Correction: Variability of temperature measurements recorded by a wearable device by biological sex
- DOI:
10.1186/s13293-023-00568-x - 发表时间:
2023-11-13 - 期刊:
- 影响因子:5.100
- 作者:
Lauryn Keeler Bruce;Patrick Kasl;Severine Soltani;Varun K. Viswanath;Wendy Hartogensis;Stephan Dilchert;Frederick M. Hecht;Anoushka Chowdhary;Claudine Anglo;Leena Pandya;Subhasis Dasgupta;Ilkay Altintas;Amarnath Gupta;Ashley E. Mason;Benjamin L. Smarr - 通讯作者:
Benjamin L. Smarr
Ilkay Altintas的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Ilkay Altintas', 18)}}的其他基金
Student and Early Career Support: 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid 2023)
学生和早期职业支持:第 23 届 IEEE/ACM 国际集群、云和互联网计算研讨会 (CCGrid 2023)
- 批准号:
2317547 - 财政年份:2023
- 资助金额:
$ 140.92万 - 项目类别:
Standard Grant
Planning: FIRE-PLAN: Community Building Toward an Immersive Forest Network to Catalyze Wildland Fire Solutions and Training
规划:FIRE-PLAN:建立沉浸式森林网络的社区,以促进荒地火灾解决方案和培训
- 批准号:
2341120 - 财政年份:2023
- 资助金额:
$ 140.92万 - 项目类别:
Standard Grant
National Data Platform Pilot: Services for Equitable Open Access to Data
国家数据平台试点:公平开放数据访问服务
- 批准号:
2333609 - 财政年份:2023
- 资助金额:
$ 140.92万 - 项目类别:
Continuing Grant
Collaborative Research: CyberTraining: Implementation: Medium: FOUNT: Scaffolded, Hands-On Learning for a Data-Centric Future
协作研究:网络培训:实施:媒介:FOUNT:支架式实践学习,打造以数据为中心的未来
- 批准号:
2230081 - 财政年份:2022
- 资助金额:
$ 140.92万 - 项目类别:
Standard Grant
NSF Convergence Accelerator – Track D: Artificial Intelligence and Community Driven Wildland Fire Innovation via a WIFIRE Commons Infrastructure for Data and Model Sharing
NSF 融合加速器 — 轨道 D:通过 WIFIRE 共享基础设施实现数据和模型共享,人工智能和社区驱动的野地火灾创新
- 批准号:
2134904 - 财政年份:2021
- 资助金额:
$ 140.92万 - 项目类别:
Cooperative Agreement
NSF Convergence Accelerator Track D: Artificial Intelligence and Community Driven Wildland Fire Innovation via a WIFIRE Commons Infrastructure for Data and Model Sharing
NSF 融合加速器轨道 D:通过 WIFIRE 共享基础设施实现数据和模型共享,人工智能和社区驱动的野地火灾创新
- 批准号:
2040676 - 财政年份:2020
- 资助金额:
$ 140.92万 - 项目类别:
Standard Grant
Collaborative Research: Framework: Software: NSCI : Computational and Data Innovation Implementing a National Community Hydrologic Modeling Framework for Scientific Discovery
合作研究:框架:软件:NSCI:计算和数据创新实施国家社区水文建模框架以促进科学发现
- 批准号:
1835855 - 财政年份:2018
- 资助金额:
$ 140.92万 - 项目类别:
Standard Grant
Hazards SEES Type 2: WIFIRE: A Scalable Data-Driven Monitoring, Dynamic Prediction and Resilience Cyberinfrastructure for Wildfires
Hazards SEES 类型 2:WIFIRE:可扩展的数据驱动型野火监控、动态预测和弹性网络基础设施
- 批准号:
1331615 - 财政年份:2013
- 资助金额:
$ 140.92万 - 项目类别:
Continuing Grant
EAGER: Interoperability Testbed - Assessing a Layered Architecture for Integration of Existing Capabilities
EAGER:互操作性测试台 - 评估用于集成现有功能的分层架构
- 批准号:
1239623 - 财政年份:2012
- 资助金额:
$ 140.92万 - 项目类别:
Standard Grant
相似国自然基金
水稻边界发育缺陷突变体abnormal boundary development(abd)的基因克隆与功能分析
- 批准号:32070202
- 批准年份:2020
- 资助金额:58 万元
- 项目类别:面上项目
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
- 批准号:
- 批准年份:2020
- 资助金额:40 万元
- 项目类别:
相似海外基金
Development of a new solid tritium breeder blanket
新型固体氚增殖毯的研制
- 批准号:
2908923 - 财政年份:2027
- 资助金额:
$ 140.92万 - 项目类别:
Studentship
Optimal utility-based design of oncology clinical development programmes
基于效用的肿瘤学临床开发项目的优化设计
- 批准号:
2734768 - 财政年份:2026
- 资助金额:
$ 140.92万 - 项目类别:
Studentship
REU Site: Microbial Biofilm Development, Resistance, & Community Structure
REU 网站:微生物生物膜的发展、耐药性、
- 批准号:
2349311 - 财政年份:2025
- 资助金额:
$ 140.92万 - 项目类别:
Continuing Grant
SoundDecisions - Musical Listening, Decision Making, And Equitable Development In The Mekong Delta
SoundDecisions - 湄公河三角洲的音乐聆听、决策和公平发展
- 批准号:
EP/Z000424/1 - 财政年份:2025
- 资助金额:
$ 140.92万 - 项目类别:
Research Grant
Bio-MATSUPER: Development of high-performance supercapacitors based on bio-based carbon materials
Bio-MATSUPER:开发基于生物基碳材料的高性能超级电容器
- 批准号:
EP/Z001013/1 - 财政年份:2025
- 资助金额:
$ 140.92万 - 项目类别:
Fellowship
Development of a Cell-Based Assay for Tetanus Vaccine Quality Control
破伤风疫苗质量控制细胞检测方法的开发
- 批准号:
10101986 - 财政年份:2024
- 资助金额:
$ 140.92万 - 项目类别:
Collaborative R&D
HURR — Platform Development
HURR – 平台开发
- 批准号:
10103254 - 财政年份:2024
- 资助金额:
$ 140.92万 - 项目类别:
Investment Accelerator
Automatic battery swapping cabinet development for scalability of e-mobility in Uganda
自动电池交换柜开发,以提高乌干达电动汽车的可扩展性
- 批准号:
10080435 - 财政年份:2024
- 资助金额:
$ 140.92万 - 项目类别:
Collaborative R&D
Development of digital diagnostics services for Parkinson’s disease
开发帕金森病数字诊断服务
- 批准号:
10086932 - 财政年份:2024
- 资助金额:
$ 140.92万 - 项目类别:
Collaborative R&D
RestoreDNA: Development of scalable eDNA-based solutions for biodiversity regulators and nature-related disclosure
RestoreDNA:为生物多样性监管机构和自然相关披露开发可扩展的基于 eDNA 的解决方案
- 批准号:
10086990 - 财政年份:2024
- 资助金额:
$ 140.92万 - 项目类别:
Collaborative R&D