Implementing the Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL)
实施基因组数据科学分析、可视化和信息学实验室空间 (AnVIL)
基本信息
- 批准号:10220581
- 负责人:
- 金额:$ 40万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2018
- 资助国家:美国
- 起止时间:2018-09-21 至 2023-06-30
- 项目状态:已结题
- 来源:
- 关键词:AddressBioconductorBiomedical ResearchDataData AnalysesData ScienceData Storage and RetrievalEcosystemElectronic Health RecordElementsEnsureEnvironmentFast Healthcare Interoperability ResourcesGalaxyGenomicsGenotype-Tissue Expression ProjectHealthcareIndividualInformaticsInstitutesMonitorMovementNational Human Genome Research InstitutePatientsRecordsResearchResearch PersonnelServicesSystemSystems AnalysisTechnologyThe Cancer Genome AtlasTrans-Omics for Precision MedicineUnited States National Institutes of HealthVisualizationWorkbasecloud basedcloud platformcomputational platformcomputing resourcesdata accessdata formatdata ingestiondata managementdata portaldata resourcedata sharingdata warehousegenomic datahealth datainteroperabilitymolecular phenotypenext generationphenotypic datatoolworking group
项目摘要
Project Summary
The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) powers the
next generation of computational genomics research using cloud-scale data and compute resources. The
platform is built on a set of established components, including the Terra computing platform and Dockstore
for standards-based sharing of containerized tools and workflows. It also provides multiple entry points for
data access and analysis, including batch workflows with Terra, notebook environments including Jupyter and
RStudio, Bioconductor packages for building analysis on top of AnVIL APIs and services, and will soon offer
Galaxy instances for interactive analysis. By providing a unified environment for data management and
compute, AnVIL eliminates the need for data movement, allows for controlled access to sensitive data and
monitoring, and provides elastic, shared computing resources that can be acquired by researchers as needed.
NIH-sponsored biomedical research is increasingly moving to cloud-based data storage and analysis systems,
with major cloud portals established for GTEx, Kids First, TOPMed, TCGA and several other major initiatives.
However, using these systems together is a challenge. The individual data portals enable researchers to browse
and query their own data but have limited functionality to share data or user registrations across portals or
with cloud based workspaces, like Terra and Galaxy. The recently established NIH Cloud Platform
Interoperability (NCPI) effort aims to address these issues by implementing key interoperability technologies
across multiple NIH institutes. Under this project, we will work the NCPI working groups to define the use
cases and standards for interoperability as well as implement three major technologies recommended by the
NCPI within the Galaxy and R/Bioconductor components of AnVIL. First, we will implement the NIH
Researcher Auth Service (RAS) to provide a common mechanism for researchers to establish their identity and
access data they are authorized to use across Terra and Galaxy. Second, we will implement the Global Alliance
for Genomics and Health (GA4GH) Data Repository Service (DRS) so that data consumers, including
workflow systems, can access data objects in a single, standard way regardless of where they are stored and
how they are managed. Finally, we will develop initial support in AnVIL for the Fast Healthcare
Interoperability Resources (FHIR) standard. This standard describes data formats, elements, and an API for
exchanging electronic health records (EHR), especially to ensure these records are available, discoverable, and
understandable as patients move around the healthcare ecosystem. FHIR support in AnVIL will facilitate
access to eMERGE and related projects by users once the data are ingested in AnVIL.
项目摘要
NHGRI基因组数据科学分析,可视化和信息学实验室空间(AnVIL)为
利用云规模数据和计算资源进行下一代计算基因组学研究。的
平台构建在一组已建立的组件之上,包括Terra计算平台和Dockstore
用于基于标准的容器化工具和工作流共享。它还提供了多个入口点,
数据访问和分析,包括使用Terra的批处理工作流,笔记本环境,包括Quixyter,
RStudio,Bioconductor软件包,用于在AnVIL API和服务之上构建分析,并将很快提供
用于交互式分析的Galaxy实例。通过提供统一的数据管理环境,
AnVIL消除了数据移动的需要,允许对敏感数据的受控访问,
监控,并提供弹性,共享计算资源,可以由研究人员根据需要获得。
NIH赞助的生物医学研究越来越多地转向基于云的数据存储和分析系统,
为GTEx、Kids First、TOPMed、TCGA和其他几个主要计划建立了主要的云门户。
然而,将这些系统结合使用是一个挑战。各个数据门户使研究人员能够浏览
并查询自己的数据,但在跨门户共享数据或用户注册方面功能有限,
基于云计算的云计算,比如Terra和Galaxy。最近建立的NIH云平台
互操作性(NCPI)工作旨在通过实现关键的互操作性技术来解决这些问题
在多个国家卫生研究院。在这个项目下,我们将与NCPI工作组合作,
案例和互操作性标准,并实施
AnVIL的Galaxy和R/Bioconductor组件内的NCPI。首先,我们将实施NIH
研究人员身份验证服务(RAS)为研究人员提供一个通用机制,以建立他们的身份,
访问他们被授权在Terra和Galaxy上使用的数据。第二,我们将实施全球联盟
基因组学和健康(GA 4GH)数据存储库服务(DRS),以便数据消费者,包括
工作流系统可以以单一的标准方式访问数据对象,而不管它们存储在哪里,
如何管理它们。最后,我们将在AnVIL中为Fast Healthcare开发初始支持
互操作性资源(FHIR)标准。该标准描述了数据格式、元素和API,
交换电子健康记录(EHR),特别是确保这些记录可用、可验证,
这是可以理解的,因为病人在医疗生态系统中移动。FHIR在AnVIL的支持将促进
一旦AnVIL中的数据被摄取,用户就可以访问eMERGE和相关项目。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
                item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ patent.updateTime }}
Jeremy Goecks其他文献
Jeremy Goecks的其他文献
{{
              item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
{{ truncateString('Jeremy Goecks', 18)}}的其他基金
Scalable multi-mode education to increase use of ITCR tools by diverse analysts
可扩展的多模式教育,以增加不同分析师对 ITCR 工具的使用
- 批准号:10669864 
- 财政年份:2020
- 资助金额:$ 40万 
- 项目类别:
Scalable multi-mode education to increase use of ITCR tools by diverse analysts
可扩展的多模式教育,以增加不同分析师对 ITCR 工具的使用
- 批准号:10250548 
- 财政年份:2020
- 资助金额:$ 40万 
- 项目类别:
Scalable multi-mode education to increase use of ITCR tools by diverse analysts
可扩展的多模式教育,以增加不同分析师对 ITCR 工具的使用
- 批准号:10075552 
- 财政年份:2020
- 资助金额:$ 40万 
- 项目类别:
A Federated Galaxy for user-friendly large-scale cancer genomics research
用于用户友好的大规模癌症基因组学研究的联邦星系
- 批准号:10245142 
- 财政年份:2018
- 资助金额:$ 40万 
- 项目类别:
A Federated Galaxy for user-friendly large-scale cancer genomics research
用于用户友好的大规模癌症基因组学研究的联邦星系
- 批准号:10908030 
- 财政年份:2018
- 资助金额:$ 40万 
- 项目类别:
Implementing the Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL)
实施基因组数据科学分析、可视化和信息学实验室空间 (AnVIL)
- 批准号:10405959 
- 财政年份:2018
- 资助金额:$ 40万 
- 项目类别:
A Federated Galaxy for user-friendly large-scale cancer genomics research
用于用户友好的大规模癌症基因组学研究的联邦星系
- 批准号:10461143 
- 财政年份:2018
- 资助金额:$ 40万 
- 项目类别:
相似海外基金
Supplement:  Enhancing Community Contributions to Bioconductor With Build System Containerization and a GPU for Testing
补充:通过构建系统容器化和用于测试的 GPU 增强社区对 Bioconductor 的贡献
- 批准号:10838736 
- 财政年份:2023
- 资助金额:$ 40万 
- 项目类别:
Data infrastructure for single-cell multiplex imaging in Bioconductor
Bioconductor 中单细胞多重成像的数据基础设施
- 批准号:10831240 
- 财政年份:2022
- 资助金额:$ 40万 
- 项目类别:
Cancer Genomics: Integrative and Scalable Solutions in R/Bioconductor
癌症基因组学:R/Bioconductor 中的集成且可扩展的解决方案
- 批准号:10703230 
- 财政年份:2021
- 资助金额:$ 40万 
- 项目类别:
Durable Common Fund Data Interfaces and Tutorials with Bioconductor
持久的共同基金数据接口和 Bioconductor 教程
- 批准号:10356362 
- 财政年份:2021
- 资助金额:$ 40万 
- 项目类别:
Cancer Genomics: Integrative and Scalable Solutions in R/Bioconductor
癌症基因组学:R/Bioconductor 中的集成且可扩展的解决方案
- 批准号:10594231 
- 财政年份:2021
- 资助金额:$ 40万 
- 项目类别:
Cancer Genomics: Integrative and Scalable Solutions in R/Bioconductor
癌症基因组学:R/Bioconductor 中的集成且可扩展的解决方案
- 批准号:10449603 
- 财政年份:2021
- 资助金额:$ 40万 
- 项目类别:
Cancer Genomics: Integrative and Scalable Solutions in R/Bioconductor
癌症基因组学:R/Bioconductor 中的集成且可扩展的解决方案
- 批准号:10478123 
- 财政年份:2021
- 资助金额:$ 40万 
- 项目类别:
Accelerating Cancer Genomics with Cloud-scale Bioconductor
利用云规模 Bioconductor 加速癌症基因组学
- 批准号:9478159 
- 财政年份:2017
- 资助金额:$ 40万 
- 项目类别:
Cancer Genomics:Integrative and Scalable Solutions in R / Bioconductor
癌症基因组学:R / Bioconductor 中的集成且可扩展的解决方案
- 批准号:9186264 
- 财政年份:2014
- 资助金额:$ 40万 
- 项目类别:
Cancer Genomics:Integrative and Scalable Solutions in R / Bioconductor
癌症基因组学:R / Bioconductor 中的集成且可扩展的解决方案
- 批准号:9334747 
- 财政年份:2014
- 资助金额:$ 40万 
- 项目类别:

 刷新
              刷新
            
















 {{item.name}}会员
              {{item.name}}会员
            



