SciDAP: a next generation universal platform for collaborative data analysis
SciDAP:下一代协作数据分析通用平台
基本信息
- 批准号:10081764
- 负责人:
- 金额:$ 37.45万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-09-14 至 2022-05-31
- 项目状态:已结题
- 来源:
- 关键词:AdoptedAir MovementsBiologyClientCollaborationsCommunitiesComputer AnalysisComputer softwareCore FacilityCouplingCustomDNA MethylationDNA-Protein InteractionDataData AnalysesDevelopmentDiseaseDoctor of PhilosophyEnsureEnvironmentFeelingGene Expression ProfilingGenomicsHealthInstitutionInstructionLaboratoriesLanguageLearningLifeMedical centerMetadataMethodsMethylationModificationOutputPediatric HospitalsProceduresProcessProductivityPublicationsRNAReproducibilityResearchResearch PersonnelSPT6 ProteinScientistServicesSoftware ToolsSystemTechniquesTechnologyTestingVariantVisualizationWorkXCL1 geneanalysis pipelineautomated analysisbasebioinformatics toolbiomedical scientistbisulfite sequencingcomputational pipelinescomputerized data processingdesignexperimental studyflexibilitygenome browsergenomic datagraphical user interfaceinsightinteractive toolinterestmultiple omicsnew technologynext generationnext generation sequencingnovelopen sourceportabilityprototypescientific computingsingle-cell RNA sequencingtooltranscriptomicsusabilityuser friendly softwareuser-friendly
项目摘要
The recent proliferation of next-generation sequencing (NGS) - based methods for the analysis of gene expres-
sion, chromatin structure and protein-DNA interactions has created tremendous opportunities for gaining novel
insights into basic biology, health, and disease. However, analysis of the resulting data requires computational
expertise that many traditional biologists do not possess. Hence, when dealing with genomics data, majority of
biologists require the help of bioinformaticians even for simple tasks. This places these exciting methods beyond
the reach of the majority of life scientists.
This proposal from DATIRIUM, LLC, a start-up company from Cincinnati, OH describes a plan to create
SciDAP (Scientific Data Analysis Platform), a novel multi-omics user-friendly data analysis platform to allow
biologists to analyze the data themselves and to enable collaboration between biologists and bioinformaticians.
Datirium was founded by this application’s PIs, Artem Barski, PhD and Andrey Kartashov, initially to assist users
with installation and support of BioWardrobe. BioWardrobe, a user-friendly open-source integrative genomics
analysis platform, was developed by the Barski laboratory at Cincinnati Children’s Hospital Medical Center
(CCHMC) in 2015. It has been used by more than 40 CCHMC laboratories to process more than 8000 experi-
ments and has been applied in more than 40 publications. In addition, Datirium has installed and continues to
maintain BioWardrobe servers at several external research centers.
For Datirium, BioWardrobe served as a Minimum Viable Product (MVP) that allowed to confirm the need
and existence of market niche for such software, but also highlighted several design limitations. The key among
them was the difficulty in adding new or modifying existing pipelines: due to the tight coupling between pipeline
and user interface this required changes at all levels of software. Unfortunately, the same limitation exists for all
user-friendly bioinformatics tools. Given that there are more than 150 NGS-based methods and many ways to
process the data, this explains why a universal and user-friendly data analysis platform does not yet exist.
We hypothesize that we can create a data analysis platform that is both universal and user-friendly by includ-
ing interface instructions into computational pipelines itself. Platform will use these instructions to create a
graphical interface for users. Specifically, we are using containerized pipelines developed using Common Work-
flow Language (CWL). CWL allows to describe tools, pipelines and computational environment making these
pipelines both portable and reproducible. On top of CWL, Datirium developed a system of CWL extensions that
allows to describe the inputs and outputs visualizations within the CWL workflows. Importantly, our platform
will increase the rigor of computational analysis by (i) making the analysis reproducible and auditable by bioin-
formaticians due to CWL pipeline portability and recording each step of the analysis as Research Objects; (ii)
enabling collaboration between experimentalists and computational biologists by providing bioinformaticians
with a way to direct analysis flow and biologists with the convenience of GUI; (iii) Including out of the box pipe-
lines with optimized parameters and actionable QC metrics that flag possible issues.
In the first aim of this proposal we will create a prototype of the platform. In the second aim, we will conduct
usability testing with bioinformaticians and experimentalists to test whether our platform can accommodate
diverse types of the analysis in a user-friendly fashion. Specifically, in collaboration with Dr. Salomonis at
CCHMC, we will test how easy it is to integrate two existing analysis routines into SciDAP: BS-Seq DNA-methyl-
ation and scRNA-Seq; then we will work with biologists to ensure that the resulting interface is both user-friendly
and enables collaboration with bioinformaticians.
Successful completion of this project will provide the research community with a cutting edge, flexible and
biologist-friendly data analysis platform.
近年来,基于下一代测序(NGS)的基因表达分析方法越来越多
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Artem Barski其他文献
Artem Barski的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Artem Barski', 18)}}的其他基金
Epigenetic mechanisms of disrupted neurodevelopment in Menke-Hennekam syndrome
Menke-Hennekam 综合征神经发育障碍的表观遗传机制
- 批准号:
10816703 - 财政年份:2023
- 资助金额:
$ 37.45万 - 项目类别:
An experimentally-refined, dynamic gene regulatory network model of T-cell memory
经过实验改进的 T 细胞记忆动态基因调控网络模型
- 批准号:
10576265 - 财政年份:2021
- 资助金额:
$ 37.45万 - 项目类别:
An experimentally-refined, dynamic gene regulatory network model of T-cell memory
经过实验改进的 T 细胞记忆动态基因调控网络模型
- 批准号:
10210685 - 财政年份:2021
- 资助金额:
$ 37.45万 - 项目类别:
Commercialization of SciDAP, a next generation universal platform for collaborative data analysis
SciDAP 的商业化,下一代协作数据分析通用平台
- 批准号:
10338010 - 财政年份:2021
- 资助金额:
$ 37.45万 - 项目类别:
An experimentally-refined, dynamic gene regulatory network model of T-cell memory
经过实验改进的 T 细胞记忆动态基因调控网络模型
- 批准号:
10368121 - 财政年份:2021
- 资助金额:
$ 37.45万 - 项目类别:
An experimentally-refined, dynamic gene regulatory network model of T-cell memory
经过实验改进的 T 细胞记忆动态基因调控网络模型
- 批准号:
10213550 - 财政年份:2020
- 资助金额:
$ 37.45万 - 项目类别:
Death-Seq, a Method for Genome-wide Identification of Functional Silencer Elements
Death-Seq,一种全基因组识别功能性沉默元件的方法
- 批准号:
9979291 - 财政年份:2020
- 资助金额:
$ 37.45万 - 项目类别: