Analysis of genomics datasets at a massive scale
大规模基因组学数据集分析
基本信息
- 批准号:10223343
- 负责人:
- 金额:$ 46.17万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2017
- 资助国家:美国
- 起止时间:2017-08-01 至 2023-07-31
- 项目状态:已结题
- 来源:
- 关键词:AddressAllelesBioconductorBiologyCatalogsCloud ComputingCommunitiesComputer softwareComputing MethodologiesDataDiseaseEventExcisionExonsGene ExpressionGene Expression ProfileGene Expression ProfilingGenesGenetic TranscriptionGenetic VariationGenomeGenotype-Tissue Expression ProjectGoalsHumanHuman BiologyImpact evaluationLeadLicensingLightLinear ModelsMapsMeasuresMetadataMethodsModelingMorphologic artifactsNucleotidesPathogenicityPatternProceduresProcessQuality ControlRNARNA SplicingRegulationResearchResolutionResourcesSample SizeSamplingSequence Read ArchiveSpliced GenesStatistical MethodsSystematic BiasThe Cancer Genome AtlasTissuesTranscriptUpdateVariantVisualizationWorkbasecell typedata resourcegenome-widegenomic datahuman RNA sequencinghuman diseaseinsightopen sourcereconstructionstatisticstranscriptometranscriptome sequencing
项目摘要
Understanding both normal and pathogenic patterns of human gene expression can help shed light on the biology
of human disease. Thousands of studies have now been undertaken measuring gene expression in different
tissues and diseases. By aggregating and analyzing all available human RNA-sequencing data using a high
powered computational and statistical framework, we will provide a transformative resource for characterizing
human gene expression patterns including rare transcriptional events, cellular networks, and genetic variation.
In Aim 1 we propose to uniformly process all publicly available human transcriptome sequencing data and collect it
into a publicly available resource called the Transcriptome Aggregation Resource (TAR); at least 150,000 samples
will be processed using cloud computing. This resource will contain single-base resolution maps of expression,
de novo mapped exon-exon splice junctions and allele specific expression across a set of common variations.
We will supplement the expression data with cleaned and predicted metadata. In Aim 2 we will develop statistical
and computational methods necessary to fully realize the potential of this resource. Specifically we will remove
unwanted variation at scale and develop mixture models to summarize the large data resource at the gene,
junction and single base levels. In Aim 3 we will analyze this resource to address fundamental questions in
expression biology, include a systematic study of expression outliers and allele specific expression at the gene,
junction and single base resolution. We will infer well-powered co-expression networks over both expressed
genes and splicing patterns.
This work will contribute significantly to our understanding of gene expression by analyzing genomics data at a
massive scale.
了解人类基因表达的正常和致病模式有助于了解生物学
人类疾病的威胁。现在已经进行了数千项研究,以测量不同物种的基因表达
组织和疾病。通过使用HIGH来聚集和分析所有可用的人类RNA测序数据
强大的计算和统计框架,我们将提供一个变革性的资源来描述
人类基因表达模式包括罕见的转录事件、细胞网络和遗传变异。
在目标1中,我们建议统一处理所有公开可用的人类转录组测序数据并收集它们
到名为Transcriptome Aggregation Resource(TAR)的公开可用资源中;至少150,000个样本
将使用云计算进行处理。该资源将包含表达的单基分辨率图,
从新定位外显子-外显子剪接连接和等位基因fic在一组常见变异中的表达。
我们将使用经过清理和预测的元数据来补充表达式数据。在目标2中,我们将发展统计学
以及充分实现这一资源潜力所需的计算方法。我们将删除特定的fi
并开发混合模型来总结基因中的大量数据资源,
交界处标高和单基准标高。在目标3中,我们将分析此资源以解决以下基本问题
表达生物学,包括对基因上表达异常值和等位基因fic表达的系统研究,
结点和单碱基分辨率。我们将在两个表达的基础上推断功能强大的共表达网络
基因和拼接模式。
这项工作将通过对基因组数据的分析,为我们理解基因表达做出重大贡献。fi
巨大的规模。
项目成果
期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Missense variants causing Wiedemann-Steiner syndrome preferentially occur in the KMT2A-CXXC domain and are accurately classified using AlphaFold2.
- DOI:10.1371/journal.pgen.1010278
- 发表时间:2022-06
- 期刊:
- 影响因子:4.5
- 作者:
- 通讯作者:
recountmethylation enables flexible analysis of public blood DNA methylation array data.
- DOI:10.1093/bioadv/vbad020
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Leveraging the Mendelian disorders of the epigenetic machinery to systematically map functional epigenetic variation.
- DOI:10.7554/elife.65884
- 发表时间:2021-08-31
- 期刊:
- 影响因子:7.7
- 作者:Luperchio TR;Boukas L;Zhang L;Pilarowski G;Jiang J;Kalinousky A;Hansen KD;Bjornsson HT
- 通讯作者:Bjornsson HT
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Kasper Daniel Hansen其他文献
Kasper Daniel Hansen的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Kasper Daniel Hansen', 18)}}的其他基金
Data Science Tools to Increase Insight in Genomics Data
提高基因组数据洞察力的数据科学工具
- 批准号:
10623491 - 财政年份:2023
- 资助金额:
$ 46.17万 - 项目类别:
Analysis of genomics datasets at a massive scale
大规模基因组学数据集分析
- 批准号:
9978590 - 财政年份:2017
- 资助金额:
$ 46.17万 - 项目类别:
相似海外基金
Linkage of HIV amino acid variants to protective host alleles at CHD1L and HLA class I loci in an African population
非洲人群中 HIV 氨基酸变异与 CHD1L 和 HLA I 类基因座的保护性宿主等位基因的关联
- 批准号:
502556 - 财政年份:2024
- 资助金额:
$ 46.17万 - 项目类别:
Olfactory Epithelium Responses to Human APOE Alleles
嗅觉上皮对人类 APOE 等位基因的反应
- 批准号:
10659303 - 财政年份:2023
- 资助金额:
$ 46.17万 - 项目类别:
Deeply analyzing MHC class I-restricted peptide presentation mechanistics across alleles, pathways, and disease coupled with TCR discovery/characterization
深入分析跨等位基因、通路和疾病的 MHC I 类限制性肽呈递机制以及 TCR 发现/表征
- 批准号:
10674405 - 财政年份:2023
- 资助金额:
$ 46.17万 - 项目类别:
An off-the-shelf tumor cell vaccine with HLA-matching alleles for the personalized treatment of advanced solid tumors
具有 HLA 匹配等位基因的现成肿瘤细胞疫苗,用于晚期实体瘤的个性化治疗
- 批准号:
10758772 - 财政年份:2023
- 资助金额:
$ 46.17万 - 项目类别:
Identifying genetic variants that modify the effect size of ApoE alleles on late-onset Alzheimer's disease risk
识别改变 ApoE 等位基因对迟发性阿尔茨海默病风险影响大小的遗传变异
- 批准号:
10676499 - 财政年份:2023
- 资助金额:
$ 46.17万 - 项目类别:
New statistical approaches to mapping the functional impact of HLA alleles in multimodal complex disease datasets
绘制多模式复杂疾病数据集中 HLA 等位基因功能影响的新统计方法
- 批准号:
2748611 - 财政年份:2022
- 资助金额:
$ 46.17万 - 项目类别:
Studentship
Genome and epigenome editing of induced pluripotent stem cells for investigating osteoarthritis risk alleles
诱导多能干细胞的基因组和表观基因组编辑用于研究骨关节炎风险等位基因
- 批准号:
10532032 - 财政年份:2022
- 资助金额:
$ 46.17万 - 项目类别:
Recessive lethal alleles linked to seed abortion and their effect on fruit development in blueberries
与种子败育相关的隐性致死等位基因及其对蓝莓果实发育的影响
- 批准号:
22K05630 - 财政年份:2022
- 资助金额:
$ 46.17万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Investigating the Effect of APOE Alleles on Neuro-Immunity of Human Brain Borders in Normal Aging and Alzheimer's Disease Using Single-Cell Multi-Omics and In Vitro Organoids
使用单细胞多组学和体外类器官研究 APOE 等位基因对正常衰老和阿尔茨海默病中人脑边界神经免疫的影响
- 批准号:
10525070 - 财政年份:2022
- 资助金额:
$ 46.17万 - 项目类别:
Leveraging the Evolutionary History to Improve Identification of Trait-Associated Alleles and Risk Stratification Models in Native Hawaiians
利用进化历史来改进夏威夷原住民性状相关等位基因的识别和风险分层模型
- 批准号:
10689017 - 财政年份:2022
- 资助金额:
$ 46.17万 - 项目类别:














{{item.name}}会员




