Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
基本信息
- 批准号:10655473
- 负责人:
- 金额:$ 36.69万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-09-01 至 2025-06-30
- 项目状态:未结题
- 来源:
- 关键词:AddressAllelesAwarenessBiologyCategoriesCollectionCommunitiesComplementarity Determining RegionsComputer softwareDNADNA sequencingDarknessDataData SetDiseaseEventGeneticGenetic VariationGenomeGenomicsGenotypeGleanGraphHaplotypesHumanHuman GenomeLibrariesLicensingLinkage DisequilibriumMapsMeasuresMethodsModernizationProcessPropertyPublishingResearchResearch PersonnelResourcesSequence AlignmentSiteSoftware ToolsStandardizationSystemTextTrainingVariantVisualizationWorkexperimental studygenetic informationimprovedindexingmodel organismnext generation sequencingnovelopen sourcepreservationreference genometooluptake
项目摘要
PROJECT SUMMARY
Next-generation sequencing is ubiquitous in the study of biology and disease. The first step when analyz-
ing a sequencing dataset is read alignment: the process of determining where each snippet of sequencing
data (“read”) came from with respect to a reference genome. Currently, genomics research is hampered
by the use of a single, arbitrary reference. This fails to account for the vast genetic diversity that exists
among humans and model organisms. Further, it can result in “reference bias,” in turn leading to false or
misleading scientific results.
We propose a three-aim project that addresses the reference bias problem on multiple fronts. In
Aim 1, we will develop new methods and a new software tool called biastools for summarizing and
visualizing reference bias. In Aim 2, we will develop new software and methods that address reference
bias by enabling alignment to multiple representative reference genomes. In one subproject, we will use
genotype imputation to infer a personalized genome with the help of a large panel of reference haplotypes.
In a second subproject, we will use small collections of representative genomes connected in a “flow
graph,” so that reads are ultimately analyzed with respect to the most appropriate reference. The methods
described in both subprojects will be implemented as part of a new software tool called pals. Also as part
of this aim, we will release a software library and tool called jector for transforming alignments from one
reference coordinate system to another. Finally, for Aim 3, we apply a novel text-indexing method called
r-index to enable alignment of reads to large panels of reference haplotypes. We will release the software
as a software library and tool called pandex.
Successful completion of the project will provide the community with new methods and references
that leverage the genetic information we are gleaning from large-scale genotyping studies and from new
long-read assemblies. All software will be made available under an open source license.
项目摘要
下一代测序在生物学和疾病研究中无处不在。分析的第一步-
测序数据集是读段比对:确定测序的每个片段
数据(“读取”)来自参考基因组。目前,基因组学研究受到阻碍
通过使用一个单一的,任意的引用。这并不能解释存在的巨大的遗传多样性
在人类和模式生物中。此外,它可能导致“参考偏差”,进而导致错误或错误的结果。
误导性的科学结果。
我们提出了一个三个目标的项目,解决多方面的参考偏差问题。在
目标1,我们将开发新方法和一种名为biastools的新软件工具,用于总结和
可视化参考偏差。在目标2中,我们将开发新的软件和方法,
通过与多个代表性的参考基因组进行比对来进行偏倚。在一个子项目中,我们将使用
基因型插补,以在大量参考单倍型的帮助下推断个性化基因组。
在第二个子项目中,我们将使用小的代表性基因组集合连接在一个“队列”中,
图表”,以便最终根据最合适的参考对阅读内容进行分析。的方法
这两个分项目中所述的信息和通信技术将作为称为帕尔斯的新软件工具的一部分加以实施。作为一部分,
为了实现这一目标,我们将发布一个名为jector的软件库和工具,
坐标系到另一个。最后,对于目标3,我们应用了一种新的文本索引方法,
r-索引,以使读段能够与大的参考单倍型组进行比对。我们将发布软件
as a software软件library图书馆and tool工具called叫pandex潘德克斯.
该项目的成功完成将为社会提供新的方法和参考
利用我们从大规模基因分型研究中收集的遗传信息,
长读程序集。所有软件都将在开源许可证下提供。
项目成果
期刊论文数量(17)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Reference flow: reducing reference bias using multiple population genomes.
- DOI:10.1186/s13059-020-02229-3
- 发表时间:2021-01-04
- 期刊:
- 影响因子:12.3
- 作者:Chen NC;Solomon B;Mun T;Iyer S;Langmead B
- 通讯作者:Langmead B
Measuring, visualizing and diagnosing reference bias with biastools.
使用偏差工具测量、可视化和诊断参考偏差。
- DOI:10.1101/2023.09.13.557552
- 发表时间:2024
- 期刊:
- 影响因子:0
- 作者:Lin,Mao-Jan;Iyer,Sheila;Chen,Nae-Chyun;Langmead,Ben
- 通讯作者:Langmead,Ben
μ- PBWT: a lightweight r-indexing of the PBWT for storing and querying UK Biobank data.
- DOI:10.1093/bioinformatics/btad552
- 发表时间:2023-09-02
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Movi: a fast and cache-efficient full-text pangenome index.
Movi:快速且高效缓存的全文泛基因组索引。
- DOI:10.1101/2023.11.04.565615
- 发表时间:2024
- 期刊:
- 影响因子:0
- 作者:Zakeri,Mohsen;Brown,NathanielK;Ahmed,OmarY;Gagie,Travis;Langmead,Ben
- 通讯作者:Langmead,Ben
Centrifuger: lossless compression of microbial genomes for efficient and accurate metagenomic sequence classification.
离心机:无损压缩微生物基因组,以实现高效、准确的宏基因组序列分类。
- DOI:10.1101/2023.11.15.567129
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Song,Li;Langmead,Ben
- 通讯作者:Langmead,Ben
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Benjamin Thomas Langmead其他文献
Benjamin Thomas Langmead的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Benjamin Thomas Langmead', 18)}}的其他基金
Methods for sequencing data analysis and archive-scale data science
排序数据分析和档案规模数据科学的方法
- 批准号:
10548746 - 财政年份:2021
- 资助金额:
$ 36.69万 - 项目类别:
Methods for sequencing data analysis and archive-scale data science
排序数据分析和档案规模数据科学的方法
- 批准号:
10322369 - 财政年份:2021
- 资助金额:
$ 36.69万 - 项目类别:
Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
- 批准号:
10242948 - 财政年份:2020
- 资助金额:
$ 36.69万 - 项目类别:
Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
- 批准号:
10057490 - 财政年份:2020
- 资助金额:
$ 36.69万 - 项目类别:
Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
- 批准号:
10443815 - 财政年份:2020
- 资助金额:
$ 36.69万 - 项目类别:
相似海外基金
Linkage of HIV amino acid variants to protective host alleles at CHD1L and HLA class I loci in an African population
非洲人群中 HIV 氨基酸变异与 CHD1L 和 HLA I 类基因座的保护性宿主等位基因的关联
- 批准号:
502556 - 财政年份:2024
- 资助金额:
$ 36.69万 - 项目类别:
Olfactory Epithelium Responses to Human APOE Alleles
嗅觉上皮对人类 APOE 等位基因的反应
- 批准号:
10659303 - 财政年份:2023
- 资助金额:
$ 36.69万 - 项目类别:
Deeply analyzing MHC class I-restricted peptide presentation mechanistics across alleles, pathways, and disease coupled with TCR discovery/characterization
深入分析跨等位基因、通路和疾病的 MHC I 类限制性肽呈递机制以及 TCR 发现/表征
- 批准号:
10674405 - 财政年份:2023
- 资助金额:
$ 36.69万 - 项目类别:
An off-the-shelf tumor cell vaccine with HLA-matching alleles for the personalized treatment of advanced solid tumors
具有 HLA 匹配等位基因的现成肿瘤细胞疫苗,用于晚期实体瘤的个性化治疗
- 批准号:
10758772 - 财政年份:2023
- 资助金额:
$ 36.69万 - 项目类别:
Identifying genetic variants that modify the effect size of ApoE alleles on late-onset Alzheimer's disease risk
识别改变 ApoE 等位基因对迟发性阿尔茨海默病风险影响大小的遗传变异
- 批准号:
10676499 - 财政年份:2023
- 资助金额:
$ 36.69万 - 项目类别:
New statistical approaches to mapping the functional impact of HLA alleles in multimodal complex disease datasets
绘制多模式复杂疾病数据集中 HLA 等位基因功能影响的新统计方法
- 批准号:
2748611 - 财政年份:2022
- 资助金额:
$ 36.69万 - 项目类别:
Studentship
Genome and epigenome editing of induced pluripotent stem cells for investigating osteoarthritis risk alleles
诱导多能干细胞的基因组和表观基因组编辑用于研究骨关节炎风险等位基因
- 批准号:
10532032 - 财政年份:2022
- 资助金额:
$ 36.69万 - 项目类别:
Recessive lethal alleles linked to seed abortion and their effect on fruit development in blueberries
与种子败育相关的隐性致死等位基因及其对蓝莓果实发育的影响
- 批准号:
22K05630 - 财政年份:2022
- 资助金额:
$ 36.69万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Investigating the Effect of APOE Alleles on Neuro-Immunity of Human Brain Borders in Normal Aging and Alzheimer's Disease Using Single-Cell Multi-Omics and In Vitro Organoids
使用单细胞多组学和体外类器官研究 APOE 等位基因对正常衰老和阿尔茨海默病中人脑边界神经免疫的影响
- 批准号:
10525070 - 财政年份:2022
- 资助金额:
$ 36.69万 - 项目类别:
Leveraging the Evolutionary History to Improve Identification of Trait-Associated Alleles and Risk Stratification Models in Native Hawaiians
利用进化历史来改进夏威夷原住民性状相关等位基因的识别和风险分层模型
- 批准号:
10689017 - 财政年份:2022
- 资助金额:
$ 36.69万 - 项目类别: