Identification of novel pathogenic tandem repeat expansions using long read sequencing
使用长读长测序鉴定新型致病性串联重复扩增
基本信息
- 批准号:10468668
- 负责人:
- 金额:$ 62.23万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2018
- 资助国家:美国
- 起止时间:2018-09-30 至 2023-07-31
- 项目状态:已结题
- 来源:
- 关键词:AdultAlgorithmsAllelesAtaxiaBaseline SurveysBiological AssayBiological SciencesCAG repeatClinical ResearchCodeCollaborationsCollectionCustomDNADNA SequenceDNA Sequence AlterationDataDiploidyDiseaseEtiologyEventFamilyFragile X SyndromeGenerationsGeneticGenetic AnticipationGenetic PolymorphismGenetic studyGenomeGenotypeHaplotypesHigh-Throughput Nucleotide SequencingHumanHuman GenomeHuntington DiseaseHybridsHypermethylationIndividualInheritedLengthMethodsMutationMyotonic DystrophyNeurodegenerative DisordersNeuromuscular DiseasesPathogenicityPatientsPhasePhenocopyPopulationPopulation ControlRepetitive SequenceResearch PersonnelResourcesSamplingShotgunsSurveysSyndromeTandem Repeat SequencesTechnologyTrinucleotide RepeatsVariantbasecohortcostexome sequencingfallsgenetic linkagegenetic pedigreegenome sequencinggenome-widehuman diseasenervous system disordernovelpolyglutaminesingle molecule real time sequencingwhole genome
项目摘要
Large expansions of tandemly repeated (TR) DNA sequences (eg. polyCAG) are known to underlie >30
different human neurological diseases, including Huntington’s disease, Fragile X, and Myotonic dystrophy. The
vast majority of known TR expansions are observed in adult onset degenerative neuromuscular disorders and
ataxia syndromes. Although significant recent advances have been made that enable TRs to be genotyped
from high-throughput sequencing, the methods currently used to sequence human genomes are unable to
identify TR expansions, as they only look at very short fragments of DNA. However, recent advances and
falling costs of sequencing technologies like Pacific Biosciences SMRT sequencing that generates much
longer reads hold the promise to detect previously undetected repeat expansions. Here we will perform whole
genome sequencing using combined Pacific Biosciences (PacBio) on a selected cohort of patients with
unsolved ataxias and Huntington’s-like disease, in which all known TR expansions and other mutations have
been excluded. Many of these samples come from multi-generation pedigrees with dominant inheritance that
show genetic anticipation and linkage information that localizes the pathogenic mutation to a subset of the
genome, thus representing an optimized cohort in which to search for unknown pathogenic TR expansions.
In order to be able to identify TR expansions underlying human disease, it is first necessary to
characterize the spectrum of tandem repeat variation within the normal population. Using genomes of 26
individuals sequenced with PacBio, we will use a novel algorithms we have developed called MsPac and
PacMONSTR, to generate a survey of the size distribution of all TRs in the normal human genome. This will be
supplemented by TR genotypes generated by HipSTR from 1,500 Illumina genomes. This information will
provide a baseline survey of TR variation that will allow us identify pathogenic TR expansions in samples with
ataxia and neurodegenerative disease, and as we show, also enables us to identify candidate TRs that are
likely to expand in human disease. Using this approach, we will first perform targeted genotyping of four
polyglutamine TRs that show strong signatures of instability in 250 samples with SCA/HD-phenocopies.
We will next perform PacBio genome sequencing of 100 individuals from 40 pedigrees with unsolved
ataxia/HD-like disease, using a selected cohort of samples in which all known genetic and environmental
causes have already been excluded. We hypothesize that the mutation in some of these pedigrees will be
novel expanded TRs that have remained invisible to previous short-read approaches. We will search for novel
TR expansions not observed in our control population.
Using this optimized cohort and novel hybrid long-read sequencing approach, this proposal will lead to
the identification of novel pathogenic TR expansions that underlie human neurological diseases, yielding
significant advances in our understanding of the etiology of ataxia and neurodegenerative disease.
串联重复(TR)DNA序列的大扩增(例如,polyCAG)已知是>30
不同的人类神经系统疾病,包括亨廷顿病、脆性X染色体和强直性肌营养不良。的
绝大多数已知的TR扩张在成人发病的退行性神经肌肉疾病中观察到,
共济失调综合征虽然最近取得了重大进展,使TR的基因分型
从高通量测序来看,目前用于人类基因组测序的方法无法
鉴定TR扩增,因为它们只观察非常短的DNA片段。然而,最近的进展和
太平洋生物科学公司SMRT测序等测序技术的成本下降,
较长的读段有希望检测先前未检测到的重复扩增。在这里,我们将执行整个
使用联合太平洋生物科学公司(PacBio)对选定的患者队列进行基因组测序,
未解决的共济失调和亨廷顿样疾病,其中所有已知的TR扩展和其他突变都具有
被排除在外。这些样本中的许多来自多代系谱,具有显性遗传,
显示遗传预测和连锁信息,其将致病性突变定位于
基因组,因此代表了一个优化的队列,在其中搜索未知的致病性TR扩增。
为了能够鉴定人类疾病潜在的TR扩增,首先需要
表征正常人群中串联重复序列变异的谱。使用26个基因组
与PacBio测序的个体,我们将使用一种新的算法,我们已经开发了所谓的MsPac,
PacMONSTR,以生成正常人类基因组中所有TR的大小分布的调查。这将是
由来自1,500个Illumina基因组的HipSTR产生的TR基因型补充。这些信息将
提供TR变异的基线调查,使我们能够识别样本中的致病性TR扩增,
共济失调和神经退行性疾病,正如我们所展示的,也使我们能够识别候选TR,
很可能在人类疾病中传播。使用这种方法,我们将首先对四个基因进行靶向基因分型。
多聚谷氨酰胺TR在250个SCA/HD表型样本中显示出强烈的不稳定性特征。
接下来,我们将对来自40个家系的100名个体进行PacBio基因组测序,
共济失调/HD样疾病,使用选定的样本队列,其中所有已知的遗传和环境因素
原因已经排除。我们假设这些家系中的一些突变将是
新的扩展TR,这些TR对以前的短读方法仍然不可见。我们将寻找新的
在我们的对照人群中未观察到TR扩增。
使用这种优化的队列和新的混合长读序测序方法,该提议将导致
鉴定了人类神经系统疾病的新的致病性TR扩增,
我们对共济失调和神经退行性疾病病因学的理解有了重大进展。
项目成果
期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A phenome-wide association study of methylated GC-rich repeats identifies a GCC repeat expansion in AFF3 as a significant cause of intellectual disability.
一项针对甲基化 GC 重复序列的全表组关联研究发现,AFF3 中的 GCC 重复序列扩展是智力障碍的一个重要原因。
- DOI:10.1101/2023.05.03.23289461
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Jadhav,Bharati;Garg,Paras;vanVugt,JokeJFA;Ibanez,Kristina;Gagliardi,Delia;Lee,William;Shadrina,Mariya;Mokveld,Tom;Dolzhenko,Egor;Martin-Trujillo,Alejandro;Gies,ScottL;Rocca,Clarissa;Barbosa,Mafalda;Jain,Miten;Lahiri,Nayan
- 通讯作者:Lahiri,Nayan
POPULATION FREQUENCY OF REPEAT EXPANSIONS INDICATES INCREASED DISEASE PREVALENCE ESTIMATES ACROSS DIFFERENT POPULATIONS.
人口重复扩张的频率表明不同人群的疾病患病率估计有所增加。
- DOI:10.1101/2023.07.03.23292162
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Ibañez,Kristina;Jadhav,Bharati;Facchini,Stefano;Garg,Paras;Zanovello,Matteo;Martin-Trujillo,Alejandro;Gies,ScottJ;Deforie,ValentinaGalassi;Gagliardi,Delia;Hensman,Davina;Moutsianas,Loukas;Shoai,Maryam;GenomicsEnglandResearchCo
- 通讯作者:GenomicsEnglandResearchCo
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Andrew James Sharp其他文献
Andrew James Sharp的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Andrew James Sharp', 18)}}的其他基金
A comprehensive study of tandem repeat variation as a cause of Alzheimer's disease
串联重复变异作为阿尔茨海默病病因的综合研究
- 批准号:
10585034 - 财政年份:2023
- 资助金额:
$ 62.23万 - 项目类别:
Investigating tandem repeat expansions as a cause of schizophrenia
研究串联重复扩增是精神分裂症的原因
- 批准号:
10426348 - 财政年份:2021
- 资助金额:
$ 62.23万 - 项目类别:
Investigating tandem repeat expansions as a cause of schizophrenia
研究串联重复扩增是精神分裂症的原因
- 批准号:
10301434 - 财政年份:2021
- 资助金额:
$ 62.23万 - 项目类别:
Investigating tandem repeat variation as a cause of Alzheimer's disease from exome sequencing data
根据外显子组测序数据研究串联重复变异是阿尔茨海默氏病的原因
- 批准号:
10337187 - 财政年份:2021
- 资助金额:
$ 62.23万 - 项目类别:
Investigating tandem repeat variation as a cause of Alzheimer's disease from exome sequencing data
根据外显子组测序数据研究串联重复变异是阿尔茨海默氏病的原因
- 批准号:
10091108 - 财政年份:2021
- 资助金额:
$ 62.23万 - 项目类别:
Identification of novel pathogenic tandem repeat expansions using long read sequencing
使用长读长测序鉴定新型致病性串联重复扩增
- 批准号:
9791002 - 财政年份:2018
- 资助金额:
$ 62.23万 - 项目类别:
Identification of novel pathogenic tandem repeat expansions using long read sequencing
使用长读长测序鉴定新型致病性串联重复扩增
- 批准号:
9983189 - 财政年份:2018
- 资助金额:
$ 62.23万 - 项目类别:
Identification of novel pathogenic tandem repeat expansions using long read sequencing
使用长读长测序鉴定新型致病性串联重复扩增
- 批准号:
10225433 - 财政年份:2018
- 资助金额:
$ 62.23万 - 项目类别:
Epigenetic Determinants of Major Depression: a Monozygotic Discordant Twin Study
重度抑郁症的表观遗传决定因素:同卵不一致双胞胎研究
- 批准号:
8503945 - 财政年份:2013
- 资助金额:
$ 62.23万 - 项目类别:
Epigenetic Determinants of Major Depression: a Monozygotic Discordant Twin Study
重度抑郁症的表观遗传决定因素:同卵不一致双胞胎研究
- 批准号:
8915745 - 财政年份:2013
- 资助金额:
$ 62.23万 - 项目类别:
相似海外基金
DMS-EPSRC: Asymptotic Analysis of Online Training Algorithms in Machine Learning: Recurrent, Graphical, and Deep Neural Networks
DMS-EPSRC:机器学习中在线训练算法的渐近分析:循环、图形和深度神经网络
- 批准号:
EP/Y029089/1 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Research Grant
CAREER: Blessing of Nonconvexity in Machine Learning - Landscape Analysis and Efficient Algorithms
职业:机器学习中非凸性的祝福 - 景观分析和高效算法
- 批准号:
2337776 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Continuing Grant
CAREER: From Dynamic Algorithms to Fast Optimization and Back
职业:从动态算法到快速优化并返回
- 批准号:
2338816 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Continuing Grant
CAREER: Structured Minimax Optimization: Theory, Algorithms, and Applications in Robust Learning
职业:结构化极小极大优化:稳健学习中的理论、算法和应用
- 批准号:
2338846 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Continuing Grant
CRII: SaTC: Reliable Hardware Architectures Against Side-Channel Attacks for Post-Quantum Cryptographic Algorithms
CRII:SaTC:针对后量子密码算法的侧通道攻击的可靠硬件架构
- 批准号:
2348261 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Standard Grant
CRII: AF: The Impact of Knowledge on the Performance of Distributed Algorithms
CRII:AF:知识对分布式算法性能的影响
- 批准号:
2348346 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Standard Grant
CRII: CSR: From Bloom Filters to Noise Reduction Streaming Algorithms
CRII:CSR:从布隆过滤器到降噪流算法
- 批准号:
2348457 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Standard Grant
EAGER: Search-Accelerated Markov Chain Monte Carlo Algorithms for Bayesian Neural Networks and Trillion-Dimensional Problems
EAGER:贝叶斯神经网络和万亿维问题的搜索加速马尔可夫链蒙特卡罗算法
- 批准号:
2404989 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Standard Grant
CAREER: Efficient Algorithms for Modern Computer Architecture
职业:现代计算机架构的高效算法
- 批准号:
2339310 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Continuing Grant
CAREER: Improving Real-world Performance of AI Biosignal Algorithms
职业:提高人工智能生物信号算法的实际性能
- 批准号:
2339669 - 财政年份:2024
- 资助金额:
$ 62.23万 - 项目类别:
Continuing Grant