Statistical methods for population and family-based whole-genome sequence data
基于人群和家系的全基因组序列数据的统计方法
基本信息
- 批准号:8694183
- 负责人:
- 金额:$ 35.35万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2014
- 资助国家:美国
- 起止时间:2014-04-25 至 2019-01-31
- 项目状态:已结题
- 来源:
- 关键词:AccountingAddressAffectAlgorithmsBase SequenceBiologyChromosome MappingChromosomesCollaborationsCommunitiesComplexComputer softwareComputing MethodologiesDataData AnalysesData SetDiseaseDocumentationEvaluationFamilyFamily memberFutureGenetic ProgrammingGenomicsGenotypeGoalsHaplotypesHuman GenomeIndividualLarge-Scale SequencingLinkage DisequilibriumMethodologyMethodsModelingMutationNatureNuclearParentsPhasePopulationPopulation AnalysisPopulation ControlPublishingRelative (related person)ResearchResearch DesignSequence AnalysisSiteSource CodeSpeedStatistical MethodsStratificationStretchingTechnologyTestingUncertaintyVariantWalkingbasecostdesigndisease phenotypefallsfamily structureflexibilitygenome sequencinggenome wide association studyimprovedinnovationinterestmarkov modelnext generation sequencingnovelopen sourcepopulation basedportabilityprogramspublic health relevancerare variantsimulationsoftware developmenttooltraittransmission processvector
项目摘要
DESCRIPTION (provided by applicant):
Emerging sequencing technologies have made whole-genome sequencing become available for researches to study various phenotypes/diseases of interest, particularly focusing on rare variants sites. Although the first batch of sequencing projects has mainly focused on the analysis of unrelated individuals, numerous sequencing studies including related individuals have been carried out or launched recently as the sequencing cost reduces rapidly. However, the methodologies for analyzing family-based sequence data are largely falling behind partially due to the complexity of family structures and computational barrier. In this study, our primary goals are to efficiently and accurately infer individual genotypes and haplotypes - the key component of any sequencing project - by combining information from both family and population levels, and to study how differential sequencing errors will affect downstream association analysis. To achieve these goals, we propose specific aims as follows: 1) We will propose a novel statistical framework for genotyping calling and haplotype inference of sequence data including relative individuals. The new method takes advantages of both short stretches shared between unrelated individuals and long stretches shared between family members in a computationally feasible manner while retaining a high degree of accuracy via the synergy between two classic approaches: hidden Markov model (HMM) for linkage disequilibrium information and Lander-Green algorithm for inheritance vectors; 2) We will develop an exact algorithm for HMM computation to speed up a class of widely use genetics programs, including the method developed in Aim 1, without any sacrifice of accuracy; 3) We will assess the impact of sequencing errors on family-based association methods for rare variants and use the intrinsic stochastic nature of the proposed methods in Aim 1 to reduce the false positives under a framework of multiple imputation; 4) We will test and recalibrate our developed methods in collaboration with ongoing sequencing projects and systematically investigate different study designs. Successful completion of these aims will yield state-of-the-art statistical methods and software, which will facilitate the fast growing sequencing projects including family members and guide the design and analysis of future studies.
描述(由申请人提供):
新兴的测序技术使全基因组测序可供研究人员研究各种感兴趣的表型/疾病,特别是关注罕见的变异位点。虽然第一批测序项目主要集中在无关个体的分析上,但随着测序成本的迅速降低,包括相关个体在内的大量测序研究近期已经开展或启动。然而,由于家族结构的复杂性和计算障碍,分析基于家族的序列数据的方法在很大程度上落后。在这项研究中,我们的主要目标是通过结合来自家族和群体水平的信息,高效准确地推断个体基因型和单倍型(任何测序项目的关键组成部分),并研究差异测序错误将如何影响下游关联分析。为了实现这些目标,我们提出以下具体目标:1)我们将提出一种新颖的统计框架,用于包括相关个体的序列数据的基因分型调用和单倍型推断。新方法以计算上可行的方式利用了无关个体之间共享的短序列和家庭成员之间共享的长序列,同时通过两种经典方法之间的协同作用保持了高精度:用于连锁不平衡信息的隐马尔可夫模型(HMM)和用于遗传向量的Lander-Green算法; 2)我们将开发一种精确的 HMM 计算算法,以加速一类广泛使用的遗传学程序,包括目标 1 中开发的方法,而不会牺牲任何准确性; 3)我们将评估测序错误对罕见变异的基于家族的关联方法的影响,并利用目标1中提出的方法的内在随机性来减少多重插补框架下的误报; 4) 我们将与正在进行的测序项目合作测试和重新校准我们开发的方法,并系统地研究不同的研究设计。成功完成这些目标将产生最先进的统计方法和软件,这将促进包括家庭成员在内的快速发展的测序项目,并指导未来研究的设计和分析。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Wei Chen其他文献
Wei Chen的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Wei Chen', 18)}}的其他基金
An ensemble deep learning model for tumor bud detection and risk stratification in colorectal carcinoma.
用于结直肠癌肿瘤芽检测和风险分层的集成深度学习模型。
- 批准号:
10564824 - 财政年份:2023
- 资助金额:
$ 35.35万 - 项目类别:
Establishing translational neuroimaging tools for quantitative assessment of energy metabolism and metabolic reprogramming in healthy and diseased human brain at 7T
建立转化神经影像工具,用于定量评估 7T 健康和患病人脑的能量代谢和代谢重编程
- 批准号:
10714863 - 财政年份:2023
- 资助金额:
$ 35.35万 - 项目类别:
SCH: New Advanced Machine Learning Framework for Mining Heterogeneous Ocular Data to Accelerate
SCH:新的先进机器学习框架,用于挖掘异构眼部数据以加速
- 批准号:
10601180 - 财政年份:2022
- 资助金额:
$ 35.35万 - 项目类别:
SCH: New Advanced Machine Learning Framework for Mining Heterogeneous Ocular Data to Accelerate
SCH:新的先进机器学习框架,用于挖掘异构眼部数据以加速
- 批准号:
10665804 - 财政年份:2022
- 资助金额:
$ 35.35万 - 项目类别:
Cellular Interactions in Vascular Calcification of Chronic Kidney Disease
慢性肾病血管钙化中的细胞相互作用
- 批准号:
10525401 - 财政年份:2022
- 资助金额:
$ 35.35万 - 项目类别:
Console Replacement and Upgrade of 9.4 Tesla Animal Instrument
9.4特斯拉动物仪控制台更换升级
- 批准号:
10414184 - 财政年份:2022
- 资助金额:
$ 35.35万 - 项目类别:
Deep-learning-based prediction of AMD and its progression with GWAS and fundus image data
基于 GWAS 和眼底图像数据的 AMD 及其进展的深度学习预测
- 批准号:
10226322 - 财政年份:2020
- 资助金额:
$ 35.35万 - 项目类别:
Advancing simultaneous fMRI-multiphoton imaging technique to study brain function and connectivity across different scales at ultrahigh field
推进同步功能磁共振成像多光子成像技术,研究超高场下不同尺度的大脑功能和连接性
- 批准号:
10043972 - 财政年份:2020
- 资助金额:
$ 35.35万 - 项目类别:
Advancing simultaneous fMRI-multiphoton imaging technique to study brain function and connectivity across different scales at ultrahigh field
推进同步功能磁共振成像多光子成像技术,研究超高场下不同尺度的大脑功能和连接性
- 批准号:
10268184 - 财政年份:2020
- 资助金额:
$ 35.35万 - 项目类别:
Advancing simultaneous fMRI-multiphoton imaging technique to study brain function and connectivity across different scales at ultrahigh field
推进同步功能磁共振成像多光子成像技术,研究超高场下不同尺度的大脑功能和连接性
- 批准号:
10463737 - 财政年份:2020
- 资助金额:
$ 35.35万 - 项目类别:
相似海外基金
Rational design of rapidly translatable, highly antigenic and novel recombinant immunogens to address deficiencies of current snakebite treatments
合理设计可快速翻译、高抗原性和新型重组免疫原,以解决当前蛇咬伤治疗的缺陷
- 批准号:
MR/S03398X/2 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Fellowship
Re-thinking drug nanocrystals as highly loaded vectors to address key unmet therapeutic challenges
重新思考药物纳米晶体作为高负载载体以解决关键的未满足的治疗挑战
- 批准号:
EP/Y001486/1 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Research Grant
CAREER: FEAST (Food Ecosystems And circularity for Sustainable Transformation) framework to address Hidden Hunger
职业:FEAST(食品生态系统和可持续转型循环)框架解决隐性饥饿
- 批准号:
2338423 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Continuing Grant
Metrology to address ion suppression in multimodal mass spectrometry imaging with application in oncology
计量学解决多模态质谱成像中的离子抑制问题及其在肿瘤学中的应用
- 批准号:
MR/X03657X/1 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Fellowship
CRII: SHF: A Novel Address Translation Architecture for Virtualized Clouds
CRII:SHF:一种用于虚拟化云的新型地址转换架构
- 批准号:
2348066 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Standard Grant
BIORETS: Convergence Research Experiences for Teachers in Synthetic and Systems Biology to Address Challenges in Food, Health, Energy, and Environment
BIORETS:合成和系统生物学教师的融合研究经验,以应对食品、健康、能源和环境方面的挑战
- 批准号:
2341402 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Standard Grant
The Abundance Project: Enhancing Cultural & Green Inclusion in Social Prescribing in Southwest London to Address Ethnic Inequalities in Mental Health
丰富项目:增强文化
- 批准号:
AH/Z505481/1 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Research Grant
ERAMET - Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
ERAMET - 快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10107647 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
EU-Funded
Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10106221 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
EU-Funded
Recite: Building Research by Communities to Address Inequities through Expression
背诵:社区开展研究,通过表达解决不平等问题
- 批准号:
AH/Z505341/1 - 财政年份:2024
- 资助金额:
$ 35.35万 - 项目类别:
Research Grant