Novel Statistical methods for extracting information from genetic data
从遗传数据中提取信息的新统计方法
基本信息
- 批准号:2744324
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:英国
- 项目类别:Studentship
- 财政年份:2022
- 资助国家:英国
- 起止时间:2022 至 无数据
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
This mathematical modelling project involves a combination of analysis of genetic data taken from plants and the development of novel mathematical and statistical tools to extract information from the data.This project involves applying of ideas from statistical physics and information theory to data on genotype-phenotype variations. In both disciplines, the concepts of entropy and the distribution of microstates is useful in theories which describe macroscopic quantities. We consider a sample from the population, which is then ranked by phenotype (that is, an observable characteristic, such as height or weight). By analysing the respective order of genotypes (AA,Aa,aa), we derive an effective field which quantifies how strongly one particular single nucleotide polymorphism (SNP - a minor genetic mutation) influences a particular phenotype. This provides an alternative method for analysing genotype-phenotype interactions, which is more powerful than the classic genome wide association studies (GWAS), and does not rely on the statistical assumptions made by Fisher in 1901.In this PhD project this new method will be applied to Arabidopsis to understand how the genetic states of individuals influence observed phenotypes such as how ions uptake in Arabidopsis influence plant growth. Data comes from ionomic studies performed by Sian Bray, who studies the transport of a range of ions in plants. We will typically be concerned with three bi-allelic states and the impact of these states on a continuous range of phenotype values. We will then consider the effects of multiple genes on a single phenotype.The method quantifies the strength of the genotype-phenotype dependency; whilst many SNPs will have no significant effect on phenotype, some will, and to varying extents. We will investigate linkage disequilibrium, that is, how the strength of interaction varies as one moves along the chromosomes to nearby SNPs. Typically, where there is a SNP with a strong phenotype effect, other SNPs which are close by on the chromosome are also seen to have significant impact on phenotype. We will investigate the range of this interaction, and construct models to explain how rapidly this decays with distance from the most significant SNP. Whilst the ultimate goal is to interpret the plant-data, and other real data sets, new methods will be tested on synthetic data, in order to test their deductive power. We will investigate generalisations of the model, for example, the phenotypes may only available as `binned' data, that is numbers of each genotype (AA,Aa,aa) in a sequence of ranges (eg 10-20, 20-30, 30-40 etc). In this case the data has less 'power' and information, so might be less likely to be flagged as significant, but since it is smoother, it may give a simpler form for the field-strength.We will also consider multi-allelic systems - that is, where more than three alleles are present. The mathematical analysis of such higher-dimensional systems is significantly more complicated, as the number of possible arrangements of the ranked list becomes increasingly large.
这个数学模型项目包括分析从植物中提取的遗传数据,并开发新的数学和统计工具,从数据中提取信息。这个项目涉及将统计物理学和信息论的思想应用于基因型-表型变异的数据。在这两个学科中,熵和微观状态分布的概念在描述宏观量的理论中是有用的。我们考虑从人群中抽取一个样本,然后根据表型(即一个可观察到的特征,如身高或体重)对其进行排名。通过分析基因型(AA,Aa,aa)各自的顺序,我们得到一个有效的字段,量化如何强烈一个特定的单核苷酸多态性(SNP -一个小的遗传突变)影响一个特定的表型。这为分析基因型-表型相互作用提供了一种新的方法,它比经典的全基因组关联研究(GWAS)更强大,并且不依赖于Fisher在1901年提出的统计假设。在本博士项目中,这种新方法将应用于拟南芥,以了解个体的遗传状态如何影响观察到的表型,例如拟南芥中的离子吸收如何影响植物生长。数据来自Sian Bray进行的离子学研究,他研究了植物中一系列离子的运输。我们将典型地关注三个双等位基因状态和这些状态对连续范围的表型值的影响。然后,我们将考虑多个基因对单个表型的影响,该方法量化了基因型-表型依赖性的强度;虽然许多SNP对表型没有显著影响,但有些SNP会对表型产生不同程度的影响。我们将研究连锁不平衡,即当一个人沿着染色体移动到附近的SNP时,相互作用的强度如何变化。通常,在存在具有强表型效应的SNP的情况下,也可以看到染色体上邻近的其他SNP对表型具有显著影响。我们将研究这种相互作用的范围,并构建模型来解释这种相互作用随着与最重要的SNP的距离而衰减的速度。虽然最终目标是解释工厂数据和其他真实的数据集,但新方法将在合成数据上进行测试,以测试其演绎能力。我们将研究模型的一般化,例如,表型可能仅作为“分组”数据可用,即在一系列范围(例如10-20,20-30,30-40等)中的每种基因型(AA,Aa,aa)的数量。在这种情况下,数据的“功效”和信息较少,因此可能不太可能被标记为显著,但由于它更平滑,它可能给出更简单的场强形式。我们还将考虑多等位基因系统-即存在三个以上等位基因的系统。这种高维系统的数学分析明显更加复杂,因为排序列表的可能排列的数量变得越来越大。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
其他文献
Internet-administered, low-intensity cognitive behavioral therapy for parents of children treated for cancer: A feasibility trial (ENGAGE).
针对癌症儿童父母的互联网管理、低强度认知行为疗法:可行性试验 (ENGAGE)。
- DOI:
10.1002/cam4.5377 - 发表时间:
2023-03 - 期刊:
- 影响因子:4
- 作者:
- 通讯作者:
Differences in child and adolescent exposure to unhealthy food and beverage advertising on television in a self-regulatory environment.
在自我监管的环境中,儿童和青少年在电视上接触不健康食品和饮料广告的情况存在差异。
- DOI:
10.1186/s12889-023-15027-w - 发表时间:
2023-03-23 - 期刊:
- 影响因子:4.5
- 作者:
- 通讯作者:
The association between rheumatoid arthritis and reduced estimated cardiorespiratory fitness is mediated by physical symptoms and negative emotions: a cross-sectional study.
类风湿性关节炎与估计心肺健康降低之间的关联是由身体症状和负面情绪介导的:一项横断面研究。
- DOI:
10.1007/s10067-023-06584-x - 发表时间:
2023-07 - 期刊:
- 影响因子:3.4
- 作者:
- 通讯作者:
ElasticBLAST: accelerating sequence search via cloud computing.
ElasticBLAST:通过云计算加速序列搜索。
- DOI:
10.1186/s12859-023-05245-9 - 发表时间:
2023-03-26 - 期刊:
- 影响因子:3
- 作者:
- 通讯作者:
Amplified EQCM-D detection of extracellular vesicles using 2D gold nanostructured arrays fabricated by block copolymer self-assembly.
使用通过嵌段共聚物自组装制造的 2D 金纳米结构阵列放大 EQCM-D 检测细胞外囊泡。
- DOI:
10.1039/d2nh00424k - 发表时间:
2023-03-27 - 期刊:
- 影响因子:9.7
- 作者:
- 通讯作者:
的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('', 18)}}的其他基金
An implantable biosensor microsystem for real-time measurement of circulating biomarkers
用于实时测量循环生物标志物的植入式生物传感器微系统
- 批准号:
2901954 - 财政年份:2028
- 资助金额:
-- - 项目类别:
Studentship
Exploiting the polysaccharide breakdown capacity of the human gut microbiome to develop environmentally sustainable dishwashing solutions
利用人类肠道微生物群的多糖分解能力来开发环境可持续的洗碗解决方案
- 批准号:
2896097 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
A Robot that Swims Through Granular Materials
可以在颗粒材料中游动的机器人
- 批准号:
2780268 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Likelihood and impact of severe space weather events on the resilience of nuclear power and safeguards monitoring.
严重空间天气事件对核电和保障监督的恢复力的可能性和影响。
- 批准号:
2908918 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Proton, alpha and gamma irradiation assisted stress corrosion cracking: understanding the fuel-stainless steel interface
质子、α 和 γ 辐照辅助应力腐蚀开裂:了解燃料-不锈钢界面
- 批准号:
2908693 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Field Assisted Sintering of Nuclear Fuel Simulants
核燃料模拟物的现场辅助烧结
- 批准号:
2908917 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Assessment of new fatigue capable titanium alloys for aerospace applications
评估用于航空航天应用的新型抗疲劳钛合金
- 批准号:
2879438 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Developing a 3D printed skin model using a Dextran - Collagen hydrogel to analyse the cellular and epigenetic effects of interleukin-17 inhibitors in
使用右旋糖酐-胶原蛋白水凝胶开发 3D 打印皮肤模型,以分析白细胞介素 17 抑制剂的细胞和表观遗传效应
- 批准号:
2890513 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Understanding the interplay between the gut microbiome, behavior and urbanisation in wild birds
了解野生鸟类肠道微生物组、行为和城市化之间的相互作用
- 批准号:
2876993 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
相似海外基金
Liver Fibrosis: Leveraging Novel Statistical Methods to Determine Optimal Screening Strategy for People Living with Type 2 Diabetes
肝纤维化:利用新的统计方法确定 2 型糖尿病患者的最佳筛查策略
- 批准号:
488421 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Operating Grants
SCH: Novel and Interpretable Statistical Learning for Brain Images in AD/ADRDs
SCH:针对 AD/ADRD 大脑图像的新颖且可解释的统计学习
- 批准号:
10816764 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Novel statistical genetics methods to unravel polygenic interactions in complex traits
揭示复杂性状中多基因相互作用的新统计遗传学方法
- 批准号:
10713965 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Novel and Rigorous Statistical Learning and Inference for Comparative Effectiveness Research with Complex Data
复杂数据比较有效性研究的新颖而严格的统计学习和推理
- 批准号:
10635323 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Novel statistical methods for data with non-Euclidean geometric structure
非欧几何结构数据的新颖统计方法
- 批准号:
DP220102232 - 财政年份:2022
- 资助金额:
-- - 项目类别:
Discovery Projects
Novel Statistical Methods for Oral Microbiome Data Analysis
口腔微生物组数据分析的新统计方法
- 批准号:
10525318 - 财政年份:2022
- 资助金额:
-- - 项目类别:
Resolving single-cell analysis challenges via data-driven decision frameworks and novel statistical methods
通过数据驱动的决策框架和新颖的统计方法解决单细胞分析挑战
- 批准号:
10707308 - 财政年份:2022
- 资助金额:
-- - 项目类别:
Accelerating biomarker development through novel statistical methods for analyzing phase III/IV studies
通过分析 III/IV 期研究的新统计方法加速生物标志物开发
- 批准号:
10568744 - 财政年份:2022
- 资助金额:
-- - 项目类别:
Novel Statistical Data Integration Methods for Multi-View Data
多视图数据的新颖统计数据集成方法
- 批准号:
548103-2020 - 财政年份:2022
- 资助金额:
-- - 项目类别:
Postgraduate Scholarships - Doctoral
Novel Statistical Integration Methods for Multi-View Data
多视图数据的新颖统计集成方法
- 批准号:
RGPIN-2022-03034 - 财政年份:2022
- 资助金额:
-- - 项目类别:
Discovery Grants Program - Individual