Collaborative Research: RESEARCH-PGR: Predicting Phenotype from Molecular Profiles with Deep Learning: Topological Data Analysis to Address a Grand Challenge in the Plant Sciences
合作研究:RESEARCH-PGR:利用深度学习从分子概况预测表型:拓扑数据分析应对植物科学的重大挑战
基本信息
- 批准号:2310356
- 负责人:
- 金额:$ 52.61万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-07-01 至 2027-06-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Organisms are a consequence of information embedded in their genome expressed through molecular processes. Sequencing technologies allow biologists to extract nearly all information content from the genome. However, measuring what an organism is has not advanced as far as genomic sequencing: unlike the genome, it is not yet possible to measure the totality of information embedded in the organismal form. If all the information that is contained within organisms could be extracted, a model could be developed that would address one of the Grand Challenges in biology, the ability to predict what an organism is from its genomic information. In this project, mathematical approaches that have not been fully explored in biology will be used to extract information in data by measuring its structure. This field of mathematics has a motto: that all shape is data, and all data have shape. By measuring the shapes and gene expression patterns of leaves, the project will treat them as data from which embedded information can be extracted. Deep learning methods will then be used to predict the shapes of leaves from their gene expression profiles. As part of the connection between the project and its impact to society, students from both the U.S. and México will help analyze the data through Plants&Python, a bilingual, freely available curriculum initiated as a means to bring together plant biologists who have never coded and data scientists new to plant science, with groups that comprise U.S. agriculture. Using X-ray Computed Tomography (CT) to measure plant morphology and transcriptome profiling (RNA-seq) to measure gene expression, the project will use the Euler Characteristic Transform (ECT) and the Mapper algorithm, two Topological Data Analysis (TDA) techniques, to extract the total information embedded in the leaf morphology of Arabidopsis accessions with contrasting developmental reproducibility. The ECT is mathematically proven to distinguish any object from any other, and the Mapper algorithm is used to visualize underlying data structures as a graph. Specific aims include: 1) using the ECT to measure the total information embedded in leaf shape and benchmarking against traditional methods to see how much “hidden” phenotypic information is revealed when measured comprehensively; 2) generating RNA-Seq gene expression profiles from identical leaves, visualizing the underlying data structure as a Mapper graph; the same will be done for phenotypic data as measured by the ECT; and, 3) predicting the precise leaf shape features associated with gene expression signatures using deep learning. By converting underlying molecular and phenotypic data structures into node embeddings, an encoder-decoder neural network will align molecular and phenotypic Mapper graphs. The result will be a mapping of gene expression profiles to features of leaf shape as predicted using deep learning methods on underlying data structures. All project outcomes will be made publicly available through long term data repositories.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
生物体是嵌入其基因组中的信息通过分子过程表达的结果。测序技术使生物学家能够从基因组中提取几乎所有的信息内容。然而,衡量一个有机体是什么还没有达到基因组测序的程度:与基因组不同,它还不可能衡量嵌入在有机体形式中的全部信息。如果生物体中包含的所有信息都能被提取出来,就可以开发出一种模型,来解决生物学中的一大挑战,即根据生物的基因组信息预测生物是什么的能力。在这个项目中,生物学中尚未充分探索的数学方法将被用来通过测量数据的结构来提取数据中的信息。这个数学领域有一个座右铭:所有的形状都是数据,所有的数据都有形状。通过测量树叶的形状和基因表达模式,该项目将把它们视为可以提取嵌入信息的数据。然后将使用深度学习方法根据树叶的基因表达谱预测树叶的形状。作为该项目与其对社会影响之间联系的一部分,来自美国和墨西哥的学生将通过Plants&;Python帮助分析数据,这是一种双语的免费课程,旨在将从未编码的植物生物学家和刚刚接触植物科学的数据科学家与组成美国农业的团体聚集在一起。利用X射线计算机断层成像(CT)测量植物形态和转录组图谱(RNA-seq)测量基因表达,该项目将使用两种拓扑数据分析(TDA)技术-欧拉特征转换(ECT)和映射器算法(MAPPER),以提取发育重复性不同的拟南芥材料叶片形态中嵌入的总信息。ECT在数学上被证明可以将任何对象与任何其他对象区分开来,而Mapper算法用于将底层数据结构可视化为图形。具体目标包括:1)使用ECT测量包含在叶片形状中的总信息,并与传统方法进行比较,以了解在综合测量时揭示了多少“隐藏”的表型信息;2)从相同的叶片生成RNA-Seq基因表达谱,将底层数据结构可视化为Mapper图;将对ECT测量的表型数据进行同样的处理;以及3)使用深度学习预测与基因表达特征相关联的精确的叶形特征。通过将潜在的分子和表型数据结构转换为节点嵌入,编码器-解码器神经网络将对齐分子和表型映射器图。结果将是基因表达谱与使用底层数据结构的深度学习方法预测的叶子形状特征的映射。所有项目成果将通过长期数据库向公众公布。这一奖项反映了NSF的法定使命,并通过使用基金会的智力优势和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Aman Husbands其他文献
Aman Husbands的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Aman Husbands', 18)}}的其他基金
Context-dependent activity of an essential transcription factor
重要转录因子的上下文依赖性活性
- 批准号:
2244797 - 财政年份:2022
- 资助金额:
$ 52.61万 - 项目类别:
Continuing Grant
Context-dependent activity of an essential transcription factor
重要转录因子的上下文依赖性活性
- 批准号:
2039489 - 财政年份:2021
- 资助金额:
$ 52.61万 - 项目类别:
Continuing Grant
Elucidating the Mechanisms of Intercellular Movement
阐明细胞间运动的机制
- 批准号:
1856292 - 财政年份:2019
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
相似国自然基金
Research on Quantum Field Theory without a Lagrangian Description
- 批准号:24ZR1403900
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
Cell Research
- 批准号:31224802
- 批准年份:2012
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research
- 批准号:31024804
- 批准年份:2010
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research (细胞研究)
- 批准号:30824808
- 批准年份:2008
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
- 批准号:10774081
- 批准年份:2007
- 资助金额:45.0 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: REU Site: Earth and Planetary Science and Astrophysics REU at the American Museum of Natural History in Collaboration with the City University of New York
合作研究:REU 地点:地球与行星科学和天体物理学 REU 与纽约市立大学合作,位于美国自然历史博物馆
- 批准号:
2348998 - 财政年份:2025
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: REU Site: Earth and Planetary Science and Astrophysics REU at the American Museum of Natural History in Collaboration with the City University of New York
合作研究:REU 地点:地球与行星科学和天体物理学 REU 与纽约市立大学合作,位于美国自然历史博物馆
- 批准号:
2348999 - 财政年份:2025
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: Investigating Southern Ocean Sea Surface Temperatures and Freshening during the Late Pliocene and Pleistocene along the Antarctic Margin
合作研究:调查上新世晚期和更新世沿南极边缘的南大洋海面温度和新鲜度
- 批准号:
2313120 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: Non-Linearity and Feedbacks in the Atmospheric Circulation Response to Increased Carbon Dioxide (CO2)
合作研究:大气环流对二氧化碳 (CO2) 增加的响应的非线性和反馈
- 批准号:
2335762 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: Using Adaptive Lessons to Enhance Motivation, Cognitive Engagement, And Achievement Through Equitable Classroom Preparation
协作研究:通过公平的课堂准备,利用适应性课程来增强动机、认知参与和成就
- 批准号:
2335802 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: Using Adaptive Lessons to Enhance Motivation, Cognitive Engagement, And Achievement Through Equitable Classroom Preparation
协作研究:通过公平的课堂准备,利用适应性课程来增强动机、认知参与和成就
- 批准号:
2335801 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: Holocene biogeochemical evolution of Earth's largest lake system
合作研究:地球最大湖泊系统的全新世生物地球化学演化
- 批准号:
2336132 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: LTREB: The importance of resource availability, acquisition, and mobilization to the evolution of life history trade-offs in a variable environment.
合作研究:LTREB:资源可用性、获取和动员对于可变环境中生命史权衡演变的重要性。
- 批准号:
2338394 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Continuing Grant
Collaborative Research: Constraining next generation Cascadia earthquake and tsunami hazard scenarios through integration of high-resolution field data and geophysical models
合作研究:通过集成高分辨率现场数据和地球物理模型来限制下一代卡斯卡迪亚地震和海啸灾害情景
- 批准号:
2325311 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant
Collaborative Research: BoCP-Implementation: Testing Evolutionary Models of Biotic Survival and Recovery from the Permo-Triassic Mass Extinction and Climate Crisis
合作研究:BoCP-实施:测试二叠纪-三叠纪大规模灭绝和气候危机中生物生存和恢复的进化模型
- 批准号:
2325380 - 财政年份:2024
- 资助金额:
$ 52.61万 - 项目类别:
Standard Grant