Measures of association in microbiome and other count-based platforms
微生物组和其他基于计数的平台的关联测量
基本信息
- 批准号:RGPIN-2021-03634
- 负责人:
- 金额:$ 1.68万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2021
- 资助国家:加拿大
- 起止时间:2021-01-01 至 2022-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The human microbiome is the collection of organisms residing in or on the human body and plays an integral part in human health. A typical microbiome dataset consists of a count matrix where rows correspond to samples from different microbial populations and columns correspond to microbial taxa (e.g. species, genus, family). Microbiome sequencing data are compositional in nature, meaning that the count of a taxon in a sample should not be interpreted in absolute terms; rather, it should only be interpreted relative to one or more of the other taxa. This makes measuring associations between different taxa difficult, since correlations on the raw taxon proportions will be negatively biased. In this proposal I will extend existing multinomial model to estimate covariance matrices in compositional data as a function of covariates. To do this, I will modify previous models I have developed to handle a more flexible covariance structure. Relatedly, statistical techniques have been developed to address the pitfalls of compositional data, for example, the log-ratio transformation. However, the log-ratio transformation is scale invariant, whereas count-based sequencing platforms are not. Recently, (Lovell, Chua, and McGrath 2020) published an important paper exploring the negative effects of applying standard compositional techniques compositional data from count platforms, demonstrating a need for new statistical techniques that will account for differences in count totals across biological samples. A common measure of association in compositional data is the "variation matrix", which contains the variances of the pairwise log-ratios of the parts of the composition. There is a lack of statistical theory surrounding estimation of the variation matrix in count-based compositional data. Relatedly, zero counts are often very prevalent in microbiome data. It has been shown that measures of association and diversity are sensitive to the choice of procedure used to handle zero counts. However, Bayesian non-parametric models such as the hierarchical Pitman-Yor (HPY) process have recently proved successful in modelling species abundance distributions in microbiome data. I will additionally develop measures of association in the context of the HPY process, which will allow estimation of association when there are zero counts present in the data. Finally, I will study the applicability of these methods on other count-based compositional platforms such as single-cell RNA sequencing. This work will provide new methods of estimating measures of association and diversity in metagenomic data that will account for differences in the count totals over biological samples as well as the abundance of zero counts. Though this problem has recently gained attention, there remains a lack of sound statistical theory in this context. Furthermore, these methods will be applicable to many other kinds of platforms such as single-cell RNA sequencing datasets.
人体微生物组是居住在人体内或人体上的生物体的集合,在人类健康中起着不可或缺的作用。典型的微生物组数据集由计数矩阵组成,其中行对应于来自不同微生物群体的样品,列对应于微生物分类群(例如种、属、科)。微生物组测序数据本质上是组成性的,这意味着样本中一个分类单元的计数不应该以绝对值来解释;相反,它只应该相对于一个或多个其他分类单元来解释。这使得测量不同分类群之间的关联变得困难,因为原始分类群比例的相关性将是负偏差的。在这个建议中,我将扩展现有的多项式模型来估计协方差矩阵的组成数据作为协变量的函数。为此,我将修改以前的模型,我已经开发了一个更灵活的协方差结构。相关的,统计技术已经开发出来,以解决组成数据的陷阱,例如,对数比转换。然而,对数比变换是尺度不变的,而基于计数的测序平台不是。最近,(Lovell,Chua,and麦格拉思2020)发表了一篇重要论文,探讨了应用标准组成技术对来自计数平台的组成数据的负面影响,表明需要新的统计技术来解释生物样本中计数总数的差异。组成数据中关联性的一个常见度量是“变异矩阵”,它包含组成部分的成对对数比的方差。在基于计数的组成数据中,缺乏关于估计变异矩阵的统计理论。相关地,零计数在微生物组数据中通常非常普遍。它已被证明,协会和多样性的措施是敏感的选择用于处理零计数的程序。然而,贝叶斯非参数模型,如分层Pitman-Yor(HPY)过程,最近已被证明在微生物组数据中建模物种丰度分布方面是成功的。我将在HPY过程的背景下额外开发关联性的度量,这将允许在数据中存在零计数时估计关联性。最后,我将研究这些方法在其他基于计数的组合平台上的适用性,例如单细胞RNA测序。这项工作将提供新的方法来估计宏基因组数据中的关联性和多样性的测量,这些方法将解释生物样品中计数总数的差异以及零计数的丰度。虽然这个问题最近得到了关注,但在这方面仍然缺乏健全的统计理论。此外,这些方法将适用于许多其他类型的平台,如单细胞RNA测序数据集。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
McGregor, Kevin其他文献
MDiNE: a model to estimate differential co-occurrence networks in microbiome studies
- DOI:
10.1093/bioinformatics/btz824 - 发表时间:
2020-03-15 - 期刊:
- 影响因子:5.8
- 作者:
McGregor, Kevin;Labbe, Aurelie;Greenwood, Celia M. T. - 通讯作者:
Greenwood, Celia M. T.
McGregor, Kevin的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('McGregor, Kevin', 18)}}的其他基金
Measures of association in microbiome and other count-based platforms
微生物组和其他基于计数的平台的关联测量
- 批准号:
RGPIN-2021-03634 - 财政年份:2022
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Grants Program - Individual
Measures of association in microbiome and other count-based platforms
微生物组和其他基于计数的平台的关联测量
- 批准号:
DGECR-2021-00459 - 财政年份:2021
- 资助金额:
$ 1.68万 - 项目类别:
Discovery Launch Supplement
相似国自然基金
精神分裂症全基因组关联研究的通路分析及验证
- 批准号:81071087
- 批准年份:2010
- 资助金额:35.0 万元
- 项目类别:面上项目
精神分裂症与吸烟关联的分子遗传学机制研究
- 批准号:81000579
- 批准年份:2010
- 资助金额:20.0 万元
- 项目类别:青年科学基金项目
孤独症全基因组关联第二阶段研究
- 批准号:81071110
- 批准年份:2010
- 资助金额:32.0 万元
- 项目类别:面上项目
多盘科单殖吸虫宿主特异性及其与无尾两栖类宿主协同进化关系研究
- 批准号:30960049
- 批准年份:2009
- 资助金额:23.0 万元
- 项目类别:地区科学基金项目
Non-coherent网络中的纠错码及其应用
- 批准号:60972011
- 批准年份:2009
- 资助金额:30.0 万元
- 项目类别:面上项目
孤独症与突触发育相关候选基因的关联研究
- 批准号:30870897
- 批准年份:2008
- 资助金额:50.0 万元
- 项目类别:面上项目
大鱼际掌纹特应征与5个哮喘易感基因单核苷酸多态性的关联分析
- 批准号:30873315
- 批准年份:2008
- 资助金额:31.0 万元
- 项目类别:面上项目
Multistage,haplotype and functional tests-based FCAR 基因和IgA肾病相关关系研究
- 批准号:30771013
- 批准年份:2007
- 资助金额:30.0 万元
- 项目类别:面上项目
相似海外基金
The Celiac Disease Genomic, Environmental, Microbiome, and Metabolomic (CD-GEMM) Prospective Cohort Study
乳糜泻基因组、环境、微生物组和代谢组 (CD-GEMM) 前瞻性队列研究
- 批准号:
10905694 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
The Interplay of Host Genetic Variation and the Gut Microbiome in Crohn's Disease
克罗恩病宿主遗传变异与肠道微生物组的相互作用
- 批准号:
10751276 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Sex Differences in Statural Growth Impairment in Pediatric Crohn's Disease: Part 2
儿童克罗恩病身高发育障碍的性别差异:第 2 部分
- 批准号:
10638422 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Collaborative Research: DMS/NIGMS 2: New statistical methods, theory, and software for microbiome data
合作研究:DMS/NIGMS 2:微生物组数据的新统计方法、理论和软件
- 批准号:
10797410 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Understanding the Association between Sublingual Buprenorphine and Oral Health Outcomes
了解舌下含服丁丙诺啡与口腔健康结果之间的关联
- 批准号:
10765299 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Association between early Candida infection (oral thrush) and severe early childhood caries
早期念珠菌感染(鹅口疮)与严重儿童早期龋齿之间的关联
- 批准号:
10739505 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Epidemiology of diet, metabolism and non-alcoholic fatty liver disease in Hispanic/Latino adults
西班牙裔/拉丁裔成人饮食、代谢和非酒精性脂肪肝的流行病学
- 批准号:
10735454 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Amino acid mimicry: Insights into glyphosate transport and toxicity to mitochondria
氨基酸拟态:深入了解草甘膦转运和线粒体毒性
- 批准号:
10573869 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:
Understanding and Targeting the Pathophysiology of Youth-onset Type 2 Diabetes- NYU Clinical Center
了解并针对青年发病 2 型糖尿病的病理生理学 - 纽约大学临床中心
- 批准号:
10584108 - 财政年份:2023
- 资助金额:
$ 1.68万 - 项目类别:














{{item.name}}会员




