Statistics for high-dimensional data with applications in analysis of high-throughput genomics experiments
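Basic information
- Grant number: 240006-2006
- Principal investigator:
- Amount: $8,000
- Host institution:
- Host institution country: Canada
- Project category: Discovery Grants Program - Individual
- Fiscal year: 2007
- Funding country: Canada
- Project period: 2007-01-01 to 2008-12-31
- Project status: Completed
- Source:
- Keywords: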
Project abstract
The analysis of very high-dimensional data poses a unique set of challenges for statisticians: in designing feasible algorithms, in estimation and visualization, and in inference. In this research project I propose to continue my work on addressing issues in all these areas, with a special emphasis on high-throughput genomic data. This includes data from expression microarray experiments, which can interrogate up to 50k genes in a single biological assay; genome-wide scans of genotypic information, such as Affymetrix GeneChip Human Mapping arrays, which can interrogate up to 500k markers on a human genome (so-called Single Nucleotide Polymorphisms, which are locations on a genome accounting for the vast majority of genetic differences); and data from proteomics experiments, such as Tandem Mass Spectrometry, which is currently used to detect the presence of tens or hundreds of thousands of proteins in one experiment and is being researched for quantitative use. While the nature of each of these data examples can be very different, as can the potential applications of such technologies (which are now widely used for both basic and clinical research), they all share a number of characteristics that can be addressed by general statistical research. The results of this research program will benefit both the statistical community, by introducing new methods for statistical analysis and visualization of high-dimensional data and by motivating further research in this very important area, and biologists and clinicians, by expanding the toolbox of statistical methods and software tools that can be applied to their genomic experiments. As part of this work we will apply and validate the new methods on real data to perform primary and secondary analysis of important clinical and biological experiments.
The long-term objective of this work is to connect some results in the theoretical statistics of high-dimensional data, great advances in computer technology and the resulting novel approaches to statistical analysis, and the high-throughput revolution in biological research, to help answer important questions in biological and health sciences.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Kustra, Rafal
5-hmC in the brain is abundant in synaptic genes and shows differences at the exon-intron boundary.
- DOI: 10.1038/nsmb.2372
- Publication date: 2012-10
- Journal:
- Impact factor: 16.8
- Authors: Khare, Tarang; Pai, Shraddha; Koncevicius, Karolis; Pal, Mrinal; Kriukiene, Edita; Liutkeviciute, Zita; Irimia, Manuel; Jia, Peixin; Ptak, Carolyn; Xia, Menghang; Tice, Raymond; Tochigi, Mamoru; Morera, Solange; Nazarians, Anaies; Belsham, Denise; Wong, Albert H. C.; Blencowe, Benjamin J.; Wang, Sun Chong; Kapranov, Philipp; Kustra, Rafal; Labrie, Viviane; Klimasauskas, Saulius; Petronis, Arturas
- Corresponding author: Petronis, Arturas
Predictors of all-cause mortality among patients hospitalized with influenza, respiratory syncytial virus, or SARS-CoV-2.
- DOI: 10.1111/irv.13004
- Publication date: 2022-11
- Journal:
- Impact factor: 4.4
- Authors: Hamilton, Mackenzie A.; Liu, Ying; Calzavara, Andrew; Sundaram, Maria E.; Djebli, Mohamed; Darvin, Dariya; Baral, Stefan; Kustra, Rafal; Kwong, Jeffrey C.; Mishra, Sharmistha
- Corresponding author: Mishra, Sharmistha
Data-Fusion in Clustering Microarray Data: Balancing Discovery and Interpretability
- DOI: 10.1109/tcbb.2007.70267
- Publication date: 2010-01-01
- Journal:
- Impact factor: 4.5
- Authors: Kustra, Rafal; Zagdanski, Adam
- Corresponding author: Zagdanski, Adam
CFH and ARMS2 genetic risk determines progression to neovascular age-related macular degeneration after antioxidant and zinc supplementation
- DOI: 10.1073/pnas.1718059115
- Publication date: 2018-01-23
- Journal:
- Impact factor: 11.1
- Authors: Vavvas, Demetrios G.; Small, Kent W.; Kustra, Rafal
- Corresponding author: Kustra, Rafal
A factor analysis model for functional genomics
- DOI: 10.1186/1471-2105-7-216
- Publication date: 2006-04-21
- Journal:
- Impact factor: 3
- Authors: Kustra, Rafal; Shioda, Romy; Zhu, Mu
- Corresponding author: Zhu, Mu
Other grants held by Kustra, Rafal
Computational and Inferential Tools for Machine Learning Methods in Biostatistical Research
- Grant number: RGPIN-2017-06586
- Fiscal year: 2019
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Statistics for high-dimensional data with applications in analysis of high-throughput genomics experiments
- Grant number: 240006-2006
- Fiscal year: 2010
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Statistics for high-dimensional data with applications in analysis of high-throughput genomics experiments
- Grant number: 240006-2006
- Fiscal year: 2009
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Statistics for high-dimensional data with applications in analysis of high-throughput genomics experiments
- Grant number: 240006-2006
- Fiscal year: 2008
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Statistics for high-dimensional data with applications in analysis of high-throughput genomics experiments
- Grant number: 240006-2006
- Fiscal year: 2006
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
High-performance computing resource for statistical genomics
- Grant number: 330595-2006
- Fiscal year: 2005
- Funding amount: $8,000
- Project category: Research Tools and Instruments - Category 1 (<$150,000)
Biostatistical analysis of medical signals and images
- Grant number: 240006-2001
- Fiscal year: 2005
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Biostatistical analysis of medical signals and images
- Grant number: 240006-2001
- Fiscal year: 2004
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Biostatistical analysis of medical signals and images
- Grant number: 240006-2001
- Fiscal year: 2003
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Biostatistical analysis of medical signals and images
- Grant number: 240006-2001
- Fiscal year: 2002
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Similar grants from the National Natural Science Foundation of China (NSFC)
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- Grant number:
- Approval year: 2024
- Funding amount:
- Project category: Collaborative Innovation Research Team
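Self-homeomorphisms, Floer homology, and the 4-dimensional genus of fibered knots
- Grant number: 12301086
- Approval year: 2023
- Funding amount: 300,000 yuan
- Project category: Young Scientists Fund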
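Research on projective nonlinear non-negative tensor factorization based on individual analysis for pattern analysis of high-dimensional unstructured data
- Grant number: 61502059
- Approval year: 2015
- Funding amount: 190,000 yuan
- Project category: Young Scientists Fund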
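Analysis of changes in related proteins after neoadjuvant chemotherapy for breast cancer using iTRAQ quantitative proteomics
- Grant number: 81150011
- Approval year: 2011
- Funding amount: 100,000 yuan
- Project category: Special Fund Project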
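Research on digitization and three-dimensional imaging of the hepatic duct system
- Grant number: 30470493
- Approval year: 2004
- Funding amount: 230,000 yuan
- Project category: General Program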
Similar overseas grants
High-Dimensional and Large-Sample Asymptotic Theory for the Test Statistics with Monotone Missing Data and Its Application
- Grant number: 16K17642
- Fiscal year: 2016
- Funding amount: $8,000
- Project category: Grant-in-Aid for Young Scientists (B)
MEETING: Interactions between Omics and Statistics: Analyzing High Dimensional Data to be held at the 8th Intl Purdue Symposium on Statistics June 20-24, 2012, West Lafayette, IN
- Grant number: 1240803
- Fiscal year: 2012
- Funding amount: $8,000
- Project category: Standard Grant
Deciphering the Histone Code by Mass Spectrometry and High Dimensional Statistics
- Grant number: 7912731
- Fiscal year: 2010
- Funding amount: $8,000
- Project category:
Deciphering the Histone Code by Mass Spectrometry and High Dimensional Statistics
- Grant number: 8281647
- Fiscal year: 2010
- Funding amount: $8,000
- Project category:
Statistics for high-dimensional data with applications in analysis of high-throughput genomics experiments
- Grant number: 240006-2006
- Fiscal year: 2010
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual
Deciphering the Histone Code by Mass Spectrometry and High Dimensional Statistics
- Grant number: 8244953
- Fiscal year: 2010
- Funding amount: $8,000
- Project category:
Statistics for high-dimensional data with applications in analysis of high-throughput genomics experiments
- Grant number: 240006-2006
- Fiscal year: 2009
- Funding amount: $8,000
- Project category: Discovery Grants Program - Individual