Statistical And Computational Methods For Gene Expression and Proteomic Analysis

基因表达和蛋白质组分析的统计和计算方法

基本信息

  • 批准号:
    8746528
  • 负责人:
  • 金额:
    $ 94.38万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
  • 资助国家:
    美国
  • 起止时间:
  • 项目状态:
    未结题

项目摘要

Gene expression measurement using microarrays or next-generation sequencing techniques, is a popular and useful technology for genomic analysis. Challenging problems result from the large volume of data generated in these experiments. Quality control and experimental design remain important fundamental issues. Analytical techniques which account for complex experimental designs and minimize artifacts are required. Many problematic statistical and bioinformatics issues remain and are addressed in this project. Next generation sequencing techniques are now a popular means for RNA expression measurement (RNAseq). As with microarrays, a host of technical and quality control issues remain as challenges, in addition to the new statistical problems implied by change of scale from continuous (microarray fluorescence) to discrete (read counts). We develop and test methods for analysis of alternative gene splicing, based on microarray platforms especially designed for the purpose, and more recently, using RNAseq. Two measurement platforms, the Affymetrix exon array and the ExonHit junction probe array have been studied. A special version of our analysis package, The MSCL Toolbox, was written for this study, namely the ExonSVD. This statistical technique was shown to be highly efficient at identifying genes undergoing alternative splicing, and was less susceptible to the false positives encountered with the earlier ExonANOVA method. The ExonANOVA model has now been tested with RNAseq data in two different studies. It performs well, and perhaps better than it does in the microarray context, owing to better conformity of the data with the underlying assumptions of independence and uniformity of variance, after transformation. The Framingham Heart Survey SABRe project uses the Affymetrix Exon array, which increases the available transcriptional information by roughly a factor of 10, compared to earlier expression arrays. This large project, which assayed almost 6,000 samples, has now been completed. The last phase (Third Generation cohort, about 3,000 samples) was completed in 2011. In addition to careful continuous quality control monitoring of data collection over 3 calendar years, our lab has carefully monitored and developed corrections for several important artifacts affecting the data. Data adjustment for laboratory measured QC parameters allowed for substantial reduction of variation in the data. Principal Components analysis led to the possibility of further correction of the data. Both raw and adjusted versions of the dataset for the Offspring and Third Generation cohorts have been completed and submitted to dbGaP for distribution to qualified investigators. Careful analysis of gene expression in conjunction with SNP determinations found that individual identities, to within close family membership could be re-established from expression data alone. This finding allowed for the determination and removal of about a dozen samples for which the identity had apparently been scrambled. Further analysis of expression data in combination with Complete Blood Count with Differential results on a fraction of the entire dataset, allowed for effective imputation of CBC results for the entire dataset. These data make it possible to adjust expression data for the varying makeup of white-blood cell and platelet composition, which might otherwise confound expression analysis. The Offspring and Third Generation results have now been analyzed with many phenotype working groups and have provided strong results for such phenotypes as blood lipid levels, IL-6 levels, smoking effects, osteoporosis, diabetes, and cardiovascular disease The case-control study (manuscript published), has yielded lists of genes significantly associated with cardiovascular disease (CVD). Pending the confirmation by qPCR analysis, many of these newly detected associations will become the subject of a third manuscript. Together with other investigators, we are analyzing the expression data in combination with genetic data (eQTL analysis), with microRNA expression data and finding many strong statistical associations, due to the large, homogeneous nature of our dataset. We are comparing our results to that of others in a variety of international consortia, to find validation for many of our findings. Affordable, high-quality software availability has been one of the bottlenecks in analysis of microarray data. We have further developed the "MSCL Analyst's Toolbox" to address this need. This toolbox allows investigators to download Affymetrix microarray data from a central database, normalize and transform the data, inspect it for a variety of outliers or defects, perform a variety of statistical tests to select relevant genes affected in the experiment, and then visualize and classify various patterns of gene expression. In collaboration with over forty investigators in NCI, CC, NHLBI, NINDS, NIAID, NHGRI, NICHD, NIA, NIDDK, NIDA , this tool has been applied to dozens of microarray studies. The Analyst's Toolbox has been extended to now handle analysis of RNAseq data, with inclusion of new data transformations, and utility functions. In a continuing NIH-wide project, we maintain a database for storage, retrieval and analysis of Affymetrix microarrays, the NIHAGCC. Our downloadable tool set (MSCL Analyst's Toolbox) is now mature, widely tested and applied in numerous studies. We also maintain a quarterly-updated set of annotation files for use with Affymetrix data, in a format for convenient download and use by our collaborators. Last year, the NIHAGCC was re-hosted on newer server hardware, with high capacity data storage needed for RNAseq datasets. In a continuing study of the rat pineal transcriptome, we have found a dramatic number of novel, unannotated, but demonstrably controlled regions of genomic expression, termed non-coding RNAs (ncRNAs) some of which were found to be pseudo-genes of highly expressed genes. The growing list of such novel features has grown to several hundred, as multiple RNA-seq experiments become available. In a collaboration with NHGRI, we are conducting an RNA-seq investigation of transcriptomic differences using a case-control design, of coronary artery calcification, based on ClinSeq study samples. We integrated RNA-seq and microarray data from the same individuals, and found consistent changes across the two methodologies, which are now candidates for follow-up studies. In a collaboration with NEI, we are analyzing the transcriptome of mouse photoreceptor from embryonic, through neonatal to later adult stages. This extensive time series, using bot the Affymetrix Exon array and RNA-seq in parallel, allows for high resolution analysis at the gene and exon levels, and is providing an unparalleled view of transcriptomic changes accompanying important developmental events (e.g. differentiation, eye opening). The aim is to identify genes involved in mammalian aging and which may be relevant to age-related diseases of the eye in human.

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

peter j munson其他文献

peter j munson的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('peter j munson', 18)}}的其他基金

Statistical And Computational Methods For Molecular Biology And Biomedicine
分子生物学和生物医学的统计和计算方法
  • 批准号:
    8565482
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Statistical And Computational Methods For Gene Expression and Proteomic Analysis
基因表达和蛋白质组分析的统计和计算方法
  • 批准号:
    8148480
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Statistical And Computational Methods For Molecular Biol
分子生物学的统计和计算方法
  • 批准号:
    7296867
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Statistical And Computational Methods For Gene Expression and Proteomic Analysis
基因表达和蛋白质组分析的统计和计算方法
  • 批准号:
    8941406
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Physical modeling of biological systems
生物系统的物理建模
  • 批准号:
    8746533
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Statistical And Computational Methods For Molecular Biology And Biomedicine
分子生物学和生物医学的统计和计算方法
  • 批准号:
    7966721
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Statistical And Computational Methods For Gene Expression and Proteomic Analysis
基因表达和蛋白质组分析的统计和计算方法
  • 批准号:
    7966728
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Statistical & Computational Method For Molecular Biology
统计
  • 批准号:
    7145131
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
Statistical & Computational Methods For Gene Expression
统计
  • 批准号:
    6988060
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:
White Matter Connectivity and Network Analysis
白质连接和网络分析
  • 批准号:
    8746532
  • 财政年份:
  • 资助金额:
    $ 94.38万
  • 项目类别:

相似海外基金

Rational design of rapidly translatable, highly antigenic and novel recombinant immunogens to address deficiencies of current snakebite treatments
合理设计可快速翻译、高抗原性和新型重组免疫原,以解决当前蛇咬伤治疗的缺陷
  • 批准号:
    MR/S03398X/2
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Fellowship
Re-thinking drug nanocrystals as highly loaded vectors to address key unmet therapeutic challenges
重新思考药物纳米晶体作为高负载载体以解决关键的未满足的治疗挑战
  • 批准号:
    EP/Y001486/1
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Research Grant
CAREER: FEAST (Food Ecosystems And circularity for Sustainable Transformation) framework to address Hidden Hunger
职业:FEAST(食品生态系统和可持续转型循环)框架解决隐性饥饿
  • 批准号:
    2338423
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Continuing Grant
Metrology to address ion suppression in multimodal mass spectrometry imaging with application in oncology
计量学解决多模态质谱成像中的离子抑制问题及其在肿瘤学中的应用
  • 批准号:
    MR/X03657X/1
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Fellowship
CRII: SHF: A Novel Address Translation Architecture for Virtualized Clouds
CRII:SHF:一种用于虚拟化云的新型地址转换架构
  • 批准号:
    2348066
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Standard Grant
The Abundance Project: Enhancing Cultural & Green Inclusion in Social Prescribing in Southwest London to Address Ethnic Inequalities in Mental Health
丰富项目:增强文化
  • 批准号:
    AH/Z505481/1
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Research Grant
ERAMET - Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
ERAMET - 快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
  • 批准号:
    10107647
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    EU-Funded
BIORETS: Convergence Research Experiences for Teachers in Synthetic and Systems Biology to Address Challenges in Food, Health, Energy, and Environment
BIORETS:合成和系统生物学教师的融合研究经验,以应对食品、健康、能源和环境方面的挑战
  • 批准号:
    2341402
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Standard Grant
Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
  • 批准号:
    10106221
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    EU-Funded
Recite: Building Research by Communities to Address Inequities through Expression
背诵:社区开展研究,通过表达解决不平等问题
  • 批准号:
    AH/Z505341/1
  • 财政年份:
    2024
  • 资助金额:
    $ 94.38万
  • 项目类别:
    Research Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了