课题基金基金详情
基因表达调控分析的非参数回归模型
结题报告
批准号:
39900126
项目类别:
青年科学基金项目
资助金额:
12.0 万元
负责人:
陈长生
学科分类:
H3011.流行病学方法与卫生统计
结题年份:
2002
批准年份:
1999
项目状态:
已结题
项目参与者:
阎玉霞、李文潮、宇传华、张俊杰、尚磊、李鹏、韩炯
国基评审专家1V1指导 中标率高出同行96.8%
结合最新热点,提供专业选题建议
深度指导申报书撰写,确保创新可行
指导项目中标800+,快速提高中标率
客服二维码
微信扫码咨询
中文摘要
本项目研究建立一套适合分析基因表达调控网络的非参数回归模型。从理论上阐述模型的特性,用粗糙度惩罚方法构造出模型的目标函数并证明出密码子不同位置上的碱基组成及其相关性与基因表达水平间的回归关系,提出用于基因表达调控分析的一些统计量。在模型中考虑协变量,控制非调节因素对模型核心参估计值的干扰,并建立高维非参数回归模型。
英文摘要
A nonparametric regression model do relax the strict assumptions of classical regression models, and serve any form distribution data. It does not choose model form, especially, relaxing the assumption of linear relationship between the responses and the explanatory variables. Therefore, it extends linear models and strengthen model adaptability. In order to improve on LS estimate, the penalized sum of squares is set up. The penalized least squares estimator for regression function by minimizing the penalized sum of squares can be obtained, this estimator compromise between goodness of fit and smoothness.There are few quantitative indices measuring level of gene expression. Distributions of these indices are unknown, and patterns of dependent relationship between level of gene expression and influencing factors are indefinite. So, some strict assumptions supporting the classical theory of linear models are not satisfied. If data do not meet these conditions of classical statistical approaches, statistical inferences drawn from classical approaches would be, to different extent, influenced in negative direction and even erroneous conclusions would be drawn.Therefore, nonparametric regression models would help us solve statistical problems of genome nformatics..This project aims at establishing non-parametric regression models for analyzing gene expression regulation networks. Based on cubic spline and roughness penalty approach, a set of theories and algorithms of nonparametric regression models are proposed for various cases of nonparametric regression analysis. We explore smoothing spline, weighted nonparametric regression model, semiparametric regression model and multidimensional nonparametric regression model in consideration of weights, ties and covariables. We provide cross-validation (CV) score function and generalized cross-validation (GCV) score function. The best design of interest parameters can be obtained by a module form search method. Various nonparametric regression models are verified and assessed by statistical simulations and examples. The computational method measuring codon usage bias is proposed, and codon usage frequencies for two known yeasts are analyzed by using Relative Synonymous Codon Usage (RSCU). Thus highly expressed optimal codons are determined. RSCU-based quantitative statistic, Codon Adaptation Index (CAI), is proposed to measure level of gene expression. The regression relationship between CAI for yeast and such factors as codon usage bias, third base composition and linear correlation of codon usage with tRNA abundance. A proper software for nonparametric regression models is compiled.
专著列表
科研奖励列表
会议论文列表
专利列表
基于神经网络混合效应模型的循证2型糖尿病监测研究
基于集成统计学习方法鉴定I型糖尿病肠道微生物标志物及其作用机制研究
多因素复杂疾病微阵列数据富集分析方法研究
国内基金
海外基金