Bayesian Variable Selection in Generalized Linear Models with Missing Variables
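Basic information
- Grant number: 8471550
- Principal investigator:
- Amount: $230,000
- Host institution:
- Host institution country: United States
- Project type:
- Fiscal year: 2011
- Funding country: United States
- Project period: 2011-08-11 to 2014-08-30
- Project status: Completed
- Source: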
- Keywords: Accounting; Address; Algorithms; Archives; Autistic Disorder; Bayesian Method; Binomial Distribution; Biomedical Research; Child Abuse; Clinical Trials; Complex; Computer software; Data; Data Analyses; Data Set; Dependence; Development; Dropout; Effectiveness; Equation; Evaluation; Generic Drugs; Genes; Genetic Transcription; Immunobiology; Individual; Libraries; Linear Models; Linear Regressions; Markov Chains; Medical Research; Methodology; Methods; Modeling; Observational Study; Outcome; Pathway interactions; Performance; Phenotype; Poisson Distribution; Problem behavior; Procedures; Process; Research; Research Personnel; Resort; Scheme; Side; Social Problems; Societies; Solutions; Statistical Data Interpretation; Structure; Testing; Uncertainty; base; behavior measurement; clinically relevant; cytokine; empowered; flexibility; improved; smoking cessation; software development; tool
Project abstract
DESCRIPTION (provided by applicant): In medical research, especially research on behavioral and social problems, missing values pose a challenge for statistical data analysis. Missing values may arise for subjective reasons (e.g., nonresponse and dropout) or technical reasons (e.g., censoring above or below a quantization level). Generalized linear models (GLMs) are widely applied in biomedical data analysis, where a fundamental task is to interpret or predict an outcome variable from a subset of potentially explanatory variables. Given an incomplete data set, practitioners frequently resort to case-deletion, in which individuals are excluded from consideration if they are missing any of the variables targeted for analysis. This is the default option in many software packages. Yet case-deletion may not only sacrifice useful information but also give rise to biased estimates, because it requires strong assumptions about the missingness mechanism. A more satisfactory solution for missing data problems involves multiple imputation, where several imputations are created for the same set of missing values. The variance between imputations reflects the uncertainty due to missingness. Across multiply imputed data sets, however, traditional variable selection methods (based on significance tests or various criteria) often result in models with different selected predictors, thus presenting the problem of how to combine the models to make final inferences. In this R01 proposal with a 3-year research plan, we aim to develop two alternative strategies of variable selection for GLMs with missing values by drawing on a Bayesian framework. One approach, which we call "impute, then select" (ITS), involves initially performing multiple imputation and then applying Bayesian variable selection to the multiply imputed data sets.
The second strategy, "simultaneously impute and select" (SIAS), is to conduct Bayesian variable selection and missing data imputation simultaneously within one Markov chain Monte Carlo (MCMC) process. ITS and SIAS offer two generic frameworks within which various Bayesian variable selection algorithms and missing data imputation algorithms can be implemented. Both strategies will be developed, evaluated, and implemented in an R library for normal regression, binomial regression, and other GLMs with categorical and/or continuous explanatory variables. Practical data sets from several studies on substance abuse and childhood autism will be used to assess the effectiveness and flexibility of the proposed strategies. Development of these procedures and contribution of the software to statisticians and researchers in medical research would significantly improve the quality of evaluation of important and clinically relevant data.
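The "impute, then select" idea can be illustrated with a small Python sketch. It is not the applicants' method or software: it assumes a Gaussian linear model, one covariate with values missing completely at random, a simple stochastic regression imputation in place of a full Bayesian imputation model, and a BIC approximation to the marginal likelihood over all enumerated models in place of MCMC. All function names (bic_linear, inclusion_probs, impute_once, impute_then_select) and the simulated data are hypothetical.

```python
import itertools
import numpy as np


def bic_linear(y, X):
    """BIC of a Gaussian linear model with an intercept plus the columns of X."""
    n = len(y)
    design = np.column_stack([np.ones(n), X])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    rss = float(np.sum((y - design @ beta) ** 2))
    k = design.shape[1] + 1  # regression coefficients plus the error variance
    return n * np.log(rss / n) + k * np.log(n)


def inclusion_probs(y, X):
    """Approximate posterior inclusion probabilities for each column of X.

    Assumption: exp(-BIC / 2) stands in for the marginal likelihood and the
    prior over the 2^p candidate models is uniform.
    """
    p = X.shape[1]
    models = np.array(list(itertools.product([0, 1], repeat=p)))
    bics = np.array([bic_linear(y, X[:, m.astype(bool)]) for m in models])
    weights = np.exp(-0.5 * (bics - bics.min()))
    weights /= weights.sum()
    return models.T @ weights


def impute_once(y, X, miss, rng):
    """Stochastic regression imputation of column 0 from the other columns and y.

    Simplification: regression parameters are fixed at their complete-case
    estimates instead of being drawn from a posterior distribution.
    """
    n = len(y)
    Z = np.column_stack([np.ones(n), X[:, 1:], y])
    obs = ~miss
    coef, *_ = np.linalg.lstsq(Z[obs], X[obs, 0], rcond=None)
    resid = X[obs, 0] - Z[obs] @ coef
    sigma = resid.std(ddof=Z.shape[1])
    X_imp = X.copy()
    X_imp[miss, 0] = Z[miss] @ coef + rng.normal(0.0, sigma, int(miss.sum()))
    return X_imp


def impute_then_select(y, X, miss, n_imputations=5, seed=0):
    """Average inclusion probabilities over several imputed data sets."""
    rng = np.random.default_rng(seed)
    probs = [
        inclusion_probs(y, impute_once(y, X, miss, rng))
        for _ in range(n_imputations)
    ]
    return np.mean(probs, axis=0)


# Simulated example: only the first two of four covariates affect the outcome.
rng = np.random.default_rng(1)
n, p = 300, 4
X_full = rng.normal(size=(n, p))
y = 1.0 + 2.0 * X_full[:, 0] - 1.5 * X_full[:, 1] + rng.normal(size=n)
miss = rng.random(n) < 0.2  # about 20% of covariate 0 is missing
X = X_full.copy()
X[miss, 0] = np.nan

pip = impute_then_select(y, X, miss)
print(np.round(pip, 3))
```

Averaging inclusion probabilities across imputations is one simple way to combine the per-imputation results into a single summary; other combination rules are possible, and the abstract does not say which one the project adopts.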
Project outputs
Journal articles (3)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
A PLSPM-based test statistic for detecting gene-gene co-association in genome-wide association study with case-control design.
- DOI: 10.1371/journal.pone.0062129
- Publication date: 2013
- Journal:
- Impact factor: 3.7
- Authors: Zhang X; Yang X; Yuan Z; Liu Y; Li F; Peng B; Zhu D; Zhao J; Xue F
- Corresponding author: Xue F
Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies.
- DOI: 10.1186/s12863-014-0130-7
- Publication date: 2014-12-10
- Journal:
- Impact factor: 2.9
- Authors: Zhang X; Xue F; Liu H; Zhu D; Peng B; Wiemels JL; Yang X
- Corresponding author: Yang X
An integrative framework for Bayesian variable selection with informative priors for identifying genes and pathways.
- DOI: 10.1371/journal.pone.0067672
- Publication date: 2013
- Journal:
- Impact factor: 3.7
- Authors: Peng B; Zhu D; Ander BP; Zhang X; Xue F; Sharp FR; Yang X
- Corresponding author: Yang X
Other grants by XIAOWEI YANG
Bayesian Variable Selection in Generalized Linear Models with Missing Variables
- Grant number: 8317303
- Fiscal year: 2011
- Amount: $230,000
- Project type:
Bayesian Variable Selection in Generalized Linear Models with Missing Variables
- Grant number: 8543193
- Fiscal year: 2011
- Amount: $230,000
- Project type:
Bayesian Variable Selection in Generalized Linear Models with Missing Variables
- Grant number: 8194802
- Fiscal year: 2011
- Amount: $230,000
- Project type:
iPhone-based Real-time Data Solution for Drug Abuse and Other Medical Research
- Grant number: 7672825
- Fiscal year: 2009
- Amount: $230,000
- Project type:
Transition Model for Incomplete Longitudinal Binary Data
- Grant number: 6676189
- Fiscal year: 2003
- Amount: $230,000
- Project type:
DEVELOPMENT OF AN AUTOMATED NEURAL SPIKE DISCRIMINATOR
- Grant number: 3504570
- Fiscal year: 1991
- Amount: $230,000
- Project type:
Similar overseas grants
Rational design of rapidly translatable, highly antigenic and novel recombinant immunogens to address deficiencies of current snakebite treatments
- Grant number: MR/S03398X/2
- Fiscal year: 2024
- Amount: $230,000
- Project type: Fellowship
CAREER: FEAST (Food Ecosystems And circularity for Sustainable Transformation) framework to address Hidden Hunger
- Grant number: 2338423
- Fiscal year: 2024
- Amount: $230,000
- Project type: Continuing Grant
Re-thinking drug nanocrystals as highly loaded vectors to address key unmet therapeutic challenges
- Grant number: EP/Y001486/1
- Fiscal year: 2024
- Amount: $230,000
- Project type: Research Grant
Metrology to address ion suppression in multimodal mass spectrometry imaging with application in oncology
- Grant number: MR/X03657X/1
- Fiscal year: 2024
- Amount: $230,000
- Project type: Fellowship
CRII: SHF: A Novel Address Translation Architecture for Virtualized Clouds
- Grant number: 2348066
- Fiscal year: 2024
- Amount: $230,000
- Project type: Standard Grant
The Abundance Project: Enhancing Cultural & Green Inclusion in Social Prescribing in Southwest London to Address Ethnic Inequalities in Mental Health
- Grant number: AH/Z505481/1
- Fiscal year: 2024
- Amount: $230,000
- Project type: Research Grant
ERAMET - Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
- Grant number: 10107647
- Fiscal year: 2024
- Amount: $230,000
- Project type: EU-Funded
BIORETS: Convergence Research Experiences for Teachers in Synthetic and Systems Biology to Address Challenges in Food, Health, Energy, and Environment
- Grant number: 2341402
- Fiscal year: 2024
- Amount: $230,000
- Project type: Standard Grant
Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
- Grant number: 10106221
- Fiscal year: 2024
- Amount: $230,000
- Project type: EU-Funded
Recite: Building Research by Communities to Address Inequities through Expression
- Grant number: AH/Z505341/1
- Fiscal year: 2024
- Amount: $230,000
- Project type: Research Grant