Shrinkage Estimation in Modern Statistics

现代统计学中的收缩估计

基本信息

  • 批准号:
    0707033
  • 负责人:
  • 金额:
    --
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing grant
  • 财政年份:
    2007
  • 资助国家:
    美国
  • 起止时间:
    2007-07-01 至 2011-06-30
  • 项目状态:
    已结题

项目摘要

Technical Description: Contemporary statistics deals with increasingly large and complex collections of data. Complementary styles are emerging for appropriately interpreting such data. One general style involves constructing multi-faceted models for the data. These should be built from relatively simpler components to treat separate aspects of the situation, but with additional modeling parameters at higher levels of the structure in order to provide connections among the components and also to provide flexibility and a measure of robustness to cushion against the possibility that the model may be overly restrictive or partially inappropriate. These additional higher order quantities may be described numerically, qualitatively, or as unknown functions depending on the context. They have various names, such as hyperparameters or latent variables, but irrespective of the terminology and of their subjective interpretation they serve comparable roles relative to the analysis of the data. Adding such higher order quantities to component models has the effect in their analysis of shrinking estimates and other forms of inference toward global averages or patterns. The major thrust of the proposal is to understand the shrinkage phenomenon from a more fundamental, componentwise perspective. Charles Stein's discovery of the advantages of shrinkage in the estimation of independent normal means is among the most surprising and important statistical developments of the preceding century. As the proposal emphasizes this discovery can be interpreted in exactly the framework of independent pieces tied together by higher level quantities. A great deal of theory has been developed over the past 50 years to rigorously understand the consequences of dealing with such a connection, and of using certain structurally appealing techniques such as those labeled as random-effects models" or hierarchical or empirical objective Bayes analyses. However this theory has failed to adequately address some issues that need to be understood before it can be adequately and properly applied in modern complex settings. For example, classical theory has tended to break groups of parameters into separate blocks and, at best, to shrink separately within each block. But it will be shown how additional shrinkage across blocks can be beneficial, and further research is proposed in this regard. Another classical deficiency that this proposal focuses on trying to correct is that current theory of shrinkage is relatively incomplete and somewhat unsatisfactory with regard to unbalanced data situations involving unequal sampling variances (heteroscedasticity), but these are typical in all highly complex modern data.General Description: Modern statistical applications often involve massive amounts of data. Conceptual organization and interpretation of such large data sets is a fundamental challenge. It often involves modeling the data as being probabilistically dependent on parameters that control the process being investigated. Classical statistical formulations typically view individual parameters as the primitive structural quantities, rather than taking as primitives ensembles of parameters whose joint modeling characteristics are well understood and controlled. This proposal introduces the ensemble-risk to better address this issue and suggests judging estimators according to their performance relative to this ensemble-risk. Applications of the theory and methodology to be developed in this proposal include nearly all areas of science and technology, but principle applications can be identified in areas of physical and biological sciences such as genomics, climatology and astronomy where large data sets and ensembles of related parameters appear in a natural fashion. As a further instance of the range of potential applications, the orientation and conceptualization in this proposal derives in part from previous data modeling of telephone call-center traffic and internet traffic and intrusions, and a portion of the current proposal involves modeling in a different, complex context involving forecasting housing prices.
技术描述:当代统计学处理日益庞大和复杂的数据集合。相辅相成的风格正在涌现,以适当地解释这些数据。一种通用风格涉及为数据构建多方面模型。这些应由相对较简单的组件构建,以处理情况的不同方面,但在结构的较高级别具有额外的建模参数,以便在组件之间提供连接,并提供灵活性和稳健性措施,以缓冲模型可能过于严格或部分不合适的可能性。根据上下文,这些附加的更高阶量可以用数字、定性或未知函数来描述。它们有不同的名称,如超参数或潜在变量,但无论术语及其主观解释如何,它们在数据分析方面发挥着类似的作用。将这样的高阶量添加到成分模型中,可以在它们分析缩水估计和其他形式的全球平均或模式推断时产生效果。该提案的主旨是从更根本的、组成部分的角度来理解收缩现象。查尔斯·斯坦在独立正态均值估计中发现了收缩的优势,这是上个世纪最令人惊讶和最重要的统计发展之一。正如该提案强调的那样,这一发现可以在由更高水平的量连接在一起的独立部分的框架内准确地解释。在过去的50年里,已经发展了大量的理论,以严格理解处理这种联系的后果,以及使用某些结构上有吸引力的技术,例如那些被标记为随机效应模型的技术,或者分层或经验的客观贝叶斯分析。然而,这一理论未能充分解决在其能够充分和适当地应用于现代复杂环境之前需要理解的一些问题。例如,经典理论倾向于将参数组分成不同的块,最好的情况是在每个块内分别收缩。但这将表明,跨区块的额外收缩是如何有益的,并建议在这方面进行进一步的研究。这项建议试图纠正的另一个经典缺陷是,当前的收缩理论相对不完整,在涉及不等抽样方差(异方差)的不平衡数据情况下有点不令人满意,但这些情况在所有高度复杂的现代数据中都是典型的。一般描述:现代统计应用程序通常涉及大量数据。对如此庞大的数据集进行概念性组织和解释是一个根本的挑战。它通常涉及对数据进行建模,使其在概率上依赖于控制被调查过程的参数。经典的统计公式通常将单个参数视为原始结构量,而不是将其联合建模特征被很好地理解和控制的参数集合作为原始参数。这一建议引入了集合风险,以更好地解决这一问题,并建议根据估计者相对于这种集合风险的表现来判断估计者。本提案中将要发展的理论和方法的应用几乎涵盖了所有科学和技术领域,但可以确定主要应用领域是物理和生物科学领域,如基因组学、气候学和天文学,在这些领域中,大量数据集和相关参数集合以自然方式出现。作为潜在应用范围的另一个实例,本提案中的定向和概念化部分地源自电话呼叫中心业务和互联网业务以及入侵的先前数据建模,并且当前提案的一部分涉及在涉及预测房价的不同的、复杂的环境中建模。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Lawrence Brown其他文献

Surveillance results and bone effects in the Gulf War depleted uranium-exposed cohort
海湾战争贫铀暴露人群的监测结果和骨骼影响
  • DOI:
  • 发表时间:
    2018
  • 期刊:
  • 影响因子:
    0
  • 作者:
    M. McDiarmid;Marianne Cloeren;J. Gaitens;S. Hines;E. Streeten;Richard J. Breyer;Clayton H. Brown;M. Condon;T. Roth;M. Oliver;Lawrence Brown;M. Dux;M. Lewin;Frederick G. Strathmann;Maria A. Velez;P. Gucer
  • 通讯作者:
    P. Gucer
Correction to: Working with Misspecified Regression Models
  • DOI:
    10.1007/s10940-020-09464-8
  • 发表时间:
    2020-06-01
  • 期刊:
  • 影响因子:
    3.300
  • 作者:
    Richard Berk;Lawrence Brown;Andreas Buja;Edward George;Linda Zhao
  • 通讯作者:
    Linda Zhao
Biologic monitoring and surveillance results for the department of veterans affairs' depleted uranium cohort: Lessons learned from sustained exposure over two decades.
退伍军人事务部贫铀队列的生物监测和监测结果:二十年来持续暴露的经验教训。
  • DOI:
    10.1002/ajim.22435
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    3.5
  • 作者:
    M. McDiarmid;J. Gaitens;S. Hines;M. Condon;T. Roth;M. Oliver;P. Gucer;Lawrence Brown;J. Centeno;E. Streeten;K. Squibb
  • 通讯作者:
    K. Squibb
Health effects of depleted uranium on exposed Gulf War veterans.
贫铀对暴露的海湾战争退伍军人的健康影响。
  • DOI:
  • 发表时间:
    2000
  • 期刊:
  • 影响因子:
    8.3
  • 作者:
    M. McDiarmid;James P. Keogh;Frank J. Hooper;Kathleen McPhaul;K. Squibb;Robert L. Kane;R. DiPino;M. Kabat;Bruce Kaup;Larry D. Anderson;D. Hoover;Lawrence Brown;Matthew M. Hamilton;David Jacobson;Belton A. Burrows;Mark Walsh
  • 通讯作者:
    Mark Walsh
The Gulf War Depleted Uranium Cohort at 20 years: Bioassay Results and Novel Approaches to Fragment Surveillance
海湾战争 20 年后的贫铀队列:生物测定结果和碎片监视的新方法
  • DOI:
    10.1097/hp.0b013e31827b1740
  • 发表时间:
    2013
  • 期刊:
  • 影响因子:
    2.2
  • 作者:
    M. McDiarmid;J. Gaitens;S. Hines;Richard J. Breyer;J. Wong;Susan M. Engelhardt;M. Oliver;P. Gucer;Robert L. Kane;A. Cernich;Bruce Kaup;D. Hoover;A. Gaspari;Juan Liu;Erin M. Harberts;Lawrence Brown;J. Centeno;Patrick J. Gray;Hanna Xu;K. Squibb
  • 通讯作者:
    K. Squibb

Lawrence Brown的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Lawrence Brown', 18)}}的其他基金

Collaborative Research: Inference for Linear Model Parameters in Model-free Populations
合作研究:无模型群体中线性模型参数的推断
  • 批准号:
    1310795
  • 财政年份:
    2013
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Post Model Selection Inference and Empirical Bayes Methods
模型选择后推理和经验贝叶斯方法
  • 批准号:
    1007657
  • 财政年份:
    2010
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Seventh International Workshop on Objective Bayesian Methodology; Philadelphia, PA
第七届客观贝叶斯方法论国际研讨会;
  • 批准号:
    0924257
  • 财政年份:
    2009
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Prediction for Multi-factor Point Process Models
多因素点过程模型的预测
  • 批准号:
    0405716
  • 财政年份:
    2004
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Service Engineering of Human Tele-Queues: Empirically Based Stochastic Analysis of Telephone Call Centers
人工电话队列服务工程:基于经验的电话呼叫中心随机分析
  • 批准号:
    0223304
  • 财政年份:
    2002
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Asymptotic Equivalence in Nonparametric Function Problems-Theory and Applications
非参数函数问题中的渐近等价-理论与应用
  • 批准号:
    9971751
  • 财政年份:
    1999
  • 资助金额:
    --
  • 项目类别:
    Continuing grant
Dissertation Research: Making Ends Meet: Differences AmongYoruba Women in Benin in the use of a Multiple Enterprise Economic Strategy
论文研究:收支平衡:贝宁约鲁巴妇女在使用多元化企业经济战略方面的差异
  • 批准号:
    9711900
  • 财政年份:
    1997
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Mathematical Sciences: Three Topics in Mathematical Statistics
数学科学:数理统计的三个主题
  • 批准号:
    9626118
  • 财政年份:
    1996
  • 资助金额:
    --
  • 项目类别:
    Continuing grant
Mathematical Sciences: Investigations in Mathematical Statistics
数学科学:数理统计研究
  • 批准号:
    9596094
  • 财政年份:
    1994
  • 资助金额:
    --
  • 项目类别:
    Continuing grant
Mathematical Sciences: Investigations in Mathematical Statistics
数学科学:数理统计研究
  • 批准号:
    9310228
  • 财政年份:
    1993
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant

相似海外基金

Data-Driven Dynamic State-Estimation for Modern Power Systems
现代电力系统的数据驱动动态状态估计
  • 批准号:
    2221784
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CAREER: A Modern Philosophy for Classical Statistical Testing and Estimation
职业:经典统计测试和估计的现代哲学
  • 批准号:
    2042366
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2019
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2018
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
CAREER: Locally Adaptive Nonparametric Estimation for the Modern Age - New Insights, Extensions, and Inference Tools
职业:现代局部自适应非参数估计 - 新见解、扩展和推理工具
  • 批准号:
    1554123
  • 财政年份:
    2016
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
Spatial and Temporal Power Estimation and Optimization for Modern Field Programmable Gate Array Architectures
现代现场可编程门阵列架构的空间和时间功率估计和优化
  • 批准号:
    427469-2012
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Postgraduate Scholarships - Doctoral
Spatial and Temporal Power Estimation and Optimization for Modern Field Programmable Gate Array Architectures
现代现场可编程门阵列架构的空间和时间功率估计和优化
  • 批准号:
    427469-2012
  • 财政年份:
    2013
  • 资助金额:
    --
  • 项目类别:
    Postgraduate Scholarships - Doctoral
Spatial and Temporal Power Estimation and Optimization for Modern Field Programmable Gate Array Architectures
现代现场可编程门阵列架构的空间和时间功率估计和优化
  • 批准号:
    427469-2012
  • 财政年份:
    2012
  • 资助金额:
    --
  • 项目类别:
    Postgraduate Scholarships - Doctoral
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了