Shrinkage Estimation in Modern Statistics

现代统计学中的收缩估计

基本信息

  • 批准号:
    0707033
  • 负责人:
  • 金额:
    --
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing grant
  • 财政年份:
    2007
  • 资助国家:
    美国
  • 起止时间:
    2007-07-01 至 2011-06-30
  • 项目状态:
    已结题

项目摘要

Technical Description: Contemporary statistics deals with increasingly large and complex collections of data. Complementary styles are emerging for appropriately interpreting such data. One general style involves constructing multi-faceted models for the data. These should be built from relatively simpler components to treat separate aspects of the situation, but with additional modeling parameters at higher levels of the structure in order to provide connections among the components and also to provide flexibility and a measure of robustness to cushion against the possibility that the model may be overly restrictive or partially inappropriate. These additional higher order quantities may be described numerically, qualitatively, or as unknown functions depending on the context. They have various names, such as hyperparameters or latent variables, but irrespective of the terminology and of their subjective interpretation they serve comparable roles relative to the analysis of the data. Adding such higher order quantities to component models has the effect in their analysis of shrinking estimates and other forms of inference toward global averages or patterns. The major thrust of the proposal is to understand the shrinkage phenomenon from a more fundamental, componentwise perspective. Charles Stein's discovery of the advantages of shrinkage in the estimation of independent normal means is among the most surprising and important statistical developments of the preceding century. As the proposal emphasizes this discovery can be interpreted in exactly the framework of independent pieces tied together by higher level quantities. A great deal of theory has been developed over the past 50 years to rigorously understand the consequences of dealing with such a connection, and of using certain structurally appealing techniques such as those labeled as random-effects models" or hierarchical or empirical objective Bayes analyses. However this theory has failed to adequately address some issues that need to be understood before it can be adequately and properly applied in modern complex settings. For example, classical theory has tended to break groups of parameters into separate blocks and, at best, to shrink separately within each block. But it will be shown how additional shrinkage across blocks can be beneficial, and further research is proposed in this regard. Another classical deficiency that this proposal focuses on trying to correct is that current theory of shrinkage is relatively incomplete and somewhat unsatisfactory with regard to unbalanced data situations involving unequal sampling variances (heteroscedasticity), but these are typical in all highly complex modern data.General Description: Modern statistical applications often involve massive amounts of data. Conceptual organization and interpretation of such large data sets is a fundamental challenge. It often involves modeling the data as being probabilistically dependent on parameters that control the process being investigated. Classical statistical formulations typically view individual parameters as the primitive structural quantities, rather than taking as primitives ensembles of parameters whose joint modeling characteristics are well understood and controlled. This proposal introduces the ensemble-risk to better address this issue and suggests judging estimators according to their performance relative to this ensemble-risk. Applications of the theory and methodology to be developed in this proposal include nearly all areas of science and technology, but principle applications can be identified in areas of physical and biological sciences such as genomics, climatology and astronomy where large data sets and ensembles of related parameters appear in a natural fashion. As a further instance of the range of potential applications, the orientation and conceptualization in this proposal derives in part from previous data modeling of telephone call-center traffic and internet traffic and intrusions, and a portion of the current proposal involves modeling in a different, complex context involving forecasting housing prices.
技术描述:当代统计学处理越来越大和复杂的数据集合。为了恰当地解释这些数据,互补的风格正在出现。一种通用的风格涉及为数据构建多面模型。这些应该从相对简单的组件构建,以处理情况的不同方面,但在结构的更高级别上使用额外的建模参数,以便提供组件之间的连接,并提供灵活性和健壮性度量,以缓冲模型可能过度限制或部分不适当的可能性。根据具体情况,这些附加的高阶量可以用数值、定性或未知函数来描述。它们有各种各样的名称,例如超参数或潜在变量,但不管术语和它们的主观解释如何,它们相对于数据分析起着类似的作用。将这样的高阶量添加到组件模型中,会影响它们对缩小估计的分析,以及对全球平均或模式的其他形式的推断。该提案的主要目的是从更基本的、组件的角度来理解收缩现象。查尔斯·斯坦(Charles Stein)在估计独立正态均值时发现了收缩的优点,这是上个世纪最令人惊讶和最重要的统计学发展之一。正如提案所强调的那样,这一发现可以准确地解释为由更高水平的量连接在一起的独立碎片的框架。在过去的50年里,已经发展了大量的理论来严格理解处理这种联系的后果,以及使用某些结构上吸引人的技术,如那些被标记为“随机效应模型”或分层或经验客观贝叶斯分析的技术。然而,这一理论未能充分解决一些需要理解的问题,然后才能充分和适当地应用于现代复杂的环境。例如,经典理论倾向于将参数组分解成单独的块,并且在最好的情况下,在每个块中分别收缩。但它将显示如何在块间额外收缩是有益的,并在这方面提出了进一步的研究。另一个经典缺陷,这一建议的重点是试图纠正的是,目前的收缩理论是相对不完整的,在涉及不等抽样方差(异方差)的不平衡数据情况下,有些令人不满意,但这些是典型的在所有高度复杂的现代数据。一般描述:现代统计应用通常涉及大量数据。对如此大的数据集进行概念性组织和解释是一个根本性的挑战。它通常涉及将数据建模为概率依赖于控制所调查过程的参数。经典的统计公式通常将单个参数视为基本结构量,而不是将其联合建模特性很好地理解和控制的参数集合作为基本结构量。该建议引入了集成风险来更好地解决这个问题,并建议根据相对于集成风险的性能来判断评估器。本提案中所发展的理论和方法的应用几乎涵盖了所有科学和技术领域,但主要应用可以在物理和生物科学领域,如基因组学、气候学和天文学,在这些领域,大量数据集和相关参数的集合以自然的方式出现。作为潜在应用范围的进一步实例,本提案中的方向和概念化部分源于以前对电话呼叫中心流量和互联网流量和入侵的数据建模,而当前提案的一部分涉及在涉及预测房价的不同复杂背景下进行建模。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Lawrence Brown其他文献

Surveillance results and bone effects in the Gulf War depleted uranium-exposed cohort
海湾战争贫铀暴露人群的监测结果和骨骼影响
  • DOI:
  • 发表时间:
    2018
  • 期刊:
  • 影响因子:
    0
  • 作者:
    M. McDiarmid;Marianne Cloeren;J. Gaitens;S. Hines;E. Streeten;Richard J. Breyer;Clayton H. Brown;M. Condon;T. Roth;M. Oliver;Lawrence Brown;M. Dux;M. Lewin;Frederick G. Strathmann;Maria A. Velez;P. Gucer
  • 通讯作者:
    P. Gucer
Correction to: Working with Misspecified Regression Models
  • DOI:
    10.1007/s10940-020-09464-8
  • 发表时间:
    2020-06-01
  • 期刊:
  • 影响因子:
    3.300
  • 作者:
    Richard Berk;Lawrence Brown;Andreas Buja;Edward George;Linda Zhao
  • 通讯作者:
    Linda Zhao
Biologic monitoring and surveillance results for the department of veterans affairs' depleted uranium cohort: Lessons learned from sustained exposure over two decades.
退伍军人事务部贫铀队列的生物监测和监测结果:二十年来持续暴露的经验教训。
  • DOI:
    10.1002/ajim.22435
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    3.5
  • 作者:
    M. McDiarmid;J. Gaitens;S. Hines;M. Condon;T. Roth;M. Oliver;P. Gucer;Lawrence Brown;J. Centeno;E. Streeten;K. Squibb
  • 通讯作者:
    K. Squibb
Health effects of depleted uranium on exposed Gulf War veterans.
贫铀对暴露的海湾战争退伍军人的健康影响。
  • DOI:
  • 发表时间:
    2000
  • 期刊:
  • 影响因子:
    8.3
  • 作者:
    M. McDiarmid;James P. Keogh;Frank J. Hooper;Kathleen McPhaul;K. Squibb;Robert L. Kane;R. DiPino;M. Kabat;Bruce Kaup;Larry D. Anderson;D. Hoover;Lawrence Brown;Matthew M. Hamilton;David Jacobson;Belton A. Burrows;Mark Walsh
  • 通讯作者:
    Mark Walsh
The Gulf War Depleted Uranium Cohort at 20 years: Bioassay Results and Novel Approaches to Fragment Surveillance
海湾战争 20 年后的贫铀队列:生物测定结果和碎片监视的新方法
  • DOI:
    10.1097/hp.0b013e31827b1740
  • 发表时间:
    2013
  • 期刊:
  • 影响因子:
    2.2
  • 作者:
    M. McDiarmid;J. Gaitens;S. Hines;Richard J. Breyer;J. Wong;Susan M. Engelhardt;M. Oliver;P. Gucer;Robert L. Kane;A. Cernich;Bruce Kaup;D. Hoover;A. Gaspari;Juan Liu;Erin M. Harberts;Lawrence Brown;J. Centeno;Patrick J. Gray;Hanna Xu;K. Squibb
  • 通讯作者:
    K. Squibb

Lawrence Brown的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Lawrence Brown', 18)}}的其他基金

Collaborative Research: Inference for Linear Model Parameters in Model-free Populations
合作研究:无模型群体中线性模型参数的推断
  • 批准号:
    1310795
  • 财政年份:
    2013
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Post Model Selection Inference and Empirical Bayes Methods
模型选择后推理和经验贝叶斯方法
  • 批准号:
    1007657
  • 财政年份:
    2010
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Seventh International Workshop on Objective Bayesian Methodology; Philadelphia, PA
第七届客观贝叶斯方法论国际研讨会;
  • 批准号:
    0924257
  • 财政年份:
    2009
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Prediction for Multi-factor Point Process Models
多因素点过程模型的预测
  • 批准号:
    0405716
  • 财政年份:
    2004
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Service Engineering of Human Tele-Queues: Empirically Based Stochastic Analysis of Telephone Call Centers
人工电话队列服务工程:基于经验的电话呼叫中心随机分析
  • 批准号:
    0223304
  • 财政年份:
    2002
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Asymptotic Equivalence in Nonparametric Function Problems-Theory and Applications
非参数函数问题中的渐近等价-理论与应用
  • 批准号:
    9971751
  • 财政年份:
    1999
  • 资助金额:
    --
  • 项目类别:
    Continuing grant
Dissertation Research: Making Ends Meet: Differences AmongYoruba Women in Benin in the use of a Multiple Enterprise Economic Strategy
论文研究:收支平衡:贝宁约鲁巴妇女在使用多元化企业经济战略方面的差异
  • 批准号:
    9711900
  • 财政年份:
    1997
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Mathematical Sciences: Three Topics in Mathematical Statistics
数学科学:数理统计的三个主题
  • 批准号:
    9626118
  • 财政年份:
    1996
  • 资助金额:
    --
  • 项目类别:
    Continuing grant
Mathematical Sciences: Investigations in Mathematical Statistics
数学科学:数理统计研究
  • 批准号:
    9596094
  • 财政年份:
    1994
  • 资助金额:
    --
  • 项目类别:
    Continuing grant
Mathematical Sciences: Investigations in Mathematical Statistics
数学科学:数理统计研究
  • 批准号:
    9310228
  • 财政年份:
    1993
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant

相似海外基金

Data-Driven Dynamic State-Estimation for Modern Power Systems
现代电力系统的数据驱动动态状态估计
  • 批准号:
    2221784
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CAREER: A Modern Philosophy for Classical Statistical Testing and Estimation
职业:经典统计测试和估计的现代哲学
  • 批准号:
    2042366
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2019
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
Modern power semiconductor characterization, modelling and loss estimation in EMTP-type simulators
EMTP 型模拟器中的现代功率半导体表征、建模和损耗估计
  • 批准号:
    518000-2017
  • 财政年份:
    2018
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
CAREER: Locally Adaptive Nonparametric Estimation for the Modern Age - New Insights, Extensions, and Inference Tools
职业:现代局部自适应非参数估计 - 新见解、扩展和推理工具
  • 批准号:
    1554123
  • 财政年份:
    2016
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
Spatial and Temporal Power Estimation and Optimization for Modern Field Programmable Gate Array Architectures
现代现场可编程门阵列架构的空间和时间功率估计和优化
  • 批准号:
    427469-2012
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Postgraduate Scholarships - Doctoral
Spatial and Temporal Power Estimation and Optimization for Modern Field Programmable Gate Array Architectures
现代现场可编程门阵列架构的空间和时间功率估计和优化
  • 批准号:
    427469-2012
  • 财政年份:
    2013
  • 资助金额:
    --
  • 项目类别:
    Postgraduate Scholarships - Doctoral
Spatial and Temporal Power Estimation and Optimization for Modern Field Programmable Gate Array Architectures
现代现场可编程门阵列架构的空间和时间功率估计和优化
  • 批准号:
    427469-2012
  • 财政年份:
    2012
  • 资助金额:
    --
  • 项目类别:
    Postgraduate Scholarships - Doctoral
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了