权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Scalable, Sample Efficient and Interpretable Bayesian Deep Learning

可扩展、样本高效且可解释的贝叶斯深度学习

基本信息

批准号：
2275741
负责人：
金额：
--
依托单位：
University of Cambridge
依托单位国家：
英国
项目类别：
Studentship
财政年份：
2019
资助国家：
英国
起止时间：
2019 至无数据
项目状态：
已结题

来源：
https://gtr.ukri.org/projects?ref=studentship-2275741
关键词：
Scalable Sample Efficient Interpretable Bayesian

项目摘要

My work will be focused around the EPSRC's "Artificial intelligence technologies" research area. An abstract can be read bellow.Neural Networks (NN) are a class of machine learning models which have recently soared in popularity due to their flexibility and scalability to large amounts of data. NNs are most commonly trained using Maximum a Posteriori (MAP) parameter estimation. This framework provides point estimates for model weights, often leading to overfitting, overconfidence in predictions and sample inefficiency. Additionally, NNs often behave like black boxes. Their predictions are notoriously difficult to interpret.Bayesian methods provide a principled way of tackling the issue of overconfidence in model parameters. In Bayesian Neural Networks (BNN), weight point estimates are substituted by probability distributions. Predictions are made by marginalising the weights, considering all possible parameter values. In these models, uncertainty in weight space is translated into uncertainty in predictions, giving us a way to model 'what we do not know.' Unfortunately, exact inference is often intractable for complex models and approximate inference methods tend to rely on crude approximations which present a trade-off between accuracy of uncertainty estimates and scalability. My goal is to develop a new class of flexible approximate inference methods for neural networks which are able to fit complex data, produce reliable uncertainty estimates, and scale to large datasets. I want to use the uncertainty estimates produced by these models to automatically generate explanations that are understandable to non-experts about these model's decisions. Finally, I want to use these approximate inference methods to reduce model-bias and build sample-efficient model-based reinforcement learning algorithms.

我的工作将集中在 EPSRC 的“人工智能技术”研究领域。摘要如下。神经网络（NN）是一类机器学习模型，由于其对大量数据的灵活性和可扩展性，最近越来越受欢迎。神经网络最常使用最大后验 (MAP) 参数估计进行训练。该框架提供了模型权重的点估计，通常会导致过度拟合、预测过度自信和样本效率低下。此外，神经网络通常表现得像黑匣子。众所周知，他们的预测很难解释。贝叶斯方法提供了解决模型参数过度自信问题的原则性方法。在贝叶斯神经网络 (BNN) 中，权重点估计被概率分布替代。预测是通过边缘化权重并考虑所有可能的参数值来进行的。在这些模型中，权重空间的不确定性转化为预测的不确定性，为我们提供了一种对“我们不知道的东西”进行建模的方法。不幸的是，对于复杂模型来说，精确推断通常很困难，而近似推断方法往往依赖于粗略近似，这在不确定性估计的准确性和可扩展性之间进行权衡。我的目标是为神经网络开发一类新型灵活的近似推理方法，该方法能够拟合复杂的数据，产生可靠的不确定性估计，并扩展到大型数据集。我想使用这些模型产生的不确定性估计来自动生成非专家可以理解的关于这些模型决策的解释。最后，我想使用这些近似推理方法来减少模型偏差并构建样本高效的基于模型的强化学习算法。