权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Towards Open-world Semi-supervised learning

走向开放世界的半监督学习

基本信息

批准号：
2766068
负责人：
金额：
--
依托单位：
University of Edinburgh
依托单位国家：
英国
项目类别：
Studentship
财政年份：
2022
资助国家：
英国
起止时间：
2022 至无数据
项目状态：
未结题

来源：
https://gtr.ukri.org/projects?ref=studentship-2766068
关键词：
Towards Open world Semi supervised

项目摘要

Large amounts of fully annotated data is one of the major components responsible for the success of current deep-learning models. However, in many application scenarios, this assumption of having an extensive collection of human provided annotations is not realistic, as gathering those annotations can be costly and require expert knowledge. Additionally in real-world settings, new concepts and categories may emerge over time. Thus it is often not feasible to gather human annotations for every possible concept. This project will focus on the question of how to build autonomous agents that can automatically reason about novel categories for which no human supervision is provided at training time.Current semi-supervised learning methods are limited in that they require all categories to have human annotations for at least one example. As a result it can be hard to directly adopt previous semi-supervised learning methods in this open-world regime. Recently a new area called Novel Category Discovery (NCD) has emerged, which is closely related to this project, which focuses on the problem of how to discover novel categories within a large unlabelled dataset by leveraging knowledge from an existing labelled dataset. However, NCD methods assume that all the categories in the unlabelled set are novel, and only focus on the performance of those novel categories.In a real-world scenario, it is of equal importance to be able to recognise both previously seen categories as well as discover novel ones. In this work, we aim to develop algorithms and solutions for open-world semi-supervised learning. The developed algorithms will not only be able to recognise previously seen categories using labelled and unlabelled data, but also be able to discover novel categories from unlabelled data. Upon successful development of these new methods, we also aim to design interpretable methods that can convey 'why' the model thinks a set of examples are novel and `how' a newly discovered novel category differs from previously seen ones. These proposed interpretable methods will give new insight into large unlabeled image collections and will advance current deep-learning approaches into a more realistic open-world setting. In this project, we will focus on the following goals: (1) Classifier learning: Learning to classify both seen and novel categories using labelled and unlabelled images.(2) Category number estimation: The category number estimation algorithm should be able to run efficiently and be performed simultaneously with the classifier learning process.(3) Interpreting discovered categories: Interpretable model outputs so that the user knows why some images are predicted to form a novel category.To design novel methods for this new open-world semi-supervised learning setting, we aim to draw inspiration from classic unsupervised clustering methods and cutting-edge deep learning methods:(1) Hierarchical clustering, these methods have the advantage of automatically merging data points to adaptively form a hierarchy of categories, by learning a similarity measuring function using neural networks. We can leverage it to perform hierarchical clustering and to be able to learn the classifier and estimate the number of novel categories at the same time.(2) Leveraging state-of-the-art visual-language models, we also aim to use recent advancements in visual-language learning to design methods for automatically generating a textual explanation for interpreting 'why' the model thinks a set of examples are novel, and 'how' the novel categories differ from the 'seen' categories from the human labelled dataset.We will apply these methods to real-world tasks to better understand the practical challenges present which will inform the development of more robust models.

大量完全注释的数据是当前深度学习模型成功的主要组成部分之一。然而，在许多应用场景中，具有大量人类提供的注释的这种假设是不现实的，因为收集这些注释可能是昂贵的并且需要专业知识。此外，在现实世界中，新的概念和类别可能会随着时间的推移而出现。因此，为每个可能的概念收集人类注释通常是不可行的。这个项目将集中在如何建立自主代理的问题，可以自动推理的新类别，没有人的监督提供在training time.当前的半监督学习方法是有限的，因为他们需要所有类别有人类注释至少一个例子。因此，在这个开放的世界中，很难直接采用以前的半监督学习方法。最近出现了一个新的领域，称为新的类别发现（NCD），这是密切相关的这个项目，其重点是如何发现新的类别在一个大型的未标记的数据集中，利用知识从现有的标记数据集的问题。然而，NCD方法假设未标记集合中的所有类别都是新的，并且只关注那些新类别的性能。在现实世界中，能够识别先前看到的类别以及发现新类别同样重要。在这项工作中，我们的目标是开发开放世界半监督学习的算法和解决方案。开发的算法不仅能够使用标记和未标记的数据识别以前见过的类别，而且还能够从未标记的数据中发现新的类别。在成功开发这些新方法后，我们还旨在设计可解释的方法，这些方法可以传达“为什么”模型认为一组示例是新颖的，以及“如何”新发现的新颖类别与以前看到的不同。这些提出的可解释方法将为大型未标记图像集提供新的见解，并将当前的深度学习方法推进到更现实的开放世界环境中。在这个项目中，我们将专注于以下目标：（1）分类器学习：学习使用标记和未标记的图像对可见和新类别进行分类。(2)类别数量估计：类别数估计算法应该能够有效地运行，并与分类器学习过程同时执行。(3)解释发现的类别：可解释的模型输出，使用户知道为什么一些图像被预测为形成一个新的类别。为了为这种新的开放世界半监督学习环境设计新颖的方法，我们的目标是从经典的无监督聚类方法和尖端的深度学习方法中汲取灵感：（1）层次聚类，这些方法具有自动合并数据点以自适应地形成类别的层次的优点，通过使用神经网络学习相似性测量函数。我们可以利用它来执行层次聚类，并能够学习分类器，同时估计新类别的数量。(2)利用最先进的视觉语言模型，我们还旨在利用视觉语言学习的最新进展来设计自动生成文本解释的方法，以解释“为什么”模型认为一组示例是新颖的，以及新类别与人类标记数据集的“见过”类别“如何不同。我们将把这些方法应用于真实的-世界任务是更好地了解当前的实际挑战，这将为开发更强大的模型提供信息。