
A Unified Model of Compositional and Distributional Semantics: Theory and Applications


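Basic information

Grant number:
EP/I037512/1
Principal investigator:
Stephen Clark
Amount:
$440,100
Host institution:
University of Cambridge
Host institution country:
United Kingdom
Project category:
Research Grant
Fiscal year:
2012
Funding country:
United Kingdom
Start and end dates:
2012 to (no data)
Project status:
Completed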

Source:
https://gtr.ukri.org/projects?ref=EP%2FI037512%2F1
Keywords:
Unified Model Compositional Distributional Semantics

Project abstract

The notion of meaning is central to many areas of Computer Science, Artificial Intelligence (AI), Linguistics, Philosophy, and Cognitive Science. A formal, mathematical account of the meaning of natural language utterances is crucial to AI, since an understanding of natural language (i.e. languages such as English, German, and Chinese) is at the heart of much intelligent behaviour. More specifically, Natural Language Processing (NLP), the branch of AI concerned with the computer processing, analysis and generation of text, requires a model of meaning for many of its tasks and applications.

There have been two main approaches to modelling the meaning of language in NLP, in order that a computer can gain some "understanding" of the text. The first, the so-called compositional approach, is based on classical ideas from Philosophy and Mathematical Logic. Using a well-known principle from the 19th century logician Frege (that the meaning of a phrase can be determined from the meanings of its parts and how those parts are combined), logicians have developed formal accounts of how the meaning of a sentence can be determined from the relations of words in a sentence. This idea culminated famously in Linguistics in the work of Richard Montague in the 1970s. The compositional approach addresses a fundamental problem in Linguistics: how it is that humans are able to generate an unlimited number of sentences using a limited vocabulary. We would like computers to have a similar capacity.

The second, more recent, approach to modelling meaning in NLP focuses on the meanings of the words themselves. This is the so-called distributional approach to modelling word meanings and is based on the ideas of the "structural" linguists such as Firth from the 1950s. This idea is also sometimes related to Wittgenstein's philosophy of "meaning as use". The idea is that the meanings of words can be determined by considering the contexts in which words appear in text. For example, if we take a large amount of text and see which words appear close to the word "dog", and do a similar thing for the word "cat", we will see that the contexts of "dog" and "cat" tend to share many words (such as walk, run, furry, pet, and so on). If we instead see which words appear in the context of the word "television", we will find less overlap with the contexts for "dog". Mathematically we represent the contexts in a vector space, so that word meanings occupy positions in a geometrical space. We would expect to find that "dog" and "cat" are much closer in the space than "dog" and "television", indicating that "dog" and "cat" are closer in meaning than "dog" and "television".

The two approaches to meaning can be roughly characterized as follows: the compositional approach is concerned with how meanings combine, but has little to say about the individual meanings of words; the distributional approach is concerned with word meanings, but has little to say about how those meanings combine. Our ambitious proposal is to exploit the strengths of the two approaches by developing a unified model of distributional and compositional semantics. Our proposal has a central theoretical component, drawing on models of semantics from Theoretical Computer Science and Mathematical Logic. This central component will inform, be driven by, and be evaluated on tasks and applications in NLP and Information Retrieval, as well as data drawn from empirical studies in Cognitive Science (the computational study of the mind). Hence we aim to make the following fundamental contributions:

1. advance the theoretical study of meaning in Linguistics, Computer Science and Artificial Intelligence;
2. develop new meaning-sensitive approaches to NLP applications which can be robustly applied to naturally occurring text.
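
The distributional idea described in the abstract (comparing the contexts of "dog", "cat", and "television" in a vector space) can be illustrated with a minimal Python sketch. This is not the project's method: the toy corpus, the window size of 2, the raw co-occurrence counts, and the helper names `context_vector` and `cosine` are all assumptions made here for illustration.

```python
import math
from collections import Counter

# Toy corpus invented for this illustration; not data from the project.
CORPUS = [
    "the furry dog likes to run",
    "my pet dog likes a walk",
    "the furry cat likes to run",
    "my pet cat likes to sleep",
    "the television shows the news",
]


def context_vector(target, sentences, window=2):
    """Count the words that occur within `window` tokens of `target`.

    The resulting Counter is a sparse vector: one dimension per context word.
    """
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        for i, token in enumerate(tokens):
            if token != target:
                continue
            start = max(0, i - window)
            end = min(len(tokens), i + window + 1)
            for j in range(start, end):
                if j != i:
                    counts[tokens[j]] += 1
    return counts


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(u[word] * v[word] for word in u if word in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


dog = context_vector("dog", CORPUS)
cat = context_vector("cat", CORPUS)
tv = context_vector("television", CORPUS)

print("dog vs cat:       ", round(cosine(dog, cat), 3))
print("dog vs television:", round(cosine(dog, tv), 3))
```

On this toy corpus, "dog" and "cat" share most of their context words, so their similarity comes out higher than that of "dog" and "television". A realistic system would use a far larger corpus and typically some reweighting of the raw counts; the sketch only shows the geometric intuition.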
