CAREER: Utilizing Fine-Grained Knowledge Annotations in Text Understanding and Retrieval

职业:在文本理解和检索中利用细粒度知识注释

基本信息

  • 批准号:
    1846017
  • 负责人:
  • 金额:
    $ 55万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2019
  • 资助国家:
    美国
  • 起止时间:
    2019-01-15 至 2024-12-31
  • 项目状态:
    已结题

项目摘要

All members of our information society are constantly in need for quick access to knowledge for work, education, and personal interests. Often the necessary knowledge cannot be found on a single web page or article but deserves a longer, query-specific complex answer. Examples are questions about causes of political events, about nutritional benefits of chocolate, or about the economic viability of wind energy. However, today's search engines that merely provide a list of multiple sources, but leave users on their own to synthesize sources into knowledge. This project develops novel algorithms that distill relevant key concepts, text passages, and relational information to provide users with a single summary of comprehensive information. The summary is structured into different sections, each covering a different facet of a complex topic. The focus of this project is to identify the relevant facts and connections that will enable users to form their own opinions and make strategic decisions. Embedded in a self-directed-learning environment, it allows users to learn about new topics at their own pace. Integrated educational activities will make use of the software to inspire STEM-interest among middle school students and undergrads of other disciplines.Today's algorithms fall short to identify relevant resources for complex topics, as these require knowledge about the world. Utilizing knowledge graphs for information retrieval has led to several important advances, such as the Entity Query Feature Expansion model (EQFE). This project takes a novel holistic approach towards developing representations of knowledge in a knowledge graph, corresponding text annotation algorithms, and retrieval algorithms that work hand-in-hand. The project focuses on three thrusts: (1) Entity aspect linking, which determines the topical context of entity mentions to facilitate a high-precision paragraph ranking. (2) Utilizing relation extractions for open domain information retrieval, where the presence of many non-relevant relations is explicitly addressed. (3) Selecting the query-specific subgraph of the knowledge graph that is suitable to identify relevant entities through long-range dependencies while avoiding concept drift. All three thrusts lead to fine-grained machine-understanding of relevant connections and aspects of entities, aligned through supporting passages that provide provenance for their relevance. The impact of all three thrusts extends beyond information retrieval: many applications that build on entity links will gain topical precision from entity aspect links. Most technology that utilizes relations as-is, will be improved when relational information is considered in context. Any method that extracts information from the knowledge graph structure will perform better when spurious edges are eliminated. Overall, this research effort will lay the foundation for identifying query-relevant complex information in natural large collections.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
我们信息社会的所有成员都不断地需要快速获取知识,以用于工作、教育和个人兴趣。通常,在一个网页或一篇文章中无法找到必要的知识,但需要一个更长的、针对具体问题的复杂答案。例如,政治事件的起因、巧克力的营养价值或风能的经济可行性等问题。然而,今天的搜索引擎,仅仅提供了一个列表的多个来源,但离开用户自己的综合来源的知识。该项目开发了新的算法,提取相关的关键概念,文本段落和关系信息,为用户提供综合信息的单一摘要。摘要分为不同的部分,每个部分涵盖一个复杂主题的不同方面。该项目的重点是确定相关的事实和联系,使用户能够形成自己的意见,并作出战略决策。嵌入在一个自我导向的学习环境中,它允许用户以自己的节奏学习新的主题。综合教育活动将利用该软件激发中学生和其他学科的本科生对STEM的兴趣。目前的算法不足以识别复杂主题的相关资源,因为这些主题需要有关世界的知识。利用知识图进行信息检索已经取得了一些重要的进展,例如实体查询特征扩展模型(EQFE)。该项目采用一种新颖的整体方法来开发知识图中的知识表示、相应的文本注释算法和协同工作的检索算法。该项目侧重于三个方面:(1)实体方面链接,确定实体提及的主题上下文,以促进高精度的段落排名。(2)利用关系抽取的开放领域的信息检索,其中存在许多不相关的关系是明确解决。(3)选择知识图的查询特定子图,其适于通过长距离依赖性来识别相关实体,同时避免概念漂移。所有这三个方面都导致了对实体的相关连接和方面的细粒度机器理解,通过为其相关性提供出处的支持段落进行对齐。所有这三个方面的影响都超出了信息检索的范围:许多建立在实体链接上的应用程序将从实体方面链接中获得主题精度。当在上下文中考虑关系信息时,大多数利用关系的技术都将得到改进。任何从知识图结构中提取信息的方法在消除虚假边缘时都会表现得更好。总的来说,这项研究工作将奠定基础,确定查询相关的复杂信息在自然大集合。这个奖项反映了NSF的法定使命,并已被认为是值得通过评估使用基金会的智力价值和更广泛的影响审查标准的支持。

项目成果

期刊论文数量(19)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A Large Test Collection for Entity Aspect Linking
实体方面链接的大型测试集合
Wikimarks: Harvesting Relevance Benchmarks from Wikipedia
Predicting Guiding Entities for Entity Aspect Linking
Neural Entity Context Models
神经实体上下文模型
  • DOI:
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Oza, Pooja;Chatterjee, Shubham;Dietz, Laura
  • 通讯作者:
    Dietz, Laura
ECIR 23 Tutorial: Neuro-Symbolic Approaches for Information Retrieval
  • DOI:
    10.1007/978-3-031-28241-6_33
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Laura Dietz;Hannah Bast;Shubham Chatterjee;Jeffrey Dalton;E. Meij;A. D. Vries
  • 通讯作者:
    Laura Dietz;Hannah Bast;Shubham Chatterjee;Jeffrey Dalton;E. Meij;A. D. Vries
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Laura Dietz其他文献

The Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, Analysis, and Understanding (KG4IR)
第二届文本检索、分析和理解的知识图谱和语义学研讨会(KG4IR)
Why does this Entity matter?: Support Passage Retrieval for Entity Retrieval
为什么这个实体很重要?:支持实体检索的段落检索
TREC CAR Y3: Complex Answer Retrieval Overview
TREC CAR Y3:复杂答案检索概述
  • DOI:
  • 发表时间:
    2019
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Laura Dietz
  • 通讯作者:
    Laura Dietz
EXAM++: LLM-based Answerability Metrics for IR Evaluation
考试:基于法学硕士的 IR 评估应答性指标
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Naghmeh Farzi;Laura Dietz
  • 通讯作者:
    Laura Dietz
Building Entity-Centric Event Collections For Supporting Research in Political and Social History
构建以实体为中心的事件集合以支持政治和社会史研究
  • DOI:
  • 发表时间:
    2017
  • 期刊:
  • 影响因子:
    0
  • 作者:
    F. Nanni;N. Marinov;Simone Paolo Ponzetto;Laura Dietz
  • 通讯作者:
    Laura Dietz

Laura Dietz的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

相似海外基金

Study on EDM utilizing Ultra Fine Bubble
利用超细气泡电火花加工的研究
  • 批准号:
    19K04128
  • 财政年份:
    2019
  • 资助金额:
    $ 55万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Development of heat-resistant magnesium alloys with superior strength by utilizing fine lamellar microstructure
利用精细层状显微组织开发具有优异强度的耐热镁合金
  • 批准号:
    22560695
  • 财政年份:
    2010
  • 资助金额:
    $ 55万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The development of a program for fine art education, utilizing walls of old folk houses-An establishment of the technique for fresco on earthen walls
利用古民居墙体进行美术教育的开发——土墙壁画技术的建立
  • 批准号:
    20520109
  • 财政年份:
    2008
  • 资助金额:
    $ 55万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Geosynthetic reinforced walls and embankments utilizing fine-grained soils in cold regions
寒冷地区细粒土土工合成材料加筋墙及路堤
  • 批准号:
    238313-2001
  • 财政年份:
    2003
  • 资助金额:
    $ 55万
  • 项目类别:
    Discovery Grants Program - Individual
Geosynthetic reinforced walls and embankments utilizing fine-grained soils in cold regions
寒冷地区细粒土土工合成材料加筋墙及路堤
  • 批准号:
    238313-2001
  • 财政年份:
    2002
  • 资助金额:
    $ 55万
  • 项目类别:
    Discovery Grants Program - Individual
Geosynthetic reinforced walls and embankments utilizing fine-grained soils in cold regions
寒冷地区细粒土土工合成材料加筋墙及路堤
  • 批准号:
    238313-2001
  • 财政年份:
    2001
  • 资助金额:
    $ 55万
  • 项目类别:
    Discovery Grants Program - Individual
Geosynthetic reinforced walls and embankments utilizing fine-grained soils in cold regions
寒冷地区细粒土土工合成材料加筋墙及路堤
  • 批准号:
    238313-2001
  • 财政年份:
    2000
  • 资助金额:
    $ 55万
  • 项目类别:
    Discovery Grants Program - Individual
Synthesis of Carbon hcp Co fine particle encaged in Carbon nano capsules utilizing laser ablation
利用激光烧蚀合成包裹在碳纳米胶囊中的碳 hcp Co 细颗粒
  • 批准号:
    10650009
  • 财政年份:
    1998
  • 资助金额:
    $ 55万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Development for New Method of Preparation of Fine particles utilizing Splashing phenomena in Molten Metal with Ammonia gas.
利用氨气在熔融金属中的飞溅现象制备细颗粒的新方法的开发。
  • 批准号:
    09650807
  • 财政年份:
    1997
  • 资助金额:
    $ 55万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
PRIMARY STUDY OF CREATION OF FINE METALLIC PARTICLES DISPERSED ON METAL OXIDES BY UTILIZING PROPERTIES OF NON-EQUILIBRIUM ALLOYS
利用非平衡合金的性质制造分散在金属氧化物上的细小金属颗粒的初步研究
  • 批准号:
    06402051
  • 财政年份:
    1994
  • 资助金额:
    $ 55万
  • 项目类别:
    Grant-in-Aid for Scientific Research (A)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了