EAGER: Integrating Dense Paraphrased-Enriched Representations with Large Language Models

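Basic information

  • Award number:
    2326985
  • Principal investigator:
  • Amount:
    $150,000
  • Host institution:
  • Host institution country:
    United States
  • Project type:
    Standard Grant
  • Fiscal year:
    2023
  • Funding country:
    United States
  • Project period:
    2023-06-01 to 2024-05-31
  • Project status:
    Completed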

Project abstract

Current large language models like ChatGPT have captured attention with their impressive ability to answer a broad range of user questions. But these models can still be confused by content that is not explicitly mentioned in written text. For example, while a recipe may say “cut the onions”, it does not say that one ends up with onion pieces or that a knife was used. This natural economy of language often makes life hard for current language processing tools. This project develops an approach called Dense Paraphrasing that enriches the surface language with paraphrases that reveal the deeper relations between words in a body of text, providing information that current models need to make better inferences and judgments from the text.

In this project, we develop a model, Dense Paraphrasing, for text enrichment based on how a meaning can be associated with many different syntactic patterns in language. The project develops the process of integrating Dense Paraphrasing with Large Language Models, followed by an evaluation of the resulting system. This involves four subtasks: (1) creation of a dataset with rich annotation that makes explicit the hidden semantic relations in the text; (2) development of baseline models for Dense Paraphrasing text generation systems, which translate the source text into the “dense” text form, by fine-tuning Large Language Models; (3) exploration of Dense Paraphrasing in downstream NLP tasks, applied to Question Answering; and (4) application to a coreference resolution problem involving objects as they move or change through a sequence of events. The ultimate goal of the project is to build a novel framework that combines advanced Large Language Models with deeper semantics and to integrate it into downstream applications to advance the state of the art in NLP, making the systems more generally useful for real-world problems.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
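
The recipe example in the abstract can be made concrete with a toy sketch. This is not the project's model: the abstract describes fine-tuning Large Language Models to produce the “dense” text, whereas the sketch below substitutes a small hand-written lookup table purely for illustration. The names `ImplicitInfo`, `IMPLICIT_KNOWLEDGE`, and `dense_paraphrase` are hypothetical and do not come from the project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ImplicitInfo:
    """Unstated participants of an event (illustrative fields only)."""
    instrument: str  # what the action is typically performed with
    result: str      # what the object becomes after the action


# Hypothetical hand-written knowledge; a real system would learn this
# (the abstract mentions fine-tuned language models for that step).
IMPLICIT_KNOWLEDGE = {
    ("cut", "onions"): ImplicitInfo(instrument="a knife", result="onion pieces"),
}


def dense_paraphrase(verb: str, obj: str) -> str:
    """Return the surface instruction enriched with implicit information.

    Falls back to the plain instruction when nothing is known about the event.
    """
    surface = f"{verb} the {obj}"
    info = IMPLICIT_KNOWLEDGE.get((verb, obj))
    if info is None:
        return surface
    return f"{surface} with {info.instrument}, resulting in {info.result}"


if __name__ == "__main__":
    print(dense_paraphrase("cut", "onions"))
    print(dense_paraphrase("boil", "water"))
```

The enriched string names the knife and the onion pieces explicitly, which is the kind of information a downstream question answering or coreference system could then draw on.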

Project outputs

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by James Pustejovsky

Introduction to Special Issue on Advances in Question Answering
  • DOI:
    10.1007/s10579-005-7883-6
  • Publication date:
    2006-02-28
  • Journal:
  • Impact factor:
    1.800
  • Authors:
    James Pustejovsky; Janyce Wiebe
  • Corresponding author:
    Janyce Wiebe
Situated UMR for Multimodal Interactions
  • DOI:
  • Publication date:
  • Journal:
  • Impact factor:
    0
  • Authors:
    Kenneth Lai; R. Brutti; Lucia Donatelli; James Pustejovsky
  • Corresponding author:
    James Pustejovsky
Scalar Anaphora: Annotating Degrees of Coreference in Text
Integrated Annotation of Event Structure, Object States, and Entity Coreference
Encoding Gesture in Multimodal Dialogue: Creating a Corpus of Multimodal AMR

Other grants by James Pustejovsky

Elements: Towards a Robust Cyberinfrastructure for NLP-based Search and Discoverability over Scientific Literature
  • Award number:
    2104025
  • Fiscal year:
    2021
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Travel Support for North American Summer School for Logic, Language, and Information (NASSLLI)
  • Award number:
    2002141
  • Fiscal year:
    2020
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Collaborative Research: NSF2026: EAGER: A Playground and Proposal for Growing an AGI
  • Award number:
    2033932
  • Fiscal year:
    2020
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
EAGER: Collaborative Research: Mining Scientific Literature with the LAPPS Grid
  • Award number:
    1811402
  • Fiscal year:
    2018
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Workshop: The International Linguistics Olympiad
  • Award number:
    1632453
  • Fiscal year:
    2016
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Workshop: The International Linguistics Olympiad in Blagoevgrad, Bulgaria: July 20-24, 2015.
  • Award number:
    1547270
  • Fiscal year:
    2015
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Workshop: The International Linguistics Olympiad
  • Award number:
    1442079
  • Fiscal year:
    2014
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Outstanding Student Research at GL2013
  • Award number:
    1348830
  • Fiscal year:
    2013
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
SI2-SSI: The Language Application Grid: A Framework for Rapid Adaptation and Reuse
  • Award number:
    1147912
  • Fiscal year:
    2012
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
RI: Small: Interpreting Linguistic Spatiotemporal Relations in Static and Dynamic Contexts
  • Award number:
    1017765
  • Fiscal year:
    2010
  • Award amount:
    $150,000
  • Project type:
    Standard Grant

Similar overseas grants

Challenging Health Outcomes/Integrating Care Environments Ph3: A Community Consortium to Tackle Health Disparity for People Living with Mental Illness
  • Award number:
    AH/Z505420/1
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Research Grant
Evaluating the effectiveness and sustainability of integrating helminth control with seasonal malaria chemoprevention in West African children
  • Award number:
    MR/X023133/1
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Fellowship
Integrating metabolic signals through FOXO transcriptional complexes.
  • Award number:
    BB/X000265/1
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Research Grant
Collaborative Research: BoCP-Implementation: Alpine plants as a model system for biodiversity dynamics in a warming world: Integrating genetic, functional, and community approaches
  • Award number:
    2326020
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Continuing Grant
Collaborative Research: BoCP-Implementation: Alpine plants as a model system for biodiversity dynamics in a warming world: Integrating genetic, functional, and community approaches
  • Award number:
    2326021
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Integrating Self-Regulated Learning Into STEM Courses: Maximizing Learning Outcomes With The Success Through Self-Regulated Learning Framework
  • Award number:
    2337176
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
CAREER: Hybridization and radiation: Integrating across phylogenomics, ancestral niche evolution, and pollination biology
  • Award number:
    2337784
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Continuing Grant
EAGER: Integrating Pathological Image and Biomedical Text Data for Clinical Outcome Prediction
  • Award number:
    2412195
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
Integrating Signals in Iron Homeostasis
  • Award number:
    2343917
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Standard Grant
FDSS Track 1: Integrating Research and Education in Magnetosphere-Ionosphere-Atmosphere Coupling at Clemson University
  • Award number:
    2347149
  • Fiscal year:
    2024
  • Award amount:
    $150,000
  • Project type:
    Continuing Grant