Text-to-Text Generation for Summarizing Informal Genres

用于总结非正式流派的文本到文本生成

基本信息

  • 批准号:
    0534871
  • 负责人:
  • 金额:
    --
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2006
  • 资助国家:
    美国
  • 起止时间:
    2006-01-01 至 2010-12-31
  • 项目状态:
    已结题

项目摘要

This project aims at the generation of coherent and on-target summaries and answers through the use of text-to-text generation, an approach which generates new sentences from the input text, fusing relevant phrases and discarding irrelevant ones. A syntactic,statistical framework for text-to-text generation is being developed that can be applied to informal genres, such as transcribed speech and email, where sentences are not guaranteed to be either complete or grammatical. It is exactly these genres that stand to benefit the most from this approach; for them, summarization using sentence extractionalone is not an option.The aim is a fully developed, syntactic statistical framework for text-to-text generation which features the use of a full syntactic grammar within a statistical framework for compression and combination, a model for incorporating constraints from pragmatics andsemantics into the generation system, the ability to produce fluent, grammatical sentences from fragmentary and ungrammatical input, and the ability to generate sentences that make high level abstractions from input document sentences.The project features the integration of compression and language models into a lexicalized head-driven framework, enabling the generator to keep the sentence grammatical and avoid wording changes that dramatically alter meaning. Its framework can incorporate an arbitrary number of features beyond syntax that are important forsummarization. A new dynamic programming technique allows the automatic extraction of large amounts of training data from a summary/document corpus. Information about who speaks to whom and paraphrasing rules will increase the range of revisions that can be addressed.
该项目旨在通过使用文本到文本生成来生成连贯和目标摘要和答案,这种方法从输入文本生成新句子,融合相关短语并丢弃不相关短语。 一个用于文本到文本生成的句法统计框架正在开发中,该框架可以应用于非正式的体裁,例如转录的语音和电子邮件,其中句子不 保证是完整的或合乎语法的。正是这些流派从这种方法中受益最多;对他们来说,仅仅使用句子提取的摘要不是一种选择。其目标是一个完全开发的,用于文本到文本生成的句法统计框架,其特征是在用于压缩和组合的统计框架内使用完整的句法语法,一个将语用和语义的约束纳入生成系统的模型,能够从零碎和不合语法的输入中生成流畅、合乎语法的句子,并能够从输入的文档句子中生成高级抽象的句子。该项目的特点是将压缩和语言模型集成到词汇化的头部驱动框架中,使生成器能够保持句子的语法性,并避免显著改变含义的措辞变化。它的框架可以包含任意数量的语法之外的功能,这些功能对摘要很重要。一种新的动态编程技术允许从摘要/文档语料库中自动提取大量的训练数据。 关于谁对谁说话的信息和解释规则将增加可以解决的修订范围。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Kathleen McKeown其他文献

Detecting Grief Online Among Black Harlem Residents
在哈莱姆区黑人居民中在线检测悲伤情绪
  • DOI:
    10.1016/j.biopsych.2025.02.119
  • 发表时间:
    2025-05-01
  • 期刊:
  • 影响因子:
    9.000
  • 作者:
    Desmond Patton;Shana Kleiner;Shug Miller;Nick Deas;Jessie Grieser;James Shepherd;Elsbeth Turcan;Kathleen McKeown
  • 通讯作者:
    Kathleen McKeown
Cross-Document Temporal Relation Extraction with Temporal Anchoring Events
使用时间锚定事件进行跨文档时间关系提取
  • DOI:
  • 发表时间:
    2021
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Miguel Ballesteros;Rishita Anubhai;Shuai Wang;minder Bhatia;Kathleen McKeown;Yaser Al;Iz Beltagy;Matthew E. Peters;Arman Cohan;Steven Bethard;James H. Martin;Sara Klingenstein;Taylor Cassidy;Bill McDowell;Nathanael Chambers;Danqi Chen;Adam Fisch;Jason Weston;Anne;Manuela Speranza;Eneko Agirre;N. Mostafazadeh;Alyson Grealish
  • 通讯作者:
    Alyson Grealish
Sources of Hallucination by Large Language Models on Inference Tasks Anonymous EMNLP
推理任务中大型语言模型的幻觉来源 Anonymous EMNLP
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Nick McKenna;Mark Steedman. 2022;Smooth;Todor Mihaylov;Peter Clark;Tushar Khot;Dat Ba Nguyen;Johannes Hoffart;Martin Theobald;Ouyang Long;Jeff Wu;Xu Jiang;Car;L. Wainwright;Pamela Mishkin;Chong Zhang;Paul Christiano;J. Leike;Ryan Lowe;Adam Poliak;Jason Naradowsky;Aparajita Haldar;Rachel Rudinger;Benjamin Van;Eleanor Rosch;Carolyn B. Mervis;Wayne D Gray;David M Johnson;P. Boyes;Krishna Srinivasan;K. Raman;Anupam Samanta;Lingrui Liao;Luca Bertelli;Rohan Taori;Ishaan Gulrajani;Tianyi Zhang;Yann Dubois;Xuechen Li;Carlos Guestrin;Percy Liang;Tatsunori Hashimoto;Stan;Kushal Tirumala;A. Markosyan;Luke Zettlemoyer;Hugo Touvron;Thibaut Lavril;Gautier Izacard;Xavier Martinet;Marie;Timothée Lacroix;Baptiste Rozière;Naman Goyal;Eric Hambro;Faisal Azhar;Aurelien Rodriguez;Armand Joulin;Jason Wei;Xuezhi Wang;D. Schuurmans;Maarten Bosma;Brian Ichter;Fei Xia;E. Chi;V. Quoc;Le;Denny Zhou. 2022;Orion Weller;Marc Marone;Nathaniel Weir;Dawn Lawrie;Daniel Khashabi;Faisal Ladhak;Esin Durmus;Kathleen McKeown
  • 通讯作者:
    Kathleen McKeown
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
TinyStyler:通过作者嵌入进行高效的少量文本样式传输
  • DOI:
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Zachary Horvitz;Ajay Patel;Kanishk Singh;Christopher Callison;Kathleen McKeown;Zhou Yu
  • 通讯作者:
    Zhou Yu

Kathleen McKeown的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Kathleen McKeown', 18)}}的其他基金

RI: Medium: Automatically Understanding and Identifying Digital Expression of Black Grief
RI:媒介:自动理解和识别黑人悲伤的数字表达
  • 批准号:
    2106666
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
RI: Small: Describing Disasters and the Ensuing Personal Toll
RI:小:描述灾难和随之而来的个人损失
  • 批准号:
    1422863
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
EAGER: Corpus-Based Narrative Semantics
EAGER:基于语料库的叙事语义
  • 批准号:
    0935360
  • 财政年份:
    2009
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
RI: Large: Collaborative Research: Richer Representations for Machine Translation
RI:大型:协作研究:更丰富的机器翻译表示
  • 批准号:
    0910778
  • 财政年份:
    2009
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
ITR: Collaborative Research: Interlingual Annotation of Multilingual Text Corporation
ITR:协作研究:多语言文本公司的语际注释
  • 批准号:
    0325887
  • 财政年份:
    2003
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
DLI-Phase 2: A Patient Care Digital Library: Personalized Retrieval Summarization of Multimedia Information
DLI-阶段 2:患者护理数字图书馆:多媒体信息的个性化检索摘要
  • 批准号:
    9817434
  • 财政年份:
    1999
  • 资助金额:
    --
  • 项目类别:
    Cooperative Agreement
STIMULATE: An Environment for Illustrated Briefing and Follow-up Search Over Live Multimedia Information
STIMULATE:通过实时多媒体信息进行图解简报和后续搜索的环境
  • 批准号:
    9619124
  • 财政年份:
    1997
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
STIMULATE: Generating Coherent Summaries of On-Line Documents: Combining Statistical and Symbolic Techniques
刺激:生成在线文档的连贯摘要:结合统计和符号技术
  • 批准号:
    9618797
  • 财政年份:
    1997
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CARD: Corpus Analysis Resources for Discourse
CARD:话语语料库分析资源
  • 批准号:
    9528998
  • 财政年份:
    1996
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
CISE Research Infrastructure: Scalable Multimedia Information Processing
CISE 研究基础设施:可扩展多媒体信息处理
  • 批准号:
    9625374
  • 财政年份:
    1996
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant

相似国自然基金

Next Generation Majorana Nanowire Hybrids
  • 批准号:
  • 批准年份:
    2020
  • 资助金额:
    20 万元
  • 项目类别:

相似海外基金

CAREER: Real-Time First-Principles Approach to Understanding Many-Body Effects on High Harmonic Generation in Solids
职业:实时第一性原理方法来理解固体高次谐波产生的多体效应
  • 批准号:
    2337987
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
Collaborative Research: Constraining next generation Cascadia earthquake and tsunami hazard scenarios through integration of high-resolution field data and geophysical models
合作研究:通过集成高分辨率现场数据和地球物理模型来限制下一代卡斯卡迪亚地震和海啸灾害情景
  • 批准号:
    2325311
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
RII Track-4:NSF: In-Situ/Operando Characterizations of Single Atom Catalysts for Clean Fuel Generation
RII Track-4:NSF:用于清洁燃料生成的单原子催化剂的原位/操作表征
  • 批准号:
    2327349
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
ERI: Non-Contact Ultrasound Generation and Detection for Tissue Functional Imaging and Biomechanical Characterization
ERI:用于组织功能成像和生物力学表征的非接触式超声波生成和检测
  • 批准号:
    2347575
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
SBIR Phase II: Thermally-optimized power amplifiers for next-generation telecommunication and radar
SBIR 第二阶段:用于下一代电信和雷达的热优化功率放大器
  • 批准号:
    2335504
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Cooperative Agreement
CAREER: Next-generation Logic, Memory, and Agile Microwave Devices Enabled by Spin Phenomena in Emergent Quantum Materials
职业:由新兴量子材料中的自旋现象实现的下一代逻辑、存储器和敏捷微波器件
  • 批准号:
    2339723
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
CAREER: Securing Next-Generation Transportation Infrastructure: A Traffic Engineering Perspective
职业:保护下一代交通基础设施:交通工程视角
  • 批准号:
    2339753
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CAREER: Ultralow phase noise signal generation using Kerr-microresonator optical frequency combs
职业:使用克尔微谐振器光学频率梳生成超低相位噪声信号
  • 批准号:
    2340973
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Continuing Grant
Next-Generation Distributed Graph Engine for Big Graphs
适用于大图的下一代分布式图引擎
  • 批准号:
    DP240101322
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Discovery Projects
Next Generation Fluorescent Tools for Measuring Autophagy Dynamics in Cells
用于测量细胞自噬动态的下一代荧光工具
  • 批准号:
    DP240100465
  • 财政年份:
    2024
  • 资助金额:
    --
  • 项目类别:
    Discovery Projects
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了