CAREER: Discourse Level Event-Event Relation Identification
职业:话语层面事件-事件关系识别
基本信息
- 批准号:1942918
- 负责人:
- 金额:$ 55万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-02-01 至 2025-01-31
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Understanding events (protests, elections, disease outbreaks, natural disasters) from natural language text is key to important analytic tasks like predicting future events, detecting fake news and other attempts to validate events, managing extreme events, answering complex questions and generating concise text summaries for analysis. Existing event extraction systems focus on identifying isolated events, but have rarely considered relations between events. Consequently, the extracted events are merely facts describing who did what, but it is hard to interpret how and why those events happened. Indeed, events tend to be described in a complex relationship with other events, for example, news articles are incomplete if they report an assassination event without mentioning how the event was conducted, or if they describe a protest event without information on why it was launched. This Faculty Early Career Development project aims to generate document-level event graphs that capture rich relations between events mentioned anywhere in a document, which will enable us to contextualize events, transform event extraction from simply extracting individual event facts to extracting informative context-rich event interpretations, and better support various event-oriented applications. The project will integrate research with education, train and prepare future researchers with advanced information extraction views and methods, as well as expose a large number of diverse undergraduate students and high school students to computer science and natural language processing research with a focus on significantly broadening participation of minorities and underrepresented groups. Building document-level event graphs requires identifying relations between two events even when they are sentences away, which presents multiple technical challenges. This project will lay the foundation for discourse-aware event-event relation identification, and study correlations between event-event relations and different dimensions of discourse structures. The research is motivated by the observation that events are major materials in forming a cohesive story and the presence of events is tightly correlated with the overall discourse structure of a document. The project develops both supervised and unsupervised learning methods to build effective discourse level event-event relation recognizers. Specifically, the project develops discourse guided approaches to identify two important types of event-event relations, coreference and temporal ordering, which are fundamental for building meaningful event graphs. Then, guided by event discourse correlations obtained via supervised learning, unsupervised learning methods are developed that can effectively make use of large volumes of unlabeled data, deal with lexical diversity issues and improve robustness of systems for event-event relation identification.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
从自然语言文本中理解事件(抗议、选举、疾病爆发、自然灾害)是重要分析任务的关键,例如预测未来事件、检测假新闻和其他验证事件的尝试、管理极端事件、回答复杂问题以及生成用于分析的简明文本摘要。现有的事件提取系统侧重于识别孤立事件,但很少考虑事件之间的关系。因此,提取出来的事件仅仅是描述谁做了什么事情的事实,但很难解释这些事件是如何发生的以及为什么发生的。事实上,对事件的描述往往与其他事件有着复杂的关系,例如,如果新闻报道了暗杀事件而没有提及该事件是如何进行的,或者如果它们描述了抗议事件而没有说明其发起原因,则新闻文章是不完整的。这个教师早期职业发展项目旨在生成文档级事件图,捕获文档中任何地方提到的事件之间的丰富关系,这将使我们能够将事件上下文化,将事件提取从简单地提取单个事件事实转换为提取信息丰富的上下文事件解释,并更好地支持各种面向事件的应用程序。该项目将把研究与教育结合起来,用先进的信息提取观点和方法培训和培养未来的研究人员,并让大量不同的本科生和高中生接触计算机科学和自然语言处理研究,重点是显著扩大少数民族和代表性不足群体的参与。构建文档级事件图需要识别两个事件之间的关系,即使它们相隔几个句子,这就提出了多个技术挑战。本项目将为话语感知事件-事件关系识别奠定基础,研究事件-事件关系与话语结构不同维度之间的相关性。本研究的动机是观察到事件是形成连贯故事的主要材料,事件的存在与文献的整体话语结构密切相关。该项目开发了监督和非监督学习方法来构建有效的话语级事件-事件关系识别器。具体来说,该项目开发了话语引导方法来识别两种重要类型的事件-事件关系,共参考和时间顺序,这是构建有意义的事件图的基础。然后,在通过监督学习获得的事件话语相关性的指导下,开发了无监督学习方法,该方法可以有效地利用大量未标记数据,处理词汇多样性问题并提高系统对事件-事件关系识别的鲁棒性。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(13)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A Joint Model for Structure-based News Genre Classification with Application to Text Summarization
基于结构的新闻类型分类联合模型及其在文本摘要中的应用
- DOI:10.18653/v1/2021.findings-acl.295
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Dai, Zeyu;Huang, Ruihong
- 通讯作者:Huang, Ruihong
Profiling News Discourse Structure Using Explicit Subtopic Structures Guided Critics
- DOI:10.18653/v1/2021.findings-emnlp.137
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Prafulla Kumar Choubey;Ruihong Huang
- 通讯作者:Prafulla Kumar Choubey;Ruihong Huang
Sentence-level Media Bias Analysis Informed by Discourse Structures
- DOI:10.18653/v1/2022.emnlp-main.682
- 发表时间:2022
- 期刊:
- 影响因子:4.6
- 作者:Yuanyuan Lei;Ruihong Huang;Lu Wang;Nick Beauchamp
- 通讯作者:Yuanyuan Lei;Ruihong Huang;Lu Wang;Nick Beauchamp
Automatic Data Acquisition for Event Coreference Resolution
- DOI:10.18653/v1/2021.eacl-main.101
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Prafulla Kumar Choubey;Ruihong Huang
- 通讯作者:Prafulla Kumar Choubey;Ruihong Huang
One Classifier for All Ambiguous Words: Overcoming Data Sparsity by Utilizing Sense Correlations Across Words
- DOI:
- 发表时间:2020-05
- 期刊:
- 影响因子:0
- 作者:Prafulla Kumar Choubey;Ruihong Huang
- 通讯作者:Prafulla Kumar Choubey;Ruihong Huang
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Ruihong Huang其他文献
Simulating individual work trips for transit-facilitated accessibility study
模拟个人工作旅行以进行交通便利的可达性研究
- DOI:
10.1177/2399808317702148 - 发表时间:
2019 - 期刊:
- 影响因子:0
- 作者:
Ruihong Huang - 通讯作者:
Ruihong Huang
HYTREL: Hypergraph-enhanced Tabular Data Representation Learning
HYTREL:超图增强的表格数据表示学习
- DOI:
- 发表时间:
2023 - 期刊:
- 影响因子:0
- 作者:
Pei Chen;Soumajyoti Sarkar;Leonard Lausen;Balasubramaniam Srinivasan;Sheng Zha;Ruihong Huang;G. Karypis - 通讯作者:
G. Karypis
Comparison of methods for incomplete repeated measures data analysis in small samples
小样本不完全重复测量数据分析方法比较
- DOI:
- 发表时间:
2006 - 期刊:
- 影响因子:0
- 作者:
Ruihong Huang;K. Carriere - 通讯作者:
K. Carriere
Four essays on the econometric analysis of high-frequency order data
高频订单数据计量分析四篇论文
- DOI:
10.18452/16542 - 发表时间:
2012 - 期刊:
- 影响因子:1.6
- 作者:
Ruihong Huang - 通讯作者:
Ruihong Huang
Modeling transit networks by GML for distributed transit trip planners
- DOI:
10.1080/14498596.2008.9635131 - 发表时间:
2008-06 - 期刊:
- 影响因子:1.9
- 作者:
Ruihong Huang - 通讯作者:
Ruihong Huang
Ruihong Huang的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Ruihong Huang', 18)}}的其他基金
Collaborative Research: III: Small: Entity- and Event-driven Media Bias Detection
协作研究:III:小型:实体和事件驱动的媒体偏差检测
- 批准号:
2127746 - 财政年份:2021
- 资助金额:
$ 55万 - 项目类别:
Standard Grant
CRII: RI: Subevent Acquisition and Analysis
CRII:RI:子事件采集和分析
- 批准号:
1755943 - 财政年份:2018
- 资助金额:
$ 55万 - 项目类别:
Standard Grant
Workshop: Student Travel to the 2018 Abusive Language Online Conference
研讨会:学生参加 2018 年辱骂性语言在线会议
- 批准号:
1833638 - 财政年份:2018
- 资助金额:
$ 55万 - 项目类别:
Standard Grant
相似海外基金
Constructing Reading Comprehension Datasets to Evaluate Discourse-level Language Understanding
构建阅读理解数据集以评估话语级语言理解
- 批准号:
22K17954 - 财政年份:2022
- 资助金额:
$ 55万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
Exploring the relationship between auditory, written and multi-modality comprehension at discourse level.
探索话语层面的听觉、书面和多模态理解之间的关系。
- 批准号:
2405187 - 财政年份:2020
- 资助金额:
$ 55万 - 项目类别:
Studentship
Syntactic and discourse-level constraints in native and non-native pronoun resolution
母语和非母语代词解析中的句法和语篇层面的约束
- 批准号:
254826349 - 财政年份:2014
- 资助金额:
$ 55万 - 项目类别:
Priority Programmes
Toward a multi-modal and multi-level analysis of Chinese aphasic discourse
中文失语话语的多模态、多层次分析
- 批准号:
8274650 - 财政年份:2010
- 资助金额:
$ 55万 - 项目类别:
Toward a multi-modal and multi-level analysis of Chinese aphasic discourse
中文失语话语的多模态、多层次分析
- 批准号:
8469747 - 财政年份:2010
- 资助金额:
$ 55万 - 项目类别:
Toward a multi-modal and multi-level analysis of Chinese aphasic discourse
中文失语话语的多模态、多层次分析
- 批准号:
8058689 - 财政年份:2010
- 资助金额:
$ 55万 - 项目类别:
Autism and written narrative: discourse analysis and the characterisation of higher level language disorder phenotypes
自闭症和书面叙事:话语分析和高级语言障碍表型的表征
- 批准号:
DP0662936 - 财政年份:2006
- 资助金额:
$ 55万 - 项目类别:
Discovery Projects
Multi-Level Assessment for Enhancing Mathematical Discourse, Curriculum, and Achievement in Diverse Elementary School Classrooms.
提高多元化小学课堂数学话语、课程和成绩的多层次评估。
- 批准号:
0440261 - 财政年份:2005
- 资助金额:
$ 55万 - 项目类别:
Continuing Grant
Multi-Level Assessment for Enhancing Mathematical Discourse, Curriculum, and Achievement in Diverse Elementary School Classrooms.
提高多元化小学课堂数学话语、课程和成绩的多层次评估。
- 批准号:
0553072 - 财政年份:2005
- 资助金额:
$ 55万 - 项目类别:
Continuing Grant
The Influence of Discourse-Level Information on the Processing of Upcoming Words
话语级信息对即将出现的单词处理的影响
- 批准号:
8808453 - 财政年份:1988
- 资助金额:
$ 55万 - 项目类别:
Standard Grant