权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

CRII: III: Learning to Extract Events from Knowledge Base Revisions

CRII：III：学习从知识库修订中提取事件

基本信息

批准号：
1464128
负责人：
Alan Ritter
金额：
$ 15.13万
依托单位：
Ohio State University
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2015
资助国家：
美国
起止时间：
2015-09-01 至 2018-08-31
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=1464128&HistoricalAwards=false
关键词：
CRII III Learning Extract Events

项目摘要

Encyclopedic knowledge bases (KBs) such as Wikipedia and Freebase form the underlying intelligence behind Google's Knowledge Graph, Facebook's Graph Search, IBM's Watson and more. These broad-coverage databases contain facts about entities, for example a person's employer or a city's mayor. KBs should not simply be viewed as static snapshots, however, as we live in a constantly changing world. For example, an election event can change the Leader of a country, or a divorce/wedding can change the Spouse of a person. Today's knowledge bases rely on human editors to stay up-to-date; this works for prominent entities, such as celebrities or politicians, but manual editing will not scale to tracking the huge number of concepts covered by these massive KBs. The project will therefore investigate methods to continuously track real-time text streams, including news and social media, and automatically update concepts in a KB, as soon as new information becomes available. This will enable new kinds of intelligent systems that constantly read all the text that is publicly written each day, and maintain a detailed up-to-the-minute knowledge base describing the current state of the world. The expected results in weakly supervised information extraction techniques are expected to have a broad range of applications, including detecting cyber security events discussed on Twitter. The project will provide research training and educational experience for students at Ohio State University and beyond, as the research outcomes will be used in developing an open-source toolkit for weakly supervised information extraction that will be widely distributed. When important events occur, KB contributors often edit properties of affected entities in near-real-time, for instance on Wikipedia. At the same time, many people discuss these events on social media and in the news. Because the set of events that alter properties of KB entities is large and not fixed in advance, this project will investigate, implement and evaluate new models for learning text extractors from KB revisions. The project will conduct experiments learning extractors for news and Twitter using Wikipedia infobox edits as distant supervision. Rather than making the closed world assumption, which is common in previous work, the proposed methods will regularize the label distribution over events that do not match knowledge revisions towards a user-provided expectation. It is expected that the results of this research will help to address the problem of false positives due to events that are not reflected in the revision history. The approach's ability to automatically propose Wikipedia infobox edits in real-time will be tested as public knowledge of an event becomes available. Previous studies on weakly supervised event extraction have mostly been conducted in limited domains. In contrast, this work aims to scale up while simultaneously grounding events mentioned in text to revisions of an entity's properties in a knowledge base. The project web site (http://aritter.github.io/crii/) will include information on the project, links to publications, software and datasets produced as a result of this research.

百科知识库（KB），如维基百科和Freebase，构成了谷歌知识图谱、Facebook图谱搜索、IBM沃森等背后的潜在智能。这些覆盖面广的数据库包含有关实体的事实，例如一个人的雇主或一个城市的市长。知识库不应该被简单地看作是静态的快照，然而，因为我们生活在一个不断变化的世界。例如，选举事件可以改变一个国家的领导人，或者离婚/婚礼可以改变一个人的配偶。今天的知识库依赖于人类编辑器来保持最新;这适用于知名实体，如名人或政治家，但手动编辑无法跟踪这些大规模知识库所涵盖的大量概念。因此，该项目将研究持续跟踪实时文本流（包括新闻和社交媒体）的方法，并在新信息可用时自动更新知识库中的概念。这将使新型智能系统能够不断阅读每天公开撰写的所有文本，并维护一个详细的最新知识库，描述世界的当前状态。弱监督信息提取技术的预期结果预计将有广泛的应用，包括检测Twitter上讨论的网络安全事件。该项目将为俄亥俄州州立大学及其他大学的学生提供研究培训和教育经验，因为研究成果将用于开发一个用于弱监督信息提取的开源工具包，该工具包将被广泛分发。当重要事件发生时，知识库贡献者经常近乎实时地编辑受影响实体的属性，例如在维基百科上。与此同时，许多人在社交媒体和新闻中讨论这些事件。由于改变KB实体属性的事件集很大，并且事先没有固定，因此该项目将研究，实现和评估用于从KB修订中学习文本提取器的新模型。该项目将进行实验，学习新闻和Twitter的提取器，使用维基百科信息框编辑作为远程监督。而不是封闭世界的假设，这是常见的在以前的工作中，所提出的方法将规范的标签分布的事件，不匹配的知识修订对用户提供的期望。预计这项研究的结果将有助于解决由于修订历史中未反映的事件而导致的误报问题。该方法自动提出实时维基百科信息框编辑的能力将在公众对事件的了解变得可用时进行测试。以往的弱监督事件抽取研究大多局限于有限的领域。相比之下，这项工作的目的是扩大规模，同时接地文本中提到的事件在知识库中的实体的属性的修订。项目网站（http：//aritter.github.io/crii/）将包括项目信息、出版物链接、软件和研究成果数据集。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Alan Ritter其他文献

Stanceosaurus 2.0 - Classifying Stance Towards Russian and Spanish Misinformation

Stanceosaurus 2.0 - 对俄罗斯和西班牙错误信息的立场进行分类

DOI：
发表时间：
2024
期刊：
WNUT
影响因子：
0
作者：
Anton Lavrouk;Ian Ligon;Tarek Naous;Jonathan Zheng;Alan Ritter;Wei Xu
通讯作者：
Wei Xu

Extracting COVID-19 Events from Twitter

从 Twitter 中提取 COVID-19 事件

DOI：
发表时间：
2020
期刊：
arXiv.org
影响因子：
0
作者：
Shi Zong;Ashutosh Baheti;Wei Xu;Alan Ritter
通讯作者：
Alan Ritter

Why do they stay? : an analysis of factors influencing retention of international school teachers : a thesis submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy, Massey University, Albany, New Zealand

他们为什么留下来？

DOI：
发表时间：
2016
期刊：
影响因子：
0
作者：
Alan Ritter
通讯作者：
Alan Ritter

“i have a feeling trump will win..................”: Forecasting Winners and Losers from User Predictions on Twitter

“我有一种感觉特朗普会赢……”：根据 Twitter 上的用户预测预测赢家和输家

DOI：
发表时间：
2017
期刊：
Conference on Empirical Methods in Natural Language Processing
影响因子：
0
作者：
Sandesh Swamy;Alan Ritter;M. Marneffe
通讯作者：
M. Marneffe

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

张量信任：来自在线游戏的可解释的即时注入攻击

DOI：
10.48550/arxiv.2311.01011
发表时间：
2023
期刊：
ArXiv
影响因子：
0
作者：
S. Toyer;Olivia Watkins;Ethan Mendes;Justin Svegliato;Luke Bailey;Tiffany Wang;Isaac Ong;Karim Elmaaroufi;Pieter Abbeel;Trevor Darrell;Alan Ritter;Stuart Russell
通讯作者：
Stuart Russell