CCRI: Planning: Planning for the Development of a Platform to Support Multilingual and Multi-Domain Coreference Annotation for Natural Language Processing Research
CCRI:规划:规划开发支持自然语言处理研究多语言、多领域共指标注的平台
基本信息
- 批准号:1925548
- 负责人:
- 金额:$ 10万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-09-01 至 2022-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
In natural language processing, coreference resolution involves clustering together all words and phrases within a text that refer to the same entity. For example, in the sentence "Monsieur Poirot assured Hastings that he ought to have faith in him," the strings "Monsieur Poirot" and "him" refer to the same person, while "Hastings" and "he" refer to a different character. Resolving these references is challenging because it requires the application of syntactic, semantic, and world knowledge, and it is important since coreference is essential to intelligently understand the meaning of text for question answering, translation, corpus insights, and many other applications. Unfortunately, current coreference models are held back by the lack of human-annotated training data from various domains and world languages, mainly because it is expensive and time-consuming to collect such data at scale.This CCRI planning grant will take the first step toward breaking the coreference data bottleneck by creating two new resources for the community: (1) a software platform that facilitates cheap and accurate crowdsourced collection for tasks that require labeling text spans within documents, and (2) a multi-domain crowdsourced coreference dataset collected using this platform. The dataset resource will contain data from a variety of different domains (such as books and web forums), unlike prior datasets that focus primarily on newswire text, which will allow researchers who work on non-standard domains to integrate coreference systems into their modeling pipelines. This planning grant will also support discussions and conference workshops about the platform and data resources; the resulting community feedback will be incorporated into a CCRI full proposal that aims to use the platform to create a much larger and multilingual coreference dataset, as well as explore non-coreference data labeling tasks such as question answering.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在自然语言处理中,共指关系解析涉及将文本中引用同一实体的所有单词和短语聚在一起。例如,在“波罗先生向黑斯廷斯保证他应该信任他”这句话中,“波罗先生”和“他”指的是同一个人,而“黑斯廷斯”和“他”指的是不同的角色。解决这些引用是具有挑战性的,因为它需要应用句法、语义和世界知识,而且由于共指对于智能地理解用于问题回答、翻译、语料库洞察和许多其他应用的文本的意义是必不可少的。遗憾的是,由于缺乏来自不同领域和世界语言的人工标注的训练数据,当前的共引模型受到阻碍,主要是因为大规模收集此类数据既昂贵又耗时。该CCRI规划赠款将通过为社区创建两个新资源来朝着打破共引数据瓶颈迈出第一步:(1)一个软件平台,它促进廉价而准确的众包收集,用于需要在文档中标记文本跨度的任务;(2)使用该平台收集的多域众包共引数据集。数据集资源将包含来自各种不同领域(如书籍和网络论坛)的数据,这与以前主要关注新闻报道文本的数据集不同,这将允许研究非标准领域的研究人员将共同参考系统整合到他们的建模管道中。这笔规划拨款还将支持关于该平台和数据资源的讨论和会议研讨会;由此产生的社区反馈将被纳入CCRI的全面提案中,该提案旨在利用该平台创建一个更大的、多语言的共参考数据集,并探索问题回答等非共参考数据标签任务。该奖项反映了NSF的法定使命,并通过使用基金会的智力优势和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
ezCoref: Towards Unifying Annotation Guidelines for Coreference Resolution
ezCoref:迈向统一共指解析注释指南
- DOI:
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Gupta, Ankita;Karpinska, Marzena;Zhao, Wenlong;Krishna, Kalpesh;Merullo, Jack;Yeh, Luke;Iyyer, Mohit;O'Connor, Brendan
- 通讯作者:O'Connor, Brendan
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Brendan O'Connor其他文献
Primary and secondary microneuroanastomotic repair of the mental nerve in the rat
- DOI:
10.1016/s0901-5027(87)80086-3 - 发表时间:
1987-08-01 - 期刊:
- 影响因子:
- 作者:
John R. Zuniga;Brendan O'Connor - 通讯作者:
Brendan O'Connor
Bovine brain pyroglutamyl aminopeptidase (type-1): purification and characterisation of a neuropeptide-inactivating peptidase.
牛脑焦谷氨酰氨基肽酶(1 型):神经肽失活肽酶的纯化和表征。
- DOI:
10.1016/1357-2725(96)00034-9 - 发表时间:
1996 - 期刊:
- 影响因子:0
- 作者:
Philip M. Cummins;Brendan O'Connor - 通讯作者:
Brendan O'Connor
Thyrotropin‐Releasing Hormone
促甲状腺激素释放激素
- DOI:
10.1046/j.1471-4159.1995.65030953.x - 发表时间:
1995 - 期刊:
- 影响因子:4.7
- 作者:
R. O'Leary;Brendan O'Connor - 通讯作者:
Brendan O'Connor
The Management of Chest Wall Resection in a Patient With Polyostotic Fibrous Dysplasia and Respiratory Failure
- DOI:
10.1053/j.jvca.2008.09.009 - 发表时间:
2009-08-01 - 期刊:
- 影响因子:
- 作者:
Brendan O'Connor;Frank J. Collins - 通讯作者:
Frank J. Collins
Brendan O'Connor的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Brendan O'Connor', 18)}}的其他基金
Collaborative Research: DMREF: Establishing a molecular interaction framework to design and predict modern polymer semiconductor assembly
合作研究:DMREF:建立分子相互作用框架来设计和预测现代聚合物半导体组装
- 批准号:
2324191 - 财政年份:2023
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CAREER: Social Aggregate Measurement from Text
职业:从文本进行社会聚合测量
- 批准号:
1845576 - 财政年份:2019
- 资助金额:
$ 10万 - 项目类别:
Continuing Grant
III: Small: Collaborative Research: Building Subjective Knowledge Bases by Modeling Viewpoints
III:小:协作研究:通过建模观点构建主观知识库
- 批准号:
1814955 - 财政年份:2018
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
INFEWS/T3: Solar-Powered Integrated Greenhouse (SPRING) Systems Using Wavelength Selective Photovoltaics for Complete Solar Utilization
INFEWS/T3:使用波长选择性光伏技术实现太阳能完全利用的太阳能集成温室 (SPRING) 系统
- 批准号:
1639429 - 财政年份:2017
- 资助金额:
$ 10万 - 项目类别:
Continuing Grant
CAREER: Mechanical Behavior of Flexible Electronic Films
职业:柔性电子薄膜的机械行为
- 批准号:
1554322 - 财政年份:2016
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
Mechanical Behavior of Polymer-Fullerene Blends for Photovoltaic Applications
用于光伏应用的聚合物-富勒烯共混物的机械行为
- 批准号:
1200340 - 财政年份:2012
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
相似海外基金
Collaborative Research: CCRI: Planning-C: A Community for Configurability Open Research and Development (ACCORD)
合作研究:CCRI:Planning-C:可配置性开放研究与开发社区 (ACCORD)
- 批准号:
2234909 - 财政年份:2023
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
Collaborative Research: CCRI: Planning-C: A Community for Configurability Open Research and Development (ACCORD)
合作研究:CCRI:Planning-C:可配置性开放研究与开发社区 (ACCORD)
- 批准号:
2234908 - 财政年份:2023
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning-C: A Framework for Development of Robots and IoT for Precision Agriculture
CCRI:Planning-C:精准农业机器人和物联网开发框架
- 批准号:
2213839 - 财政年份:2022
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning: Collaborative Proposal: Tools and Research Priority Analyses for Development of Open-Source AI-Enabled Control and Testing Framework for 6G Cellular Research
CCRI:规划:协作提案:为 6G 蜂窝研究开发开源人工智能控制和测试框架的工具和研究优先分析
- 批准号:
2016724 - 财政年份:2020
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning: ScooterLab: Development of a Programmable and Participatory e-Scooter Testbed to Enable CISE-focused Micromobility Research
CCRI:规划:ScooterLab:开发可编程和参与式电动滑板车测试平台,以实现以 CISE 为重点的微移动研究
- 批准号:
2016717 - 财政年份:2020
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning: Collaborative Research: Infrastructure for Enabling Systematic Development and Research of Scientific Workflow Management Systems
CCRI:规划:协作研究:支持科学工作流程管理系统系统开发和研究的基础设施
- 批准号:
2016610 - 财政年份:2020
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning: Development of a Community Resource for Digital Image Research
CCRI:规划:数字图像研究社区资源的开发
- 批准号:
1925494 - 财政年份:2020
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning: Collaborative Proposal: Tools and Research Priority Analyses for Development of Open-Source AI-Enabled Control and Testing Framework for 6G Cellular Research
CCRI:规划:协作提案:为 6G 蜂窝研究开发开源人工智能控制和测试框架的工具和研究优先分析
- 批准号:
2016688 - 财政年份:2020
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning: Collaborative Research: Infrastructure for Enabling Systematic Development and Research of Scientific Workflow Management Systems
CCRI:规划:协作研究:支持科学工作流程管理系统系统开发和研究的基础设施
- 批准号:
2016619 - 财政年份:2020
- 资助金额:
$ 10万 - 项目类别:
Standard Grant
CCRI: Planning: Collaborative Research: Infrastructure for Enabling Systematic Development and Research of Scientific Workflow Management Systems
CCRI:规划:协作研究:支持科学工作流程管理系统系统开发和研究的基础设施
- 批准号:
2016682 - 财政年份:2020
- 资助金额:
$ 10万 - 项目类别:
Standard Grant