Collaborative Research: CRI: An Open Linguistic Infrastructure for American English
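Basic information
- Award number: 0551531
- Principal investigator:
- Amount: --
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2006
- Funding country: United States
- Period: 2006-03-01 to 2008-02-29
- Status: Completed
- Source:
- Keywords:
Project abstract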
This project, which supports research in computational linguistics, plans to enhance the American National Corpus (ANC) with an open linguistic infrastructure that will add multiple manual and automatic annotations to a portion of the ANC and provide free access to these annotations in a common XML data format via a project website. The following activities are envisioned:
- Incorporation of automatic annotations derived from freely available existing tools, mapped into the ANC XML format,
- Syntactic and named entity annotation of a 10-million-word (10Mw) gold standard corpus, with partial manual annotation,
- Hand-corrected automatic WordNet and FrameNet annotation for a portion of the gold standard corpus,
- Enhancement of automatic annotation performance via experimentation with machine learning techniques, and
- Development of a web interface for users to download the above annotations and to upload new annotations of the ANC.
This work, which describes methods for internal and external evaluation of the resources and tools developed, plans to create a richly and multiply annotated, diverse corpus of natural language, along with tools to access it. The full project would be the first large-scale execution of such an effort, developing a 100-million-word ANC and providing a 10-million-word subset annotated with syntax, named entities, and semantic categories from WordNet (WN) and FrameNet (FN). The annotated data will be balanced across different genres of text. One activity of the planning award consists of harmonizing all three resources, ANC, WN, and FN, and maximally exploiting their respective strengths. The other involves the continued development of the ANC, which, with the addition of a wide range of linguistic annotations, will serve as a resource for language processing research and applications for the NLP community.
The planning project undertakes the following activities:
- Creation and annotation of WN senses and FN frames,
- Planning meetings,
- Further research into, and experimentation with, methods and software to enhance automatic annotation, and
- Outreach to the US computational linguistics community.
Broader Impact: Full completion of this work will further enhance the ANC by creating a comprehensive linguistic infrastructure for American English. The availability of a massive, richly annotated corpus of American English has impacts at many levels and across several areas, including computational linguistics and natural language processing, corpus linguistics, cross-linguistic studies, dialect studies, language acquisition, and materials development for both English language students and teacher training.
Project outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Christiane Fellbaum
Erratum to: Large, huge or gigantic? Identifying and encoding intensity relations among adjectives in WordNet
- DOI: 10.1007/s10579-013-9235-2
- Published: 2013-06-02
- Journal:
- Impact factor: 1.800
- Authors: Vera Sheinman; Christiane Fellbaum; Isaac Julien; Peter Schulam; Takenobu Tokunaga
- Corresponding author: Takenobu Tokunaga
Idiome in einem Digitalen Lexikalischen System
- DOI: 10.1007/bf03379401
- Published: 2017-02-21
- Journal:
- Impact factor: 0.100
- Authors: Christiane Fellbaum
- Corresponding author: Christiane Fellbaum
Human and Automatic Interpretation of Romanian Noun Compounds
- DOI: 10.48550/arxiv.2403.06360
- Published: 2024
- Journal:
- Impact factor: 0
- Authors: Ioana Marinescu; Christiane Fellbaum
- Corresponding author: Christiane Fellbaum
Other grants by Christiane Fellbaum
CI-P: Collaborative Research: LexLink: Aligning WordNet, FrameNet, PropBank and VerbNet
- Award number: 1205473
- Fiscal year: 2012
- Amount: --
- Award type: Standard Grant
A Workshop on Restructuring Adjectives in WordNet
- Award number: 1139844
- Fiscal year: 2011
- Amount: --
- Award type: Standard Grant
CI-ADDO-EN: A Second-Generation Architecture for WordNet
- Award number: 0855157
- Fiscal year: 2009
- Amount: --
- Award type: Standard Grant
RI: Collaborative Proposal: Complementary Lexical Resources: Towards an Alignment of WordNet and FrameNet
- Award number: 0705199
- Fiscal year: 2007
- Amount: --
- Award type: Standard Grant
Constructing an Enhanced Version of WordNet
- Award number: 0414072
- Fiscal year: 2004
- Amount: --
- Award type: Standard Grant
Collaborative Proposal: Using the Web as a Corpus for Empirical Linguistic Research
- Award number: 0112429
- Fiscal year: 2001
- Amount: --
- Award type: Standard Grant
WordNet as an Interlingual Lexical Resource
- Award number: 9805732
- Fiscal year: 1998
- Amount: --
- Award type: Standard Grant
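Similar grants from the National Natural Science Foundation of China
Research on Quantum Field Theory without a Lagrangian Description
- Award number: 24ZR1403900
- Approval year: 2024
- Amount: CNY 0
- Award type: Provincial/municipal project
Cell Research
- Award number: 31224802
- Approval year: 2012
- Amount: CNY 240,000
- Award type: Special fund project
Cell Research
- Award number: 31024804
- Approval year: 2010
- Amount: CNY 240,000
- Award type: Special fund project
Cell Research (细胞研究)
- Award number: 30824808
- Approval year: 2008
- Amount: CNY 240,000
- Award type: Special fund project
Research on the Rapid Growth Mechanism of KDP Crystal
- Award number: 10774081
- Approval year: 2007
- Amount: CNY 450,000
- Award type: General program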
Similar overseas grants
CRI: CI-EN: Collaborative Research: mResearch: A platform for Reproducible and Extensible Mobile Sensor Big Data Research
- Award number: 1822935
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-New: Collaborative Research: Extensible, Software Enabled Unmanned Aerial Vehicles
- Award number: 1823230
- Fiscal year: 2018
- Amount: --
- Award type: Continuing Grant
CRI: CI-EN: Collaborative Research: OpenNetVM: A Software Platform Enabling Network Function Virtualization Research
- Award number: 1823236
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-EN: Collaborative Research: An Experimental Infrastructure and a Database of Real Faults to Foster Reproducibility in Software Engineering Research
- Award number: 1929215
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-SUSTAIN: Collaborative Research: Sustaining Lemur Project Resources for the Long-Term
- Award number: 1822986
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-EN: Collaborative Research: An Experimental Infrastructure and a Database of Real Faults to Foster Reproducibility in Software Engineering Research
- Award number: 1823172
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-New: Collaborative Research: NJR: A Normalized Java Resource
- Award number: 1823227
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-EN: Collaborative Research: mResearch: A platform for Reproducible and Extensible Mobile Sensor Big Data Research
- Award number: 1823221
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-SUSTAIN: Collaborative Research: CiteSeerX: Toward Sustainable Support of Scholarly Big Data
- Award number: 1823288
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant
CRI: CI-SUSTAIN: Collaborative Research: CiteSeerX: Toward Sustainable Support of Scholarly Big Data
- Award number: 1853919
- Fiscal year: 2018
- Amount: --
- Award type: Standard Grant


















