Metagrammatical Knowledge for Grammars and Corpora

语法和语料库的元语法知识

基本信息

  • 批准号:
    0414409
  • 负责人:
  • 金额:
    $ 50万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2004
  • 资助国家:
    美国
  • 起止时间:
    2004-09-01 至 2008-08-31
  • 项目状态:
    已结题

项目摘要

There is today a broad consensus among theoretical linguists (of all frameworks) and researchers in Natural Language Processing (NLP) about what the syntactic phenomena are that we encounter in natural languages. However, there are many different frameworks in which analyses of these phenomena have been implemented, and there is even disagreement about specific analyses within one single framework. As a result, linguistic resources such as annotated corpora or grammars cannot be easily reused across frameworks. This project will investigate the common categorization of syntax that underlies work in linguistics and NLP. This underlying categorization is called a ``metagrammar''. Given a metagrammar, a tool can be produced to automatically generate grammars in different frameworks. This research contains three main activities. The first involves comparative work in several languages (including English) that will lead to coordinated metagrammars for these languages. These framework-independent specifications will catalog syntactic properties and detail their possible interaction; categories shared between languages will lead to shared portions of the metagrammar. The second concerns the development of specific grammar statements that relate metagrammatical categories to constructs in particular frameworks and for particular languages. It is these statements that, in their interaction, determine word order. The third involves annotating the Penn Treebank (PTB) corpus with the syntactic properties from the metagrammar, thus making the information implicitly encoded in the phrase structure of the PTB explicit and usable by other frameworks.This project will enable the NLP and linguistics communities to better share insights on syntactic phenomena. Additionally, the work will enable the development of new NLP tools that are less dependent on a particular representation. It will enable linguists to rapidly develop grammars and test-suites for different frameworks and languages, thus allowing for both cross- and inter-framework evaluation of linguistic grammars. Upon completion of the project, the PTB re-annotated with the high-level categories of the metagrammar will be made available to the research community .
今天,理论语言学家(所有框架)和自然语言处理(NLP)的研究人员就我们在自然语言中遇到的句法现象达成了广泛的共识。然而,对这些现象的分析已经在许多不同的框架中实现,甚至在一个框架内的具体分析也存在分歧。因此,诸如带注释的语料库或语法之类的语言资源不能轻松地跨框架重用。这个项目将研究语言学和自然语言处理工作中常见的语法分类。这种潜在的分类被称为“元语法”。给定一个元语法,可以生成一个工具来自动生成不同框架中的语法。这项研究包括三个主要活动。第一种是对几种语言(包括英语)的比较工作,这将导致这些语言的协调元语法。这些独立于框架的规范将对语法属性进行编目,并详细说明它们之间可能的交互;语言之间共享的类别将导致元语法的共享部分。第二个方面涉及特定语法语句的发展,这些语法语句将元语法类别与特定框架和特定语言中的结构联系起来。正是这些语句在相互作用中决定了语序。第三种方法是用元语法的句法属性对Penn Treebank (PTB)语料库进行注释,从而使隐含在PTB短语结构中的信息显式编码,并可被其他框架使用。该项目将使NLP和语言学社区更好地分享对句法现象的见解。此外,这项工作将使新的NLP工具的开发减少对特定表示的依赖。它将使语言学家能够快速开发不同框架和语言的语法和测试套件,从而允许跨框架和跨框架的语言语法评估。项目完成后,PTB将重新标注元语法的高级类别,并将提供给研究界。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Aravind Joshi其他文献

Cogniac: a discourse processing engine
Cogniac:话语处理引擎
  • DOI:
  • 发表时间:
    1995
  • 期刊:
  • 影响因子:
    0
  • 作者:
    F. B. Baldwin;Aravind Joshi
  • 通讯作者:
    Aravind Joshi
Quantum Circuit Optimization of Arithmetic circuits using ZX Calculus
使用 ZX 微积分对算术电路进行量子电路优化
  • DOI:
    10.48550/arxiv.2306.02264
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Aravind Joshi;Akshara Kairali;Renju Raju;A. Athreya;R. Monica;Sanjay Vishwakarma;Srinjoy Ganguly
  • 通讯作者:
    Srinjoy Ganguly

Aravind Joshi的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Aravind Joshi', 18)}}的其他基金

CI: ADDO-EN: Significant Enhancement of the Exisitng Penn Discourse Treebank
CI:ADDO-EN:现有宾夕法尼亚大学话语树库的显着增强
  • 批准号:
    1059353
  • 财政年份:
    2011
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
RI: Exploiting and Exploring Discourse Connectivity: Deriving New Technology and Knowledge from the Penn Discourse Treebank
RI:利用和探索话语连通性:从宾夕法尼亚大学话语树库中获取新技术和知识
  • 批准号:
    0705671
  • 财政年份:
    2007
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
CISE Research Resources: Discourse Penn Treebank and Multimodal FORM: Development of Two Richly Annotated Corpora
CISE 研究资源:Discourse Penn Treebank 和 Multimodal FORM:两个注释丰富的语料库的开发
  • 批准号:
    0224417
  • 财政年份:
    2002
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
ITR: Mining the Bibliome -- Information Extraction from the Biomedical Literature
ITR:挖掘文献库——从生物医学文献中提取信息
  • 批准号:
    0205448
  • 财政年份:
    2002
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
ITR: Language, Learning, and Modeling Biological Sequences
ITR:语言、学习和生物序列建模
  • 批准号:
    0205456
  • 财政年份:
    2002
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
Constructing Science: Materials and Activities for Kindergarten and First-Grade
构建科学:幼儿园和一年级的材料和活动
  • 批准号:
    9252885
  • 财政年份:
    1992
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
Research in Natural Language Processing: Mathematical and Computational Investigations in Constrained Grammatical Formalisms
自然语言处理研究:受限语法形式主义的数学和计算研究
  • 批准号:
    9016592
  • 财政年份:
    1991
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing grant
Center for Research in Cognitive Science
认知科学研究中心
  • 批准号:
    8920230
  • 财政年份:
    1991
  • 资助金额:
    $ 50万
  • 项目类别:
    Cooperative Agreement
Natural Language Processing (Computer Research)
自然语言处理(计算机研究)
  • 批准号:
    8410413
  • 财政年份:
    1984
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing grant
Modelling Interactive Processes: Flexible Communication With Knowledge Bases
交互过程建模:与知识库的灵活通信
  • 批准号:
    8219196
  • 财政年份:
    1983
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant

相似海外基金

Staffordshire University - Malone Group GB Limited - Knowledge transfer partnerships (KTP): 2023 to 2024 Round 3
斯塔福德郡大学 - Malone Group GB Limited - 知识转移合作伙伴关系 (KTP):2023 年至 2024 年第 3 轮
  • 批准号:
    10082161
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Knowledge Transfer Network
Opening Spaces and Places for the Inclusion of Indigenous Knowledge, Voice and Identity: Moving Indigenous People out of the Margins
为包容土著知识、声音和身份提供开放的空间和场所:使土著人民走出边缘
  • 批准号:
    477924
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Salary Programs
CRII: SaTC: Automated Knowledge Representation for IoT Cybersecurity Regulations
CRII:SaTC:物联网网络安全法规的自动化知识表示
  • 批准号:
    2348147
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
CRII: AF: The Impact of Knowledge on the Performance of Distributed Algorithms
CRII:AF:知识对分布式算法性能的影响
  • 批准号:
    2348346
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
Conference: Doctoral Consortium for the 2024 Learning Analytics & Knowledge Conference
会议:2024 年学习分析博士联盟
  • 批准号:
    2400421
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
CAREER: Digitize and Simulate the Large Physical World via Knowledge-Grounded Scene Representation
职业:通过基于知识的场景表示对大型物理世界进行数字化和模拟
  • 批准号:
    2340254
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
Doctoral Dissertation Research: Health, Wellness, and Indigenous Knowledge: A Community-Based Participatory Research Study
博士论文研究:健康、保健和土著知识:一项基于社区的参与性研究
  • 批准号:
    2343306
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
Developing Teaching Tools to Promote Transfer of Core Concept Knowledge Across Biological Scales and Sub-disciplines.
开发教学工具以促进跨生物尺度和子学科的核心概念知识的转移。
  • 批准号:
    2336776
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
NGO-Prosecutorial Complex in Universal Jurisdiction Cases: Structure and Consequences for Justice and Public Knowledge about Human Rights Violations
普遍管辖权案件中的非政府组织-检察复合体:正义的结构和后果以及公众对侵犯人权行为的了解
  • 批准号:
    2314061
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
Partnering with local knowledge systems to impact river management
与当地知识系统合作影响河流管理
  • 批准号:
    DE240101058
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Discovery Early Career Researcher Award
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了