RI: Exploiting and Exploring Discourse Connectivity: Deriving New Technology and Knowledge from the Penn Discourse Treebank

RI:利用和探索话语连通性:从宾夕法尼亚大学话语树库中获取新技术和知识

基本信息

  • 批准号:
    0705671
  • 负责人:
  • 金额:
    $ 95.5万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2007
  • 资助国家:
    美国
  • 起止时间:
    2007-09-15 至 2012-08-31
  • 项目状态:
    已结题

项目摘要

Large scale corpora annotated at the sentence level have played a critical role in natural language research. They have enabled large scale integration of statistical knowledge (derived from the corpora) with linguistic knowledge leading to both technological and scientific applications, such as information extraction, question answering, summarization, and machine translation, among others. This approach is now being extended to the discourse level, thus going beyond the sentence level. Using a resource called the Penn Discourse Treebank (PDTB), a large scale corpus annotated with discourse structure along with the associated semantics, new major experimental work on discourse processing is being carried out, leading to the generation of more coherent summaries and texts, extraction of complex relations in texts, among others, as well as foundational research relevant to language technology. This work is also providing a deeper understanding of the relationship between sentence level and discourse level structures. While pursuing these goals, a variety of tools for making a productive use of the PDTB resource are also being developed. This research program is also coupled with a strong educational program involving training researchers in the PDTB methodology so that similar resources can be developed in other languages substantially divergent from English. This part of the research program has international components including collaboration with research groups in Czech Republic, India, and Finland. The international collaboration is funded by the NSF Office of International Science and Engineering.
在句子层面标注的大规模语料库在自然语言研究中起着至关重要的作用。它们实现了统计知识(来自语料库)与语言学知识的大规模整合,导致了技术和科学应用,如信息提取、问题回答、摘要和机器翻译等。这种方法现在正在扩展到语篇层面,从而超越了句子层面。利用被称为宾夕法尼亚大学语篇树库(PDTB)的大规模语料库,正在开展关于语篇处理的新的重大实验工作,导致生成更连贯的摘要和语篇,提取语篇中的复杂关系,以及与语言技术相关的基础研究。这项工作也加深了对句子层次和语篇层次结构之间关系的理解。在追求这些目标的同时,还在开发各种工具,以便有效地利用私营部门和私营部门的资源。这一研究计划还与一项强有力的教育计划相结合,该计划涉及对PDTB方法的研究人员进行培训,以便可以用与英语大相径庭的其他语言开发类似的资源。该研究计划的这一部分包含国际组成部分,包括与捷克共和国、印度和芬兰的研究小组合作。这项国际合作由美国国家科学基金会国际科学与工程办公室资助。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Aravind Joshi其他文献

Cogniac: a discourse processing engine
Cogniac:话语处理引擎
  • DOI:
  • 发表时间:
    1995
  • 期刊:
  • 影响因子:
    0
  • 作者:
    F. B. Baldwin;Aravind Joshi
  • 通讯作者:
    Aravind Joshi
Quantum Circuit Optimization of Arithmetic circuits using ZX Calculus
使用 ZX 微积分对算术电路进行量子电路优化
  • DOI:
    10.48550/arxiv.2306.02264
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Aravind Joshi;Akshara Kairali;Renju Raju;A. Athreya;R. Monica;Sanjay Vishwakarma;Srinjoy Ganguly
  • 通讯作者:
    Srinjoy Ganguly

Aravind Joshi的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Aravind Joshi', 18)}}的其他基金

CI: ADDO-EN: Significant Enhancement of the Exisitng Penn Discourse Treebank
CI:ADDO-EN:现有宾夕法尼亚大学话语树库的显着增强
  • 批准号:
    1059353
  • 财政年份:
    2011
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Standard Grant
Metagrammatical Knowledge for Grammars and Corpora
语法和语料库的元语法知识
  • 批准号:
    0414409
  • 财政年份:
    2004
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
CISE Research Resources: Discourse Penn Treebank and Multimodal FORM: Development of Two Richly Annotated Corpora
CISE 研究资源:Discourse Penn Treebank 和 Multimodal FORM:两个注释丰富的语料库的开发
  • 批准号:
    0224417
  • 财政年份:
    2002
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
ITR: Mining the Bibliome -- Information Extraction from the Biomedical Literature
ITR:挖掘文献库——从生物医学文献中提取信息
  • 批准号:
    0205448
  • 财政年份:
    2002
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
ITR: Language, Learning, and Modeling Biological Sequences
ITR:语言、学习和生物序列建模
  • 批准号:
    0205456
  • 财政年份:
    2002
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
Constructing Science: Materials and Activities for Kindergarten and First-Grade
构建科学:幼儿园和一年级的材料和活动
  • 批准号:
    9252885
  • 财政年份:
    1992
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
Research in Natural Language Processing: Mathematical and Computational Investigations in Constrained Grammatical Formalisms
自然语言处理研究:受限语法形式主义的数学和计算研究
  • 批准号:
    9016592
  • 财政年份:
    1991
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing grant
Center for Research in Cognitive Science
认知科学研究中心
  • 批准号:
    8920230
  • 财政年份:
    1991
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Cooperative Agreement
Natural Language Processing (Computer Research)
自然语言处理(计算机研究)
  • 批准号:
    8410413
  • 财政年份:
    1984
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing grant
Modelling Interactive Processes: Flexible Communication With Knowledge Bases
交互过程建模:与知识库的灵活通信
  • 批准号:
    8219196
  • 财政年份:
    1983
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant

相似海外基金

Exploring and exploiting Type IV pili for DNA and antibiotic uptake
探索和利用 IV 型菌毛进行 DNA 和抗生素吸收
  • 批准号:
    478582
  • 财政年份:
    2023
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Operating Grants
Exploring and exploiting new representations for multivariate extremes
探索和利用多元极值的新表示
  • 批准号:
    EP/X010449/1
  • 财政年份:
    2023
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Research Grant
CAREER: Exploring and Exploiting Data-Centric Modeling for Fairness in Machine Learning
职业:探索和利用以数据为中心的建模以实现机器学习的公平性
  • 批准号:
    2239257
  • 财政年份:
    2023
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
Exploring and Exploiting Epigenetic Plant Immunity
探索和利用表观遗传植物免疫
  • 批准号:
    BB/W015250/1
  • 财政年份:
    2023
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Research Grant
From Sensing to Collaboration: Engineering, Exploring and Exploiting the Building Blocks of Embodied Intelligence - An EPSRC Programme Grant
从感知到协作:工程、探索和利用体现智能的构建模块 - EPSRC 计划资助
  • 批准号:
    EP/V000748/1
  • 财政年份:
    2021
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Research Grant
Giglets: Exploring and exploiting opportunities in Canada and the Americas
Giglets:探索和利用加拿大和美洲的机会
  • 批准号:
    10017938
  • 财政年份:
    2021
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Collaborative R&D
Re-Entraining the Brain: Exploring and Exploiting Oscillatory Models of Speech Perception in Wernicke's Aphasia
重新训练大脑:探索和利用韦尼克失语症的言语感知振荡模型
  • 批准号:
    MR/T028629/1
  • 财政年份:
    2021
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Fellowship
Exploring and Exploiting the Understanding/Acceptance Assumption
探索和利用理解/接受假设
  • 批准号:
    2049935
  • 财政年份:
    2021
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
Exploring & Exploiting the Dynamic Optical Sky
探索
  • 批准号:
    2034437
  • 财政年份:
    2020
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Continuing Grant
CNS Core: Medium: Collaborative: Exploring and Exploiting Learning for Efficient Network Control: Non-Stationarity, Inter-Dependence, and Domain-Knowledge
CNS 核心:中:协作:探索和利用学习实现高效网络控制:非平稳性、相互依赖和领域知识
  • 批准号:
    1901218
  • 财政年份:
    2019
  • 资助金额:
    $ 95.5万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了