Collaborative Research: Interlingual Annotation of Multilingual Text Corpora
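Basic information
- Award number: 0325021
- Principal investigator:
- Amount: $168,800
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2003
- Funding country: United States
- Project period: 2003-09-01 to 2006-08-31
- Project status: Completed
- Source:
- Keywords:
Project abstract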
This multi-site research effort is aimed at developing a coherent, consistent, standardized interlingual representation along with a methodology and sharable tools for annotating large bilingual corpora of parallel texts. It has four central components. First, six corpora are being compiled, each consisting of a number of texts in a particular source language along with three translations of each text into English. Second, a standardized interlingual representation is being developed based on a comparative analysis of these parallel text corpora. Third, the bilingual corpora are being annotated using the standardized interlingua and following a predefined annotation procedure. Fourth, metrics are being developed for evaluating the accuracy and appropriateness of the interlingual representations in terms of the grain size of the representation given a particular task. The metrics are based on inter-coder reliability, the growth rate of the interlingual representation, and the quality of the target-language text that is generated from the interlingua. The resulting annotated, multilingual, parallel corpora will be useful as an empirical basis for developing a wide variety of interlingual NLP systems for tasks such as machine translation, question answering, web searching, summarization, or presentation generation, as well as a host of other research and development efforts in theoretical and applied linguistics, foreign language pedagogy, translation studies, and other related disciplines. The participants include CRL at NMSU, ISI at USC, UMIACS at the University of Maryland, LTI at CMU, Columbia University, and The MITRE Corporation. The source languages include Arabic, Chinese, French, Hindi, Japanese, Spanish and English.
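The abstract names inter-coder reliability as one of the evaluation metrics but does not say which agreement statistic is used. As a minimal sketch, assuming Cohen's kappa for two annotators (a common choice, not confirmed by the abstract), agreement on a set of annotated items could be computed as follows. The function name `cohen_kappa` and the labels `AGENT`, `THEME`, and `GOAL` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two coders who labeled the same items.

    Assumption: the project abstract does not name its agreement
    statistic; kappa is used here purely as an illustration.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both coders must label the same non-empty item list")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each coder's label proportions, summed.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        # Both coders used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical annotations of six items by two coders.
coder_1 = ["AGENT", "THEME", "AGENT", "GOAL", "THEME", "AGENT"]
coder_2 = ["AGENT", "THEME", "GOAL", "GOAL", "THEME", "AGENT"]
kappa = cohen_kappa(coder_1, coder_2)
print(f"kappa = {kappa:.2f}")
```

Kappa corrects raw agreement for the agreement expected by chance, which matters when a few labels dominate the annotation scheme. With more than two coders, a multi-rater statistic would be needed instead.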
Project outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Eduard Hovy
A Framework for Effective Annotation of Information from Closed Captions Using Ontologies
- DOI: 10.1007/s10844-005-0188-9
- Publication date: 2005-09-01
- Journal:
- Impact factor: 3.400
- Authors: Latifur Khan; Dennis McLeod; Eduard Hovy
- Corresponding author: Eduard Hovy
A Sentiment Consolidation Framework for Meta-Review Generation
- DOI:
- Publication date: 2024
- Journal:
- Impact factor: 0
- Authors: Miao Li; Jey Han Lau; Eduard Hovy
- Corresponding author: Eduard Hovy
ezCoref: A Scalable Approach for Collecting Crowdsourced Annotations for Coreference Resolution
- DOI:
- Publication date: 2022
- Journal:
- Impact factor: 0
- Authors: A. Crowdsourced; David Bamman; Olivia Lewke; Rachel Bawden; Rico Sennrich; Alexandra Birch; Ari Bornstein; Arie Cattan; Ido Dagan; Hong Chen; Zhenhua Fan; Hao Lu; Alan Yuille; Eduard Hovy; Mitch Marcus; M. Palmer; Lance; Rodney Huddleston. 2002; Frédéric Landragin; T. Poibeau; Bernard Vic; Belinda Z. Li; Gabriel Stanovsky; Robert L Logan; Andrew McCallum; Sameer Singh
- Corresponding author: Sameer Singh
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
- DOI:
- Publication date: 2024
- Journal:
- Impact factor: 0
- Authors: Sang Keun Choe; Hwijeen Ahn; Juhan Bae; Kewen Zhao; Minsoo Kang; Youngseog Chung; Adithya Pratapa; W. Neiswanger; Emma Strubell; Teruko Mitamura; Jeff Schneider; Eduard Hovy; Roger Grosse; Eric Xing
- Corresponding author: Eric Xing
Cooperative Semi-Supervised Transfer Learning of Machine Reading Comprehension
- DOI:
- Publication date: 2021
- Journal:
- Impact factor: 0
- Authors: Oliver Bender; F. Och; Y. Bengio; Réjean Ducharme; Pascal Vincent; Kevin Clark; Quoc Minh; V. Le; J. Devlin; Ming; Kenton Lee; Adam Fisch; Alon Talmor; Robin Jia; Minjoon Seo; Michael R. Glass; A. Gliozzo; Rishav Chakravarti; Ian Goodfellow; Jean Pouget; Mehdi Mirza; Serhii Havrylov; Ivan Titov. 2017; Emergence; Jun; Jiatao Gu; Jiajun Shen; Marc’Aurelio; Matthew Henderson; I. Casanueva; Nikola Mrkšić; Pei; Tsung; Ivan Vulić; Yikang Shen; Yi Tay; Che Zheng; Dara Bahri; Donald; Metzler Aaron; Courville; Structformer; Ashish Vaswani; Noam M. Shazeer; Niki Parmar; Thomas Wolf; Lysandre Debut; Julien Victor Sanh; Clement Chaumond; Anthony Delangue; Pier; Tim ric Cistac; Rémi Rault; Morgan Louf; Qizhe Xie; Eduard Hovy; Silei Xu; Sina J. Semnani; Giovanni Campagna
- Corresponding author: Giovanni Campagna
Other grants by Eduard Hovy
EAGER: A Method to Retrieve Non-Textual Data from Widespread Repositories
- Award number: 1450545
- Fiscal year: 2014
- Award amount: $168,800
- Award type: Standard Grant
III: EAGER: Automatically Building Test Collections Using Implicit Relevance Signals from the Web
- Award number: 1304939
- Fiscal year: 2012
- Award amount: $168,800
- Award type: Standard Grant
EAGER: Constructing, Indexing, and Searching Super-Enriched Document Representations in the Cloud
- Award number: 1265301
- Fiscal year: 2012
- Award amount: $168,800
- Award type: Standard Grant
III: EAGER: Automatically Building Test Collections Using Implicit Relevance Signals from the Web
- Award number: 1147810
- Fiscal year: 2011
- Award amount: $168,800
- Award type: Standard Grant
EAGER: Constructing, Indexing, and Searching Super-Enriched Document Representations in the Cloud
- Award number: 1143703
- Fiscal year: 2011
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research III-COR: From a Pile of Documents to a Collection of Information: A Framework for Multi-Dimensional Text Analysis
- Award number: 0705091
- Fiscal year: 2007
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research: Language Processing Technology for Electronic Rulemaking
- Award number: 0429360
- Fiscal year: 2004
- Award amount: $168,800
- Award type: Continuing Grant
Automating the Integration of EPA Databases
- Award number: 0306899
- Fiscal year: 2003
- Award amount: $168,800
- Award type: Continuing Grant
SGER COLLABORATIVE: A Testbed for eRulemaking Data
- Award number: 0328175
- Fiscal year: 2003
- Award amount: $168,800
- Award type: Standard Grant
ITR: Information Discovery in Digital Government: Self-extending Topic Maps and Ontologies (GrowOnto)
- Award number: 0205111
- Fiscal year: 2002
- Award amount: $168,800
- Award type: Continuing Grant
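Similar National Natural Science Foundation of China (NSFC) grants
Research on Quantum Field Theory without a Lagrangian Description
- Award number: 24ZR1403900
- Year approved: 2024
- Award amount: CNY 0
- Award type: Provincial/municipal project
Cell Research
- Award number: 31224802
- Year approved: 2012
- Award amount: CNY 240,000
- Award type: Special fund project
Cell Research
- Award number: 31024804
- Year approved: 2010
- Award amount: CNY 240,000
- Award type: Special fund project
Cell Research
- Award number: 30824808
- Year approved: 2008
- Award amount: CNY 240,000
- Award type: Special fund project
Research on the Rapid Growth Mechanism of KDP Crystal
- Award number: 10774081
- Year approved: 2007
- Award amount: CNY 450,000
- Award type: General Program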
Similar overseas grants
Collaborative Research: REU Site: Earth and Planetary Science and Astrophysics REU at the American Museum of Natural History in Collaboration with the City University of New York
- Award number: 2348998
- Fiscal year: 2025
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research: REU Site: Earth and Planetary Science and Astrophysics REU at the American Museum of Natural History in Collaboration with the City University of New York
- Award number: 2348999
- Fiscal year: 2025
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research: Investigating Southern Ocean Sea Surface Temperatures and Freshening during the Late Pliocene and Pleistocene along the Antarctic Margin
- Award number: 2313120
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Standard Grant
NSF Engines Development Award: Utilizing space research, development and manufacturing to improve the human condition (OH)
- Award number: 2314750
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Cooperative Agreement
Doctoral Dissertation Research: How New Legal Doctrine Shapes Human-Environment Relations
- Award number: 2315219
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research: Non-Linearity and Feedbacks in the Atmospheric Circulation Response to Increased Carbon Dioxide (CO2)
- Award number: 2335762
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research: Using Adaptive Lessons to Enhance Motivation, Cognitive Engagement, And Achievement Through Equitable Classroom Preparation
- Award number: 2335802
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research: Using Adaptive Lessons to Enhance Motivation, Cognitive Engagement, And Achievement Through Equitable Classroom Preparation
- Award number: 2335801
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Standard Grant
Collaborative Research: Holocene biogeochemical evolution of Earth's largest lake system
- Award number: 2336132
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Standard Grant
CyberCorps Scholarship for Service: Building Research-minded Cyber Leaders
- Award number: 2336409
- Fiscal year: 2024
- Award amount: $168,800
- Award type: Continuing Grant