ITR: Collaborative Research: Interlingual Annotation of Multilingual Text
ITR:协作研究:多语言文本的语际注释
基本信息
- 批准号:0325695
- 负责人:
- 金额:$ 16.88万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2003
- 资助国家:美国
- 起止时间:2003-09-01 至 2005-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
This multi-site research effort is aimed at developing a coherent, consistent, standardized Interlingual representation along with a methodology and sharable tools for annotating large bilingual corpora of parallel texts. It has four central components: First, six corpora are being compiled, each consisting of a number of texts in a particular source language along with three translations of each text into English. Second, a standardized interlingual representation is being developed based on a comparative analysis of these parallel text corpora. Third, the bilingual corpora are being annotated using the standardized interlingua and following a predefined annotation procedure. Fourth, metrics are being developed for evaluating the accuracy and appropriateness of the interlingual representations in terms of the grain size of the representation given a particular task. The metrics are based on inter-coder reliability, the growth rate of the interlingual representation, and quality of the target language text that is be generated from the interlingua. The resulting annotated, multilingual, parallel corpora will be useful as an empirical basis for developing a wide variety of interlingual NLP systems for tasks such as machine translation, question answering, web searching, summarization, or presentation generation, as well as a host of other research and development efforts in theoretical and applied linguistics, foreign language pedagogy, translation studies, and other related disciplines. The participants include CRL at NMSU, ISI at USC, UMIACS at the University of Maryland, LTI at CMU, Columbia University, and The MITRE Corporation. The source languages include Arabic, Chinese, French, Hindi, Japanese, Spanish and English.
这个多站点的研究工作的目的是开发一个连贯的,一致的,标准化的语际表示沿着的方法和共享的工具,注释大型双语语料库的平行文本。它有四个核心组成部分:首先,正在编制六个语料库,每个语料库由特定源语言的若干文本组成,沿着每个文本的三个英文译本。 其次,一个标准化的语际表示正在开发的基础上,这些平行文本语料库的比较分析。第三,双语语料库使用标准化的中间语进行注释,并遵循预定义的注释程序。第四,正在开发的指标,用于评估的准确性和适当的语际表示的粒度表示给定的特定任务。这些指标基于编码器间的可靠性、语际表示的增长率以及从语际生成的目标语言文本的质量。由此产生的注释,多语言,平行语料库将是有用的经验基础,为开发各种各样的语际NLP系统的任务,如机器翻译,问答,网络搜索,摘要,或演示文稿生成,以及在理论和应用语言学,外语教学,翻译研究和其他相关学科的其他研究和开发工作。参与者包括NMSU的CRL、USC的ISI、马里兰州大学的UMIACS、CMU的LTI、哥伦比亚大学和MITRE公司。源语言包括阿拉伯语、中文、法语、印地语、日语、西班牙语和英语。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Lorraine Levin其他文献
Lorraine Levin的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Lorraine Levin', 18)}}的其他基金
Conference: Training the US Computational Linguistics Team
会议:培训美国计算语言学团队
- 批准号:
2329963 - 财政年份:2023
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
Collaborative Research: RI: Medium: From Acoustic Signal to Morphosyntactic Analysis in One End-to-End Neural System
合作研究:RI:媒介:从声学信号到端到端神经系统中的形态句法分析
- 批准号:
2211951 - 财政年份:2022
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
Conference: International Linguistics Olympiad (2022)
会议:国际语言学奥林匹克(2022)
- 批准号:
2141334 - 财政年份:2022
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
North American Computational Linguistics Olympiad (NACLO) 2020
2020 年北美计算语言学奥林匹克竞赛 (NACLO)
- 批准号:
1946109 - 财政年份:2020
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
Workshop: International Linguistics Olympiad (ILO) July 2019; Yongin, South Korea
研讨会:国际语言学奥林匹克(ILO)2019 年 7 月;
- 批准号:
1851142 - 财政年份:2019
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
International Linguistics Olympiad (ILO) 2018: Prague, CZ, July 26 - August 1, 2018
2018 年国际语言学奥林匹克 (ILO):捷克布拉格,2018 年 7 月 26 日至 8 月 1 日
- 批准号:
1757042 - 财政年份:2018
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
Workshop: International Computational Linguistics Olympiad 2017
研讨会:2017 年国际计算语言学奥林匹克竞赛
- 批准号:
1654253 - 财政年份:2017
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
The International Linguistics Olympiad: Preparing High School Students for the Study of Human Language and Computation
国际语言学奥林匹克:为高中生学习人类语言和计算做好准备
- 批准号:
1137828 - 财政年份:2011
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
SGER: Collaborative Research: New Problem Genres for the North American Computational Linguistics Olympiad
SGER:协作研究:北美计算语言学奥林匹克竞赛的新问题类型
- 批准号:
0838848 - 财政年份:2008
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
Active Selection of Data for Machine Translation
主动选择机器翻译数据
- 批准号:
0713292 - 财政年份:2007
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
相似海外基金
ITR Collaborative Research: Pervasively Secure Infrastructures (PSI): Integrating Smart Sensing, Data Mining, Pervasive Networking, and Community Computing
ITR 协作研究:普遍安全基础设施 (PSI):集成智能传感、数据挖掘、普遍网络和社区计算
- 批准号:
1404694 - 财政年份:2013
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR-SCOTUS: A Resource for Collaborative Research in Speech Technology, Linguistics, Decision Processes, and the Law
ITR-SCOTUS:语音技术、语言学、决策过程和法律合作研究的资源
- 批准号:
1139735 - 财政年份:2011
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
0963973 - 财政年份:2009
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
1018072 - 财政年份:2009
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR Collaborative Research: A Reusable, Extensible, Optimizing Back End
ITR 协作研究:可重用、可扩展、优化的后端
- 批准号:
0838899 - 财政年份:2008
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR Collaborative Research: Pervasively Secure Infrastructures (PSI): Integrating Smart Sensing, Data Mining, Pervasive Networking, and Community Computing
ITR 协作研究:普遍安全基础设施 (PSI):集成智能传感、数据挖掘、普遍网络和社区计算
- 批准号:
0833849 - 财政年份:2008
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
0808419 - 财政年份:2007
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR: Collaborative Research - ASE - (sim+dmc): Image-based Biophysical Modeling: Scalable Registration and Inversion Algorithms and Distributed Computing
ITR:协作研究 - ASE - (sim dmc):基于图像的生物物理建模:可扩展配准和反演算法以及分布式计算
- 批准号:
0849301 - 财政年份:2007
- 资助金额:
$ 16.88万 - 项目类别:
Continuing Grant
ITR: Collaborative Research: Modeling and Display of Haptic Information for Enhanced Performance of Computer-Integrated Surgery
ITR:协作研究:触觉信息建模和显示,以提高计算机集成手术的性能
- 批准号:
0711040 - 财政年份:2007
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant
Collaborative Research: ITR-(ASE)-(dmc): Overcoming Fractionation Errors in Cancer Treatement Planning
合作研究:ITR-(ASE)-(dmc):克服癌症治疗计划中的分割错误
- 批准号:
0749671 - 财政年份:2006
- 资助金额:
$ 16.88万 - 项目类别:
Standard Grant














{{item.name}}会员




