ITR: Collaborative Research: Interlingual Annotation of Multilingual Text Corporation
ITR:协作研究:多语言文本公司的语际注释
基本信息
- 批准号:0325887
- 负责人:
- 金额:$ 26.87万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2003
- 资助国家:美国
- 起止时间:2003-09-01 至 2006-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
This multi-site research effort is aimed at developing a coherent, consistent, standardized Interlingual representation along with a methodology and sharable tools for annotating large bilingual corpora of parallel texts. It has four central components: First, six corpora are being compiled, each consisting of a number of texts in a particular source language along with three translations of each text into English. Second, a standardized interlingual representation is being developed based on a comparative analysis of these parallel text corpora. Third, the bilingual corpora are being annotated using the standardized interlingua and following a predefined annotation procedure. Fourth, metrics are being developed for evaluating the accuracy and appropriateness of the interlingual representations in terms of the grain size of the representation given a particular task. The metrics are based on inter-coder reliability, the growth rate of the interlingual representation, and quality of the target language text that is be generated from the interlingua. The resulting annotated, multilingual, parallel corpora will be useful as an empirical basis for developing a wide variety of interlingual NLP systems for tasks such as machine translation, question answering, web searching, summarization, or presentation generation, as well as a host of other research and development efforts in theoretical and applied linguistics, foreign language pedagogy, translation studies, and other related disciplines. The participants include CRL at NMSU, ISI at USC, UMIACS at the University of Maryland, LTI at CMU, Columbia University, and The MITRE Corporation. The source languages include Arabic, Chinese, French, Hindi, Japanese, Spanish and English.
这个多站点的研究工作的目的是开发一个连贯的,一致的,标准化的语际表示沿着的方法和共享的工具,注释大型双语语料库的平行文本。它有四个核心组成部分:首先,正在编制六个语料库,每个语料库由特定源语言的若干文本组成,沿着每个文本的三个英文译本。 其次,一个标准化的语际表示正在开发的基础上,这些平行文本语料库的比较分析。第三,双语语料库使用标准化的中间语进行注释,并遵循预定义的注释程序。第四,正在开发的指标,用于评估的准确性和适当的语际表示的粒度表示给定的特定任务。这些指标基于编码器间的可靠性、语际表示的增长率以及从语际生成的目标语言文本的质量。由此产生的注释,多语言,平行语料库将是有用的经验基础,为开发各种各样的语际NLP系统的任务,如机器翻译,问答,网络搜索,摘要,或演示文稿生成,以及在理论和应用语言学,外语教学,翻译研究和其他相关学科的其他研究和开发工作。参与者包括NMSU的CRL、USC的ISI、马里兰州大学的UMIACS、CMU的LTI、哥伦比亚大学和MITRE公司。源语言包括阿拉伯语、中文、法语、印地语、日语、西班牙语和英语。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Kathleen McKeown其他文献
Detecting Grief Online Among Black Harlem Residents
在哈莱姆区黑人居民中在线检测悲伤情绪
- DOI:
10.1016/j.biopsych.2025.02.119 - 发表时间:
2025-05-01 - 期刊:
- 影响因子:9.000
- 作者:
Desmond Patton;Shana Kleiner;Shug Miller;Nick Deas;Jessie Grieser;James Shepherd;Elsbeth Turcan;Kathleen McKeown - 通讯作者:
Kathleen McKeown
Cross-Document Temporal Relation Extraction with Temporal Anchoring Events
使用时间锚定事件进行跨文档时间关系提取
- DOI:
- 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
Miguel Ballesteros;Rishita Anubhai;Shuai Wang;minder Bhatia;Kathleen McKeown;Yaser Al;Iz Beltagy;Matthew E. Peters;Arman Cohan;Steven Bethard;James H. Martin;Sara Klingenstein;Taylor Cassidy;Bill McDowell;Nathanael Chambers;Danqi Chen;Adam Fisch;Jason Weston;Anne;Manuela Speranza;Eneko Agirre;N. Mostafazadeh;Alyson Grealish - 通讯作者:
Alyson Grealish
Sources of Hallucination by Large Language Models on Inference Tasks Anonymous EMNLP
推理任务中大型语言模型的幻觉来源 Anonymous EMNLP
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
Nick McKenna;Mark Steedman. 2022;Smooth;Todor Mihaylov;Peter Clark;Tushar Khot;Dat Ba Nguyen;Johannes Hoffart;Martin Theobald;Ouyang Long;Jeff Wu;Xu Jiang;Car;L. Wainwright;Pamela Mishkin;Chong Zhang;Paul Christiano;J. Leike;Ryan Lowe;Adam Poliak;Jason Naradowsky;Aparajita Haldar;Rachel Rudinger;Benjamin Van;Eleanor Rosch;Carolyn B. Mervis;Wayne D Gray;David M Johnson;P. Boyes;Krishna Srinivasan;K. Raman;Anupam Samanta;Lingrui Liao;Luca Bertelli;Rohan Taori;Ishaan Gulrajani;Tianyi Zhang;Yann Dubois;Xuechen Li;Carlos Guestrin;Percy Liang;Tatsunori Hashimoto;Stan;Kushal Tirumala;A. Markosyan;Luke Zettlemoyer;Hugo Touvron;Thibaut Lavril;Gautier Izacard;Xavier Martinet;Marie;Timothée Lacroix;Baptiste Rozière;Naman Goyal;Eric Hambro;Faisal Azhar;Aurelien Rodriguez;Armand Joulin;Jason Wei;Xuezhi Wang;D. Schuurmans;Maarten Bosma;Brian Ichter;Fei Xia;E. Chi;V. Quoc;Le;Denny Zhou. 2022;Orion Weller;Marc Marone;Nathaniel Weir;Dawn Lawrie;Daniel Khashabi;Faisal Ladhak;Esin Durmus;Kathleen McKeown - 通讯作者:
Kathleen McKeown
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
TinyStyler:通过作者嵌入进行高效的少量文本样式传输
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:0
- 作者:
Zachary Horvitz;Ajay Patel;Kanishk Singh;Christopher Callison;Kathleen McKeown;Zhou Yu - 通讯作者:
Zhou Yu
Kathleen McKeown的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Kathleen McKeown', 18)}}的其他基金
RI: Medium: Automatically Understanding and Identifying Digital Expression of Black Grief
RI:媒介:自动理解和识别黑人悲伤的数字表达
- 批准号:
2106666 - 财政年份:2021
- 资助金额:
$ 26.87万 - 项目类别:
Standard Grant
RI: Small: Describing Disasters and the Ensuing Personal Toll
RI:小:描述灾难和随之而来的个人损失
- 批准号:
1422863 - 财政年份:2014
- 资助金额:
$ 26.87万 - 项目类别:
Standard Grant
EAGER: Corpus-Based Narrative Semantics
EAGER:基于语料库的叙事语义
- 批准号:
0935360 - 财政年份:2009
- 资助金额:
$ 26.87万 - 项目类别:
Standard Grant
RI: Large: Collaborative Research: Richer Representations for Machine Translation
RI:大型:协作研究:更丰富的机器翻译表示
- 批准号:
0910778 - 财政年份:2009
- 资助金额:
$ 26.87万 - 项目类别:
Standard Grant
Text-to-Text Generation for Summarizing Informal Genres
用于总结非正式流派的文本到文本生成
- 批准号:
0534871 - 财政年份:2006
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
DLI-Phase 2: A Patient Care Digital Library: Personalized Retrieval Summarization of Multimedia Information
DLI-阶段 2:患者护理数字图书馆:多媒体信息的个性化检索摘要
- 批准号:
9817434 - 财政年份:1999
- 资助金额:
$ 26.87万 - 项目类别:
Cooperative Agreement
STIMULATE: An Environment for Illustrated Briefing and Follow-up Search Over Live Multimedia Information
STIMULATE:通过实时多媒体信息进行图解简报和后续搜索的环境
- 批准号:
9619124 - 财政年份:1997
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
STIMULATE: Generating Coherent Summaries of On-Line Documents: Combining Statistical and Symbolic Techniques
刺激:生成在线文档的连贯摘要:结合统计和符号技术
- 批准号:
9618797 - 财政年份:1997
- 资助金额:
$ 26.87万 - 项目类别:
Standard Grant
CARD: Corpus Analysis Resources for Discourse
CARD:话语语料库分析资源
- 批准号:
9528998 - 财政年份:1996
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
CISE Research Infrastructure: Scalable Multimedia Information Processing
CISE 研究基础设施:可扩展多媒体信息处理
- 批准号:
9625374 - 财政年份:1996
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
相似海外基金
ITR Collaborative Research: Pervasively Secure Infrastructures (PSI): Integrating Smart Sensing, Data Mining, Pervasive Networking, and Community Computing
ITR 协作研究:普遍安全基础设施 (PSI):集成智能传感、数据挖掘、普遍网络和社区计算
- 批准号:
1404694 - 财政年份:2013
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR-SCOTUS: A Resource for Collaborative Research in Speech Technology, Linguistics, Decision Processes, and the Law
ITR-SCOTUS:语音技术、语言学、决策过程和法律合作研究的资源
- 批准号:
1139735 - 财政年份:2011
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
0963973 - 财政年份:2009
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
1018072 - 财政年份:2009
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR Collaborative Research: A Reusable, Extensible, Optimizing Back End
ITR 协作研究:可重用、可扩展、优化的后端
- 批准号:
0838899 - 财政年份:2008
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR Collaborative Research: Pervasively Secure Infrastructures (PSI): Integrating Smart Sensing, Data Mining, Pervasive Networking, and Community Computing
ITR 协作研究:普遍安全基础设施 (PSI):集成智能传感、数据挖掘、普遍网络和社区计算
- 批准号:
0833849 - 财政年份:2008
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR/NGS: Collaborative Research: DDDAS: Data Dynamic Simulation for Disaster Management
ITR/NGS:合作研究:DDDAS:灾害管理数据动态模拟
- 批准号:
0808419 - 财政年份:2007
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR: Collaborative Research - ASE - (sim+dmc): Image-based Biophysical Modeling: Scalable Registration and Inversion Algorithms and Distributed Computing
ITR:协作研究 - ASE - (sim dmc):基于图像的生物物理建模:可扩展配准和反演算法以及分布式计算
- 批准号:
0849301 - 财政年份:2007
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant
ITR: Collaborative Research: Modeling and Display of Haptic Information for Enhanced Performance of Computer-Integrated Surgery
ITR:协作研究:触觉信息建模和显示,以提高计算机集成手术的性能
- 批准号:
0711040 - 财政年份:2007
- 资助金额:
$ 26.87万 - 项目类别:
Standard Grant
ITR Collaborative Research: GEON: A Research Project to Create Cyberinfrastructure for the Geosciences
ITR 合作研究:GEON:为地球科学创建网络基础设施的研究项目
- 批准号:
0724265 - 财政年份:2006
- 资助金额:
$ 26.87万 - 项目类别:
Continuing Grant














{{item.name}}会员




