INCA: An Integrated Cluster Computing Architecture for Machine Translation
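Basic information
- Award number: 0844507
- Principal investigator:
- Amount: $449,100
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2009
- Funding country: United States
- Period: 2009-02-15 to 2012-01-31
- Project status: Completed
- Source:
- Keywords: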
Project abstract
Progress in the field of machine translation (MT) has come to depend heavily on open-source toolkits, which make it easier for new research groups to tackle the problem at lower cost, broadening participation. Unfortunately, toolkits have not kept up with the modern computing infrastructure (e.g., the MapReduce framework) required for "big data" approaches to MT; the "primitives" in most toolkits are hardly extensible to new models, since they focus on pipeline components rather than algorithmic concepts; and experiment management has been all but ignored.
This project is developing the Integrated Cluster Computing Architecture (INCA) for translation to overcome these challenges by implementing an extensible, open-source toolkit that can leverage MapReduce clusters and flexibly implement many types of MT systems. MT is not a perfect fit for MapReduce (it has massive memory footprints and requires iterative algorithms); new algorithms are being developed to take advantage of the framework without being limited by it. Experiment management, evaluation, and advice about "best practices" are also part of the toolkit, to make it as widely accessible as possible.
This project is expected to have broad impact in MT research through the open-source toolkit to be made available to the research community. A course project suitable for undergraduates will be developed using the toolkit and shared openly. Technical solutions to problems in large-scale, parallelized MT will be applicable in areas of data-intensive natural language processing and machine learning, and elements of the toolkit are expected to be useful in such research efforts as well.
Project outcomes
Journal papers (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Stephan Vogel
QCRI’s Machine Translation Systems for IWSLT’16
- DOI:
- Year published: 2016
- Journal:
- Impact factor: 0
- Authors: Nadir Durrani; Fahim Dalvi; Hassan Sajjad; Stephan Vogel
- Corresponding author: Stephan Vogel
Dynamic De/Centralization in Germany, 1949–2010
- DOI:
- Year published: 2019
- Journal:
- Impact factor: 0
- Authors: André Kaiser; Stephan Vogel
- Corresponding author: Stephan Vogel
Labels for Disorder Mentions in Online Health Forums
- DOI:
- Year published: 2013
- Journal:
- Impact factor: 0
- Authors: Ryen W. White; Bill Hersh; Patricia Driscoll; S. Gorman; Noémie Elhadad; L. Fernández; André Mourão; Flávio Martins; João Magalhães; Haggai Roitman; Sivan Yogev; Yevgenia Tsimerman; Y. Peres; Avare Stewart; Nattiya Kanhabua; Sara Romano; Ernesto Diaz; W. Siberski; W. Nejdl; Ahmed Ali; Walid Magdy; Stephan Vogel; Lorraine Goeuriot; Liadh Kelly; G. Jones; G. Jones; A. Hanbury; Henning Müller; Bernhard Haslhofer; Balaji Polepalli; Ramesh; Hongfeng Yu; Martin Wiesner; M. Pobiruchin; D. Pfeifer; Danny T. Y. Wu; Lei Yang; Qiaozhu Mei; D. Hanauer; Kai Zheng; Stephen Wu; Dongqing Zhu; W. Hersh; Hongfang Liu; Andrew Yates; Nazli Goharian; O. Frieder
- Corresponding author: O. Frieder
Upward lightning attachment analysis on wind turbines and correlated current parameters
- DOI:
- Year published: 2017
- Journal:
- Impact factor: 0
- Authors: Stephan Vogel; 齋藤 幹久; 石井 勝
- Corresponding author: 石井 勝
Correlations of current parameters with flash density from winter thunderstorms in Japan
- DOI:
- Year published: 2017
- Journal:
- Impact factor: 0
- Authors: Stephan Vogel; 齋藤 幹久; 石井 勝
- Corresponding author: 石井 勝
Other grants by Stephan Vogel
Workshop Proposal: Student Research Workshop at AMTA-2010
- Award number: 1048559
- Fiscal year: 2010
- Funding amount: $449,100
- Project type: Standard Grant
RI-Small: Exploiting Comparable Corpora for Machine Translation (CC4MT)
- Award number: 0916866
- Fiscal year: 2009
- Funding amount: $449,100
- Project type: Standard Grant
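Similar NSFC grants
Greenwashing behavior in China: Based on an integrated view of reconfiguration of environmental authority and decoupling logic
- Award number:
- Approval year: 2024
- Funding amount: not stated (unit: CNY 10,000)
- Project type: Research Fund for International Scientists
Construction of an integrated behavioral and fine-grained behavioral evaluation system for mouse models of anxiety disorder
- Award number:
- Approval year: 2024
- Funding amount: 0.0 (unit: CNY 10,000)
- Project type: Provincial/municipal project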
Similar overseas grants
Integrated Quantum Frequency Combs for Cluster States Generation
- Award number: EP/V062492/1
- Fiscal year: 2021
- Funding amount: $449,100
- Project type: Research Grant
Effectiveness of an Integrated Care Pathway for Adolescent Depression: A Pilot Multi-site Cluster Randomized Controlled Trial
- Award number: 420251
- Fiscal year: 2020
- Funding amount: $449,100
- Project type: Operating Grants
A cluster randomized trial of an mHealth integrated model of hypertension, diabetes and antenatal care in primary care settings in India and Nepal
- Award number: MR/R022127/1
- Fiscal year: 2018
- Funding amount: $449,100
- Project type: Research Grant
Repeat Ivermectin Mass Drug Administrations for MALaria control II (RIMDAMAL II): a double-blind cluster randomized trial for integrated control of malaria
- Award number: 9754784
- Fiscal year: 2018
- Funding amount: $449,100
- Project type:
Repeat Ivermectin Mass Drug Administrations for MALaria control II (RIMDAMAL II): a double-blind cluster randomized trial for integrated control of malaria
- Award number: 10468728
- Fiscal year: 2018
- Funding amount: $449,100
- Project type:
Repeat Ivermectin Mass Drug Administrations for MALaria control II (RIMDAMAL II): a double-blind cluster randomized trial for integrated control of malaria
- Award number: 10223135
- Fiscal year: 2018
- Funding amount: $449,100
- Project type:
An integrated health-sector strategy to combat COPD and asthma in Vietnam: A pragmatic stepped intervention cluster randomized trial
- Award number: nhmrc : 1116020
- Fiscal year: 2017
- Funding amount: $449,100
- Project type: Targeted Calls
An integrated health-sector strategy to combat COPD and asthma in Vietnam: A pragmatic stepped intervention cluster randomized trial
- Award number: nhmrc : GNT1116020
- Fiscal year: 2017
- Funding amount: $449,100
- Project type: International Collaborations
Integrated solutions for healthy birth, growth, and development: A cluster-randomized controlled trial to evaluate the effectiveness of a mixed nutrition intervention package in reducing child undernutrition in Lao People's Democratic Republic
- Award number: nhmrc : GNT1106556
- Fiscal year: 2016
- Funding amount: $449,100
- Project type: Project Grants
Child developmental health, maternal psychosocial distress, and health system costs at 18 months corrected age: Effectiveness of a cluster randomized controlled trial of Family Integrated Care in Level II NICUs
- Award number: 351170
- Fiscal year: 2016
- Funding amount: $449,100
- Project type: Operating Grants