Compiling and Optimizing Iterative Data Analysis Programs with Shared State on Evolving Datasets
在不断变化的数据集上编译和优化具有共享状态的迭代数据分析程序
基本信息
- 批准号:248356729
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:德国
- 项目类别:Research Units
- 财政年份:2013
- 资助国家:德国
- 起止时间:2012-12-31 至 2016-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The goal of Project A within the Stratosphere II research unit is to research, design, and develop a data programming language, an associated optimizing compiler, an intermediate data- and control- flow representation, and an optimizer that determines efficient execution strategies for workloads of data analysis programs with iterations and stateful operators, both over static data and over infinite, evolving datasets. The project will research language abstractions for specifying iterations and state, novel optimizations for iterative and stateful programs targeting performance as well as novel fault tolerance schemes for massively parallel iterative algorithms, and optimization of workloads of programs including work sharing between programs. The project will also demonstrate the overall effectiveness of Stratosphere II by integrating the results of all projects in a coherent system, identifying a relevant use-case workload, and evaluating and benchmarking the system performance. In particular, this project aims at answering following questions:1. What are the necessary language and system primitives to abstract parallelization and state, and expose mutable state management to the programmer of DAPs without compromizing scalability, performance, and fault tolerance?2. What are the optimizing program transformations that apply to a DAP with state and iterations and create a more efficient program?3. To what extent can an optimizing language compiler optimize advanced data analytics applications with state and iterations?4. To what extent can we support the fault-tolerant and efficient execution of DAPs with state via languagelevel features that expose algorithmic aspects of programs?5. How can we build an optimizer for workloads of DAPs to optimize state management across DAPs?
Stratosphere II研究单元内的项目A的目标是研究、设计和开发一种数据编程语言、一个相关的优化编译器、一个中间数据流和控制流表示以及一个优化器,该优化器为具有迭代和有状态运算符的数据分析程序的工作负载确定有效的执行策略,无论是在静态数据上还是在无限的、不断发展的数据集上。该项目将研究用于指定迭代和状态的语言抽象,针对性能的迭代和有状态程序的新颖优化,以及大规模并行迭代算法的新颖容错方案,以及程序工作负载的优化,包括程序之间的工作共享。该项目还将通过将所有项目的成果整合到一个连贯的系统中,确定相关的用例工作量,并对系统性能进行评估和基准测试,来展示平流层II的总体有效性。具体而言,本项目旨在回答以下问题:1。什么是必要的语言和系统原语来抽象并行化和状态,并向DAP的程序员公开可变的状态管理,而不会损害可伸缩性,性能和容错性?2.什么是应用于具有状态和迭代的DAP并创建更有效程序的优化程序转换?3.优化语言编译器可以在多大程度上优化具有状态和迭代的高级数据分析应用程序?4.在多大程度上,我们可以通过暴露程序算法方面的语言级功能来支持具有状态的DAP的容错和有效执行?5.我们如何为DAP的工作负载构建优化器,以优化DAP之间的状态管理?
项目成果
期刊论文数量(11)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
BlockJoin: Efficient Matrix Partitioning Through Joins
BlockJoin:通过连接进行高效的矩阵分区
- DOI:10.14778/3151106.3151110
- 发表时间:2017
- 期刊:
- 影响因子:0
- 作者:Andreas
- 通讯作者:Andreas
"All roads lead to Rome": optimistic recovery for distributed iterative data processing
- DOI:10.1145/2505515.2505753
- 发表时间:2013-10
- 期刊:
- 影响因子:0
- 作者:Sebastian Schelter;Stephan Ewen;K. Tzoumas;V. Markl
- 通讯作者:Sebastian Schelter;Stephan Ewen;K. Tzoumas;V. Markl
Spinning Fast Iterative Data Flows
- DOI:10.14778/2350229.2350245
- 发表时间:2012-07
- 期刊:
- 影响因子:0
- 作者:Stephan Ewen;K. Tzoumas;Moritz Kaufmann;V. Markl
- 通讯作者:Stephan Ewen;K. Tzoumas;Moritz Kaufmann;V. Markl
Emma in Action: Declarative Dataflows for Scalable Data Analysis
- DOI:10.1145/2882903.2899396
- 发表时间:2016-06
- 期刊:
- 影响因子:0
- 作者:Alexander B. Alexandrov;Andreas Salzmann;Georgi Krastev;Asterios Katsifodimos;V. Markl
- 通讯作者:Alexander B. Alexandrov;Andreas Salzmann;Georgi Krastev;Asterios Katsifodimos;V. Markl
Massively parallel data analysis with PACTs on Nephele
- DOI:10.14778/1920841.1921056
- 发表时间:2010-09
- 期刊:
- 影响因子:2.5
- 作者:Alexander B. Alexandrov;Dominic Battré;Stephan Ewen;Max Heimel;Fabian Hueske;O. Kao;V. Markl;Erik Nijkamp;Daniel Warneke
- 通讯作者:Alexander B. Alexandrov;Dominic Battré;Stephan Ewen;Max Heimel;Fabian Hueske;O. Kao;V. Markl;Erik Nijkamp;Daniel Warneke
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Professor Dr. Volker Markl其他文献
Professor Dr. Volker Markl的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Professor Dr. Volker Markl', 18)}}的其他基金
Query Compilation for the Heterogeneous Many Core Age
异构多核时代的查询编译
- 批准号:
361497736 - 财政年份:2017
- 资助金额:
-- - 项目类别:
Priority Programmes
Stratosphere Data and Processing Model, its Optimization and Parallelization
平流层数据和处理模型、优化和并行化
- 批准号:
174426733 - 财政年份:2010
- 资助金额:
-- - 项目类别:
Research Units
Adaptive Query Compilation for Stream Processing
流处理的自适应查询编译
- 批准号:
447268056 - 财政年份:
- 资助金额:
-- - 项目类别:
Priority Programmes
相似海外基金
SBIR Phase I: Optimizing Safety and Fuel Efficiency in Autonomous Rendezvous and Proximity Operations (RPO) of Uncooperative Objects
SBIR 第一阶段:优化不合作物体自主交会和邻近操作 (RPO) 的安全性和燃油效率
- 批准号:
2311379 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
Optimizing Intelligent Vehicular Routing with Edge Computing through Multi-Agent Reinforcement Learning
通过多智能体强化学习利用边缘计算优化智能车辆路由
- 批准号:
24K14913 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Scientific Research (C)
Optimizing the STEM Teacher Recruitment, Development and Retention Pathway: Aligning STEM and Teaching Identities
优化 STEM 教师的招聘、发展和保留途径:协调 STEM 和教学身份
- 批准号:
2345197 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Continuing Grant
I-Corps: Translation Potential of Optimizing Regression Testing in Software Development
I-Corps:软件开发中优化回归测试的转化潜力
- 批准号:
2405355 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
CAREER: Characterizing and Optimizing Control in Neural Interfaces
职业:表征和优化神经接口控制
- 批准号:
2338662 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
SBIR Phase I: Optimizing Composition of Novel Molten Alkali Metal Borates for Carbon Dioxide Capture
SBIR 第一阶段:优化用于二氧化碳捕获的新型熔融碱金属硼酸盐的成分
- 批准号:
2332658 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
SCC-PG: Internet of Waste: A Low-Cost Geospatial Sensor Network for Optimizing Solid Waste Management and Fostering Resident's Recycling Effectiveness Through Evidential Education
SCC-PG:废物互联网:通过循证教育优化固体废物管理并提高居民回收效率的低成本地理空间传感器网络
- 批准号:
2341996 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
DMREF: Optimizing Problem formulation for prinTable refractory alloys via Integrated MAterials and processing co-design (OPTIMA)
DMREF:通过集成材料和加工协同设计 (OPTIMA) 优化可打印耐火合金的问题表述
- 批准号:
2323611 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Continuing Grant
CAREER: Optimizing Healthcare Operations through Incentive Mechanism Design
职业生涯:通过激励机制设计优化医疗运营
- 批准号:
2339767 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Continuing Grant
SCC-PG: Optimizing sociotechnical interventions for healthcare access using community sensing
SCC-PG:利用社区感知优化医疗保健获取的社会技术干预措施
- 批准号:
2334724 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant