Collaborative Research: A Comparative Study of Approaches to Cluster-Based Large Scale Data Analysis
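Basic Information
- Award number: 0844480
- Principal investigator:
- Amount: $239,300
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2009
- Funding country: United States
- Period: 2009-02-01 to 2012-07-31
- Status: Completed
- Source:
- Keywords: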
Project Abstract
The goal of this research project is to understand the tradeoffs between the MapReduce and parallel DBMS approaches to performing large-scale data analysis over large clusters of computers, and to bring together ideas from both communities. Both MapReduce and parallel database systems provide scalable data processing over hundreds to thousands of nodes. Both provide a stylized, high-level programming environment that allows users to efficiently filter and combine datasets while masking much of the complexity of parallelizing computation over a cluster. But they differ in substantial ways as well, such as their approaches to dealing with fault tolerance, their data modeling requirements, their query flexibility, and their ability to function in a heterogeneous processing environment.
This multi-university team of researchers is investigating the effect of these differences on the performance and scalability of the two approaches. The research team is running a set of experiments that compare an open source MapReduce implementation (Hadoop) to two commercial parallel database systems (DB2 and Vertica) on a benchmark that includes a range of tasks designed to assess the tradeoffs between the two approaches. The research team is seeking to understand which differences between the two approaches to large-scale data analysis are fundamental tradeoffs, and which can be combined inside a single solution, so that ideas from one community can benefit the other.
Project Outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Daniel Abadi
Utility of a multidimensional recovery framework in understanding lived experiences of Chilean and Brazilian mental health service users
- DOI:
- Year: 2021
- Journal:
- Impact factor: 0
- Authors: M. Agrest; Silvia Alves Nishioka; PhuongThao D. Le; Gabriella A. Dishy; Catarina Magalhães Dahl; N. Vera San Juan; Franco Mascayano; Tanvi Kankan; Saloni Dev; Daniel Abadi; María Tavares Cavalancti; R. Whitley; E. Valência; Ruben Alvarado Muñoz; Lawrence H. Yang; Ezra Susser
- Corresponding author: Ezra Susser
Other grants by Daniel Abadi
III: Small: Peer-to-peer Database (P2PDB): A decentralized, scalable data sharing and management platform
- Award number: 1910613
- Fiscal year: 2019
- Amount: $239,300
- Award type: Continuing Grant
III: Small: Multi-Version Concurrency Control on Modern Hardware
- Award number: 1718581
- Fiscal year: 2017
- Amount: $239,300
- Award type: Continuing Grant
III: Small: Scalable, Practical Deterministic Database Systems
- Award number: 1763797
- Fiscal year: 2017
- Amount: $239,300
- Award type: Standard Grant
III: Small: Scalable, Practical Deterministic Database Systems
- Award number: 1527118
- Fiscal year: 2015
- Amount: $239,300
- Award type: Standard Grant
EAGER: Scaling the Preprocessor and Making it More Intelligent in Deterministic Database Systems
- Award number: 1249722
- Fiscal year: 2012
- Amount: $239,300
- Award type: Standard Grant
CAREER: Architecting A Database Management System for Semantic Web Data
- Award number: 0845643
- Fiscal year: 2009
- Amount: $239,300
- Award type: Continuing Grant
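Similar NSFC (National Natural Science Foundation of China) grants
Research on Quantum Field Theory without a Lagrangian Description
- Award number: 24ZR1403900
- Approval year: 2024
- Amount: CNY 0
- Award type: Provincial/municipal project
Cell Research
- Award number: 31224802
- Approval year: 2012
- Amount: CNY 240,000
- Award type: Special fund project
Cell Research
- Award number: 31024804
- Approval year: 2010
- Amount: CNY 240,000
- Award type: Special fund project
Cell Research
- Award number: 30824808
- Approval year: 2008
- Amount: CNY 240,000
- Award type: Special fund project
Research on the Rapid Growth Mechanism of KDP Crystal
- Award number: 10774081
- Approval year: 2007
- Amount: CNY 450,000
- Award type: General Program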
Similar overseas grants
Collaborative Research: How to manipulate a plant? Testing for conserved effectors and plant responses in gall induction and growth using a multi-species comparative approach.
- Award number: 2305880
- Fiscal year: 2023
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: Ecologies of Participation in Island Karst Science and Conservation: A Comparative Multimethods Approach
- Award number: 2236152
- Fiscal year: 2023
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: Ecologies of Participation in Island Karst Science and Conservation: A Comparative Multimethods Approach
- Award number: 2236151
- Fiscal year: 2023
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: RESEARCH-PGR: Comparative genomics of the capitulum: deciphering the molecular basis of a key floral innovation
- Award number: 2214473
- Fiscal year: 2022
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: Comparative genomics and physiology to discover integrated mechanisms that support phenotypic plasticity
- Award number: 2200320
- Fiscal year: 2022
- Amount: $239,300
- Award type: Continuing Grant
Collaborative Research: RESEARCH-PGR: Comparative genomics of the capitulum: deciphering the molecular basis of a key floral innovation
- Award number: 2214472
- Fiscal year: 2022
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: RESEARCH-PGR: Comparative genomics of the capitulum: deciphering the molecular basis of a key floral innovation
- Award number: 2214474
- Fiscal year: 2022
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: Comparative genomics and physiology to discover integrated mechanisms that support phenotypic plasticity
- Award number: 2200319
- Fiscal year: 2022
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: RUI: Dynamic Learning in Comparative Courts: A Cross-National Analysis of Judicial Decision Making in Canada, the United States, and the United Kingdom
- Award number: 2325460
- Fiscal year: 2022
- Amount: $239,300
- Award type: Standard Grant
Collaborative Research: RUI: Comparative analysis of endocytic trafficking during cell division
- Award number: 2052517
- Fiscal year: 2021
- Amount: $239,300
- Award type: Standard Grant