Optimizing compute task scheduling at Shopify
优化 Shopify 的计算任务调度
基本信息
- 批准号:514614-2017
- 负责人:
- 金额:$ 1.82万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Engage Grants Program
- 财政年份:2017
- 资助国家:加拿大
- 起止时间:2017-01-01 至 2018-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Big Data processing frameworks allow enterprises to drive their business based on key insights extracted fromnear real-time data. Apache Spark is a framework that Shopify leverages to collect and analyze customerinformation and expose data insights to the shop owners who are users of the Shopify platform. For example,the enterprise which built its online store using the Shopify platform can send pre-set queries to the platform tolearn about the impact of their recent marketing campaign (e.g., with an improved Google ad keywords) bygeographical region or by age group, and see if the marketing campaign resulted in actual product sales.Shopify uses Spark to analyze data by passing the data through a processing pipeline. The pipeline consists ofmultiple jobs, each applying some computation to the data. A set of connected jobs form a flow which mightcorrespond to the query exposed to the shop owner. In this project we will explore algorithmic solutions tomaintain a useful flow execution frequency while absorbing the variance caused by job failures, and ensuringco-execution of pre-scheduled and ad-hoc jobs without resource starvation. We intend to apply approachesbased on heuristics and constraint satisfaction, study their tradeoffs, and develop a solution that works best inpractice.
大数据处理框架允许企业基于从近乎实时的数据中提取的关键见解来驱动业务。Apache Spark是一个框架,Shopify利用它来收集和分析客户信息,并向作为Shopify平台用户的店主公开数据见解。例如,使用Shopify平台建立在线商店的企业可以向该平台发送预先设置的查询,以了解其最近的营销活动(例如,改进b谷歌广告关键字)按地理区域或年龄组的影响,并查看营销活动是否导致实际产品销售。Shopify使用Spark通过处理管道传递数据来分析数据。管道由多个作业组成,每个作业对数据应用一些计算。一组连接的作业形成一个流,该流可能对应于向商店所有者公开的查询。在这个项目中,我们将探索算法解决方案,在吸收作业失败引起的变化的同时,保持有用的流执行频率,并确保在没有资源短缺的情况下,预先安排和临时作业的共同执行。我们打算应用基于启发式和约束满足的方法,研究它们的权衡,并开发一个在实践中最有效的解决方案。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Beschastnikh, Ivan其他文献
Using Declarative Specification to Improve the Understanding, Extensibility, and Comparison of Model-Inference Algorithms
- DOI:
10.1109/tse.2014.2369047 - 发表时间:
2015-04-01 - 期刊:
- 影响因子:7.4
- 作者:
Beschastnikh, Ivan;Brun, Yuriy;Krishnamurthy, Arvind - 通讯作者:
Krishnamurthy, Arvind
Biscotti: A Blockchain System for Private and Secure Federated Learning
- DOI:
10.1109/tpds.2020.3044223 - 发表时间:
2021-07-01 - 期刊:
- 影响因子:5.3
- 作者:
Shayan, Muhammad;Fung, Clement;Beschastnikh, Ivan - 通讯作者:
Beschastnikh, Ivan
Visualizing Distributed System Executions
可视化分布式系统执行
- DOI:
10.1145/3375633 - 发表时间:
2020 - 期刊:
- 影响因子:4.4
- 作者:
Beschastnikh, Ivan;Liu, Perry;Xing, Albert;Wang, Patty;Brun, Yuriy;Ernst, Michael D. - 通讯作者:
Ernst, Michael D.
Beschastnikh, Ivan的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Beschastnikh, Ivan', 18)}}的其他基金
Compiling Distributed System Models into Implementations
将分布式系统模型编译为实现
- 批准号:
RGPIN-2020-05203 - 财政年份:2022
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Compiling Distributed System Models into Implementations
将分布式系统模型编译为实现
- 批准号:
RGPIN-2020-05203 - 财政年份:2021
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Compiling Distributed System Models into Implementations
将分布式系统模型编译为实现
- 批准号:
RGPIN-2020-05203 - 财政年份:2020
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Improving the Construction of Correct Distributed Systems
改进正确的分布式系统的构建
- 批准号:
RGPIN-2019-05090 - 财政年份:2019
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Model inference and testing of distributed systems
分布式系统的模型推理与测试
- 批准号:
RGPIN-2014-04870 - 财政年份:2018
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Model inference and testing of distributed systems
分布式系统的模型推理与测试
- 批准号:
RGPIN-2014-04870 - 财政年份:2017
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Model inference and testing of distributed systems
分布式系统的模型推理与测试
- 批准号:
RGPIN-2014-04870 - 财政年份:2016
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Model inference and testing of distributed systems
分布式系统的模型推理与测试
- 批准号:
RGPIN-2014-04870 - 财政年份:2015
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Model inference and testing of distributed systems
分布式系统的模型推理与测试
- 批准号:
RGPIN-2014-04870 - 财政年份:2014
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
CC* Campus Compute: UTEP Cyberinfrastructure for Scientific and Machine Learning Applications
CC* 校园计算:用于科学和机器学习应用的 UTEP 网络基础设施
- 批准号:
2346717 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
SHF: Small: Redesigning the Memory System in the Era of Compute Express Link
SHF:小型:重新设计 Compute Express Link 时代的内存系统
- 批准号:
2333049 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
CC* Campus Compute: Building a Computational Cluster for Scientific Discovery
CC* 校园计算:构建科学发现计算集群
- 批准号:
2346673 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
CC* Campus Compute: Interdisciplinary GPU-Enabled Compute
CC* 校园计算:支持 GPU 的跨学科计算
- 批准号:
2346343 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
MYRTUS: Multi-layer 360° dYnamic orchestrion and interopeRable design environmenT for compute-continUum Systems
MYRTUS:用于连续计算系统的多层 360° 动态编排和可互操作设计环境
- 批准号:
10087666 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
EU-Funded
CAREER: Reinventing Computer Vision through Bio-inspired Retinomorphic Vision Sensors, Corticomorphic Compute-In-Memory Processors and Event-based Algorithms
职业:通过仿生视网膜形态视觉传感器、皮质形态内存计算处理器和基于事件的算法重塑计算机视觉
- 批准号:
2338171 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Continuing Grant
Equipment: CC* Campus Compute: A High-Performance Computing System for Research and Education in Arkansas
设备:CC* 校园计算:用于阿肯色州研究和教育的高性能计算系统
- 批准号:
2346752 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
Research Infrastructure: CC* Campus Compute: Lawrence 2.0: Advancing Multi-Disciplinary Research and Education in South Dakota
研究基础设施:CC* 校园计算:Lawrence 2.0:推进南达科他州的多学科研究和教育
- 批准号:
2346643 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
Collaborative Research: FET: Medium:Compact and Energy-Efficient Compute-in-Memory Accelerator for Deep Learning Leveraging Ferroelectric Vertical NAND Memory
合作研究:FET:中型:紧凑且节能的内存计算加速器,用于利用铁电垂直 NAND 内存进行深度学习
- 批准号:
2312886 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
Collaborative Research: FET: Medium:Compact and Energy-Efficient Compute-in-Memory Accelerator for Deep Learning Leveraging Ferroelectric Vertical NAND Memory
合作研究:FET:中型:紧凑且节能的内存计算加速器,用于利用铁电垂直 NAND 内存进行深度学习
- 批准号:
2312884 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant