CAREER: Understanding the Performance of Distributed Systems through Causal Tracing
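Basic information
- Award number: 1452712
- Principal investigator:
- Amount: $576,300
- Host institution:
- Host institution country: United States
- Award type: Continuing Grant
- Fiscal year: 2015
- Funding country: United States
- Period: 2015-03-01 to 2020-02-29
- Status: Completed
- Source:
- Keywords:
Project abstract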
Society increasingly depends on shared software systems that are large and decentralized, with many components that interact in complex and subtle ways. These systems include financial and banking services, Web and cloud resources, airline reservations, and big data and scientific computing, to name a few. Despite the unquestionable reach and success of these systems, it is very hard to answer questions about the causes of failures, to uncover dependency issues among their components, to determine the impact of one operation on the rest of the system, or to provide guarantees about their performance to users. This project's goal is to develop and apply techniques that enable deep, real-time understanding of the performance characteristics of large-scale distributed systems, so that users and providers of these systems can better express their needs and guarantees in terms of performance, and better plan for and mitigate the effects of failures.
The main insight in this project is that, because distributed systems have many components, the context of an operation initiated in one component gets lost as the operation involves other components. This makes it hard for a component deep in the system to discern which client it is working for, and therefore hard to apply consistent policies or account for the cost of operations across component boundaries. This research will create the abstraction of a Tracing Plane that preserves this context throughout the entire execution of the system, allowing for debugging and diagnosis of performance problems, and for real-time provisioning of performance guarantees. The Tracing Plane will be a pervasive infrastructure that collects causal information from the execution of a distributed system and facilitates the efficient deployment of analytics and diagnostic tasks. Further, by aggregating information about tasks in the system across all components in a coherent way, the Tracing Plane enables the implementation of resource management policies that can act locally, in real time, and with global knowledge, which is presently not possible.
We are better today at building large-scale distributed systems than we are at understanding precisely how they work and how they fail. This gap gives the project a core educational aspect, as the Tracing Plane is a strong pedagogical tool for understanding the structure and execution of distributed systems. This work will engage undergraduate and graduate students, as well as industry partners that operate such large-scale distributed systems. Starting from increased visibility into these systems, the ultimate goal of this project is to provide tools and methods for building, operating, and managing large-scale, shared distributed systems that are efficient, reliable, and predictable. As society increasingly depends on systems of this kind, this research has a large and long-lasting potential impact.
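The context-propagation idea in the abstract can be illustrated with a small sketch. This is not the project's Tracing Plane implementation; it is a minimal, single-process illustration, and every name in it (`TraceContext`, `Collector`, the three component functions, and the cost values) is hypothetical. It shows a context created at the entry point being passed through each component, so that a component deep in the call chain can still attribute its work to the originating client.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TraceContext:
    """Hypothetical context that travels with an operation across components."""
    client_id: str   # who initiated the operation
    request_id: str  # which operation this work belongs to


class Collector:
    """Hypothetical sink that gathers causally ordered events from all components."""

    def __init__(self) -> None:
        self.events: list[tuple[TraceContext, str, float]] = []

    def report(self, ctx: TraceContext, component: str, cost: float) -> None:
        self.events.append((ctx, component, cost))

    def cost_by_client(self) -> dict[str, float]:
        # Aggregate cost across all components, keyed by the originating client.
        totals: dict[str, float] = defaultdict(float)
        for ctx, _component, cost in self.events:
            totals[ctx.client_id] += cost
        return dict(totals)


def disk_read(ctx: TraceContext, collector: Collector) -> None:
    # Deepest component: it still knows the client because ctx was propagated.
    collector.report(ctx, "disk", 5.0)


def storage_get(ctx: TraceContext, collector: Collector) -> None:
    collector.report(ctx, "storage", 1.0)
    disk_read(ctx, collector)


def frontend_handle(client_id: str, request_id: str, collector: Collector) -> None:
    # The context is created once, where the operation enters the system.
    ctx = TraceContext(client_id, request_id)
    collector.report(ctx, "frontend", 0.5)
    storage_get(ctx, collector)


collector = Collector()
frontend_handle("alice", "r1", collector)
frontend_handle("bob", "r2", collector)
frontend_handle("alice", "r3", collector)
print(collector.cost_by_client())  # {'alice': 13.0, 'bob': 6.5}
```

In a real distributed system the context would have to be serialized into message or RPC metadata at each component boundary rather than passed as a function argument; the sketch omits that step.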
Project outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Rodrigo Fonseca
Hyaluronic Acid for Skin Booster on the Face
- DOI: 10.1007/978-3-319-16802-9_23
- Published: 2018
- Journal:
- Impact factor: 0
- Authors: S. Ypiranga; Rodrigo Fonseca
- Corresponding author: Rodrigo Fonseca
TIPSY: predicting where traffic will ingress a WAN
- DOI:
- Published: 2022
- Journal:
- Impact factor: 0
- Authors: Michael Markovitch; S. Agarwal; Rodrigo Fonseca; Ryan Beckett; Chuanji Zhang; I. Atov; Somesh Chaturmohta
- Corresponding author: Somesh Chaturmohta
Safety Profiles of Bispecific Antibodies in Non-Hodgkin Lymphoma: A Meta-Analysis of Phase I-III Trials
- DOI: 10.1016/s2152-2650(24)00843-7
- Published: 2024-09-01
- Journal:
- Impact factor:
- Authors: Rodrigo Fonseca; Alex Liu; Blake Langlais; Diana Almader-Douglas; Holenarasipur Vikram; Talal Hilal
- Corresponding author: Talal Hilal
Echocardiographic Evaluation of Left and Right Ventricular Function in Subclinical Hypothyroidism
- DOI:
- Published: 2009
- Journal:
- Impact factor: 0
- Authors: P. F. S. Teixeira; Rodrigo Fonseca; M. Ferreira; M. M. Duarte; V. Reuters; C. Almeida; M. Vaisman
- Corresponding author:
Datacenter Network Large Flow Detection and Scheduling from the Edge
- DOI:
- Published: 2014
- Journal:
- Impact factor: 0
- Authors: Rodrigo Fonseca
- Corresponding author: Rodrigo Fonseca
Other grants by Rodrigo Fonseca
NeTS: Small: Participatory Software Defined Networking
- Award number: 1320397
- Fiscal year: 2013
- Amount: $576,300
- Award type: Standard Grant
Similar grants from the National Natural Science Foundation of China
Navigating Sustainability: Understanding Environment, Social and Governance Challenges and Solutions for Chinese Enterprises in Pakistan's CPEC Framework
- Award number:
- Approval year: 2024
- Amount:
- Award type: Research Fund for International Scientists
Understanding structural evolution of galaxies with machine learning
- Award number: n/a
- Approval year: 2022
- Amount: CNY 100,000
- Award type: Provincial/municipal project
Understanding complicated gravitational physics by simple two-shell systems
- Award number: 12005059
- Approval year: 2020
- Amount: CNY 240,000
- Award type: Young Scientists Fund
Similar overseas grants
Understanding the materials performance of additive manufactured stainless steel components in high temperature water
- Award number: 2879262
- Fiscal year: 2023
- Amount: $576,300
- Award type: Studentship
In situ TEM for understanding the electrochemical performance of iridium based catalysts
- Award number: 2905956
- Fiscal year: 2023
- Amount: $576,300
- Award type: Studentship
Baermann's Body: Understanding Embodiment in Historically Informed Performance
- Award number: AH/W010178/1
- Fiscal year: 2023
- Amount: $576,300
- Award type: Fellowship
Understanding functional performance in bird skulls: advanced computational modelling to investigate cranial biomechanics and kinesis
- Award number: BB/X014479/1
- Fiscal year: 2023
- Amount: $576,300
- Award type: Research Grant
Understanding materials at the nanoscale for improved thermoelectric performance
- Award number: 2826006
- Fiscal year: 2023
- Amount: $576,300
- Award type: Studentship
Designing porous carbon electrodes for high performance LIBs based on understanding electrochemical reactions in the pores
- Award number: 23H02048
- Fiscal year: 2023
- Amount: $576,300
- Award type: Grant-in-Aid for Scientific Research (B)
Improving building performance through better feedback: understanding the effectiveness of Post Occupancy Evaluation
- Award number: 2863121
- Fiscal year: 2023
- Amount: $576,300
- Award type: Studentship
Post-Consumer Resin - Understanding the quality-performance linkage for packaging
- Award number: NE/V010778/2
- Fiscal year: 2023
- Amount: $576,300
- Award type: Research Grant
Understanding how servitization can impact UK economic productivity and environmental performance
- Award number: ES/W010194/1
- Fiscal year: 2022
- Amount: $576,300
- Award type: Research Grant
Advancing Our Understanding of How Cardiovascular Control, Function and Plasticity Influence Fish Performance and Ecophysiology
- Award number: RGPIN-2022-03790
- Fiscal year: 2022
- Amount: $576,300
- Award type: Discovery Grants Program - Individual