AitF: Collaborative Research: Fast, Accurate, and Practical: Adaptive Sublinear Algorithms for Scalable Visualization
AitF:协作研究:快速、准确和实用:用于可扩展可视化的自适应次线性算法
基本信息
- 批准号:1733878
- 负责人:
- 金额:$ 23.4万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2017
- 资助国家:美国
- 起止时间:2017-09-15 至 2019-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
With the wealth of data being generated in every sphere of human endeavor, data exploration--analyzing, understanding, and extracting value from data--has become absolutely vital. Data visualization is by far the most common data exploration mechanism, used by novice and expert data analysts alike. Yet data visualization on increasingly larger datasets remains difficult: even simple visualizations of a large dataset can be slow and non-interactive, while visualizations of a sampled fraction of a dataset can mislead an analyst. The project aims to develop FastViz, a scalable visualization engine, that will not only enable visualization on datasets that are orders of magnitude larger in the same time, but also ensure the resulting visualizations satisfy key properties essential for correct analysis by end-users. To ensure immediate utilization, FastViz will be applied to three real-world application domains: battery science, advertising analysis, and genomic data analysis, and implemented in Zenvisage, an open-source visual exploration platform developed by the PIs. Students in the project gain invaluable experience in combining the algorithmic and systems considerations that enable data exploration. FastViz's development is driven by simultaneous investigation of systems considerations, such as indexing and storage techniques that enable various forms of online sampling, and algorithmic considerations for (a) visualization generation, where the goal is to produce incrementally improving visualizations in which the important features are displayed first, and (b) visualization selection, where the goal is to select, from a collection of as yet not generated visualizations, those that satisfy desired criteria. On the systems front, FastViz will leverage and contribute back to recent developments on online sampling systems that enable the use of more powerful sampling modalities. On the algorithms front, FastViz will draw ideas from testing, distribution learning, and sublinear algorithms literature that, to the best knowledge of the PIs, have not been adapted in practice. The algorithms developed will obey optimality guarantees, and wherever possible, instance-optimality guarantees, ensuring that they will adapt to data characteristics in the most efficient way possible. The project will lead to a better understanding of the interplay between sampling algorithms development and systems design, facilitating the adoption of more realistic models and algorithms on the one hand, and the development of more powerful sampling engines that enable the models required within the algorithms.
随着人类奋进的各个领域产生大量数据,数据探索-分析,理解和从数据中提取价值-已经变得绝对重要。数据可视化是迄今为止最常见的数据探索机制,新手和专家数据分析师都使用它。然而,在越来越大的数据集上进行数据可视化仍然很困难:即使是大型数据集的简单可视化也可能很慢且不具有交互性,而数据集的采样部分的可视化可能会误导分析师。该项目旨在开发FastViz,这是一种可扩展的可视化引擎,它不仅能够同时对更大数量级的数据集进行可视化,而且还确保生成的可视化满足最终用户正确分析所必需的关键属性。为了确保立即使用,FastViz将应用于三个现实世界的应用领域:电池科学,广告分析和基因组数据分析,并在由PI开发的开源视觉探索平台Zenvision中实现。 该项目的学生在结合算法和系统考虑因素,使数据探索方面获得宝贵的经验。FastViz的开发是由系统考虑因素的同时调查驱动的,例如能够实现各种形式的在线采样的索引和存储技术,以及用于(a)可视化生成的算法考虑因素,其中目标是产生增量改进的可视化,其中首先显示重要特征,以及(B)可视化选择,其中目标是选择,从尚未生成的可视化的集合中选择满足期望标准的可视化。在系统方面,FastViz将利用并促进在线采样系统的最新发展,从而能够使用更强大的采样模式。 在算法方面,FastViz将从测试、分布学习和次线性算法文献中汲取想法,据PI所知,这些想法尚未在实践中得到应用。 开发的算法将遵守最优性保证,并在可能的情况下,实例最优性保证,确保它们将以最有效的方式适应数据特征。 该项目将导致更好地了解采样算法开发和系统设计之间的相互作用,一方面促进采用更现实的模型和算法,并开发更强大的采样引擎,使算法中所需的模型。
项目成果
期刊论文数量(11)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
How Developers Iterate on Machine Learning Workflows
开发人员如何迭代机器学习工作流程
- DOI:
- 发表时间:2018
- 期刊:
- 影响因子:0
- 作者:Xin, D;Song, S;Parameswaran, A.
- 通讯作者:Parameswaran, A.
Anti-Freeze for Large and Complex Spreadsheets: Asynchronous Formula Computation
大型复杂电子表格的防冻:异步公式计算
- DOI:10.1145/3299869.3319876
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Bendre, Mangesh;Wattanawaroon, Tana;Mack, Kelly;Chang, Kevin;Parameswaran, Aditya
- 通讯作者:Parameswaran, Aditya
H elix: accelerating human-in-the-loop machine learning
He elix:加速人机循环机器学习
- DOI:10.14778/3229863.3236234
- 发表时间:2018
- 期刊:
- 影响因子:2.5
- 作者:Xin, Doris;Ma, Litian;Liu, Jialin;Macke, Stephen;Song, Shuchen;Parameswaran, Aditya
- 通讯作者:Parameswaran, Aditya
An Exploratory User Study of Visual Causality Analysis
- DOI:10.1111/cgf.13680
- 发表时间:2019-06
- 期刊:
- 影响因子:2.5
- 作者:Chi-Hsien Yen;Aditya G. Parameswaran;W. Fu
- 通讯作者:Chi-Hsien Yen;Aditya G. Parameswaran;W. Fu
Faster, Higher, Stronger: Redesigning Spreadsheets for Scale
更快、更高、更强:重新设计电子表格以实现规模化
- DOI:10.1109/icde.2019.00217
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Bendre, Mangesh;Wattanawaroon, Tana;Rahman, Sajjadur;Mack, Kelly;Liu, Yuyang;Zhu, Shichu;Lu, Yu;Yang, Ping-Jing;Zhou, Xinyan;Chang, Kevin Chen-Chuan
- 通讯作者:Chang, Kevin Chen-Chuan
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Aditya Parameswaran其他文献
$$\varvec{\textsc {Orpheus}}$$ DB: bolt-on versioning for relational databases (extended version)
- DOI:
10.1007/s00778-019-00594-5 - 发表时间:
2019-12-20 - 期刊:
- 影响因子:3.800
- 作者:
Silu Huang;Liqi Xu;Jialin Liu;Aaron J. Elmore;Aditya Parameswaran - 通讯作者:
Aditya Parameswaran
Aditya Parameswaran的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Aditya Parameswaran', 18)}}的其他基金
FW-HTF-R: Human-Machine Teaming for Effective Data Work at Scale: Upskilling Defense Lawyers Working with Police and Court Process Data
FW-HTF-R:大规模有效数据工作的人机协作:提高辩护律师处理警察和法院流程数据的技能
- 批准号:
2129008 - 财政年份:2021
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: Fast, Accurate, and Practical: Adaptive Sublinear Algorithms for Scalable Visualization
AitF:协作研究:快速、准确和实用:用于可扩展可视化的自适应次线性算法
- 批准号:
1940759 - 财政年份:2019
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
CAREER: Advancing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management
职业:推进开放式众包:众包数据管理的下一个前沿
- 批准号:
1940757 - 财政年份:2019
- 资助金额:
$ 23.4万 - 项目类别:
Continuing Grant
CAREER: Advancing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management
职业:推进开放式众包:众包数据管理的下一个前沿
- 批准号:
1652750 - 财政年份:2017
- 资助金额:
$ 23.4万 - 项目类别:
Continuing Grant
III: Medium: Collaborative Research: DataHub - A Collaborative Dataset Management Platform for Data Science
III:媒介:协作研究:DataHub - 数据科学协作数据集管理平台
- 批准号:
1513407 - 财政年份:2015
- 资助金额:
$ 23.4万 - 项目类别:
Continuing Grant
相似海外基金
AitF: Collaborative Research: Topological Algorithms for 3D/4D Cardiac Images: Understanding Complex and Dynamic Structures
AitF:协作研究:3D/4D 心脏图像的拓扑算法:理解复杂和动态结构
- 批准号:
2051197 - 财政年份:2020
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: Fast, Accurate, and Practical: Adaptive Sublinear Algorithms for Scalable Visualization
AitF:协作研究:快速、准确和实用:用于可扩展可视化的自适应次线性算法
- 批准号:
1940759 - 财政年份:2019
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: Fast, Accurate, and Practical: Adaptive Sublinear Algorithms for Scalable Visualization
AitF:协作研究:快速、准确和实用:用于可扩展可视化的自适应次线性算法
- 批准号:
2006206 - 财政年份:2019
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AiTF: Collaborative Research: Distributed and Stochastic Algorithms for Active Matter: Theory and Practice
AiTF:协作研究:活跃物质的分布式随机算法:理论与实践
- 批准号:
1733812 - 财政年份:2018
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: A Framework of Simultaneous Acceleration and Storage Reduction on Deep Neural Networks Using Structured Matrices
AitF:协作研究:使用结构化矩阵的深度神经网络同时加速和存储减少的框架
- 批准号:
1854742 - 财政年份:2018
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AiTF: Collaborative Research: Distributed and Stochastic Algorithms for Active Matter: Theory and Practice
AiTF:协作研究:活跃物质的分布式随机算法:理论与实践
- 批准号:
1733680 - 财政年份:2018
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: Topological Algorithms for 3D/4D Cardiac Images: Understanding Complex and Dynamic Structures
AitF:协作研究:3D/4D 心脏图像的拓扑算法:理解复杂和动态结构
- 批准号:
1855760 - 财政年份:2018
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: Automated Medical Image Segmentation via Object Decomposition
AitF:协作研究:通过对象分解进行自动医学图像分割
- 批准号:
1733742 - 财政年份:2017
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: Fast, Accurate, and Practical: Adaptive Sublinear Algorithms for Scalable Visualization
AitF:协作研究:快速、准确和实用:用于可扩展可视化的自适应次线性算法
- 批准号:
1733796 - 财政年份:2017
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant
AitF: Collaborative Research: Algorithms and Mechanisms for the Distribution Grid
AitF:协作研究:配电网算法和机制
- 批准号:
1733832 - 财政年份:2017
- 资助金额:
$ 23.4万 - 项目类别:
Standard Grant