III: Medium: Collaborative Research: Database-As-A-Service for Long Tail Science
III:媒介:合作研究:长尾科学的数据库即服务
基本信息
- 批准号:1064505
- 负责人:
- 金额:$ 34.3万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2011
- 资助国家:美国
- 起止时间:2011-08-01 至 2015-07-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
With tremendous amounts of data existing in scientific applications, database management becomes a critical issue, but database technology is not keeping pace. This problem is especially acute in the long tail of science: the large number of relatively small labs and individual researchers who collectively produce the majority of scientific results. These researchers lack the IT staff and specialized skills to deploy technology at scale, but have begun to routinely access hundreds of files and potentially terabytes of data to answer a scientific question. This project develops the architecture for a database-as-a-service platform for science. It explores techniques to automate the remaining barriers to use: ingesting data from native sources and automatically bootstrapping an initial set of queries and visualizations, in part by aggressively mining a shared corpus of data, queries, and user activity. It investigates methods to extract global knowledge and patterns while offering scientists access control over their data, and some formal privacy guarantees. The Intellectual Merit of this proposal consists of automating non-trivial cognitive tasks associated with data work: information extraction from unstructured data sources, data cleaning, logical schema design, privacy control, visualization, and application-building. As Broader Impacts, the project helps scientists reduce the proportion of time spent "handling data" rather than "doing science." All software resulting from this project are open source, and all findings are disseminated broadly through publications and workshops. Sustainable support for science users of the software is coordinated through the University of Washington eScience Institute. The research is incorporated in both undergraduate and graduate computer science courses, and the software is also incorporated into domain science courses as well. The project's outreach activities include advising students through special programs geared toward under-represented groups such as the CRA-W DREU. More information about this project is found at http://escience.washington.edu/dbaas.
随着科学应用中存在大量数据,数据库管理成为一个关键问题,但数据库技术却没有跟上。这个问题在科学的长尾中尤其严重:大量相对较小的实验室和个体研究人员共同产生了大多数科学成果。这些研究人员缺乏IT人员和专业技能来大规模部署技术,但已经开始例行访问数百个文件和潜在的tb级数据来回答一个科学问题。本项目开发了一个科学数据库即服务平台的体系结构。它探索了自动化其余使用障碍的技术:从本地源摄取数据并自动引导一组初始查询和可视化,部分是通过积极挖掘数据、查询和用户活动的共享语料库。它研究了提取全球知识和模式的方法,同时为科学家提供对数据的访问控制,以及一些正式的隐私保证。该建议的智力优势包括自动化与数据工作相关的重要认知任务:从非结构化数据源提取信息、数据清理、逻辑模式设计、隐私控制、可视化和应用程序构建。作为更广泛的影响,该项目帮助科学家减少了“处理数据”而不是“做科学”的时间比例。这个项目产生的所有软件都是开源的,所有的发现都通过出版物和研讨会广泛传播。对该软件的科学用户的可持续支持是通过华盛顿大学科学研究所进行协调的。该研究被纳入了本科和研究生的计算机科学课程,该软件也被纳入了领域科学课程。该项目的外展活动包括通过针对代表性不足的群体(如CRA-W DREU)的特殊方案为学生提供建议。关于这个项目的更多信息可以在http://escience.washington.edu/dbaas上找到。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Bill Howe其他文献
Optimizing Large-Scale Semi-Naïve Datalog Evaluation in Hadoop
优化 Hadoop 中的大规模半简单数据记录评估
- DOI:
- 发表时间:
2012 - 期刊:
- 影响因子:0
- 作者:
Marianne Shaw;Paraschos Koutris;Bill Howe;Dan Suciu - 通讯作者:
Dan Suciu
Perfopticon: Visual Query Analysis for Distributed Databases
Perfopticon:分布式数据库的可视化查询分析
- DOI:
- 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
Dominik Moritz;D. Halperin;Bill Howe;Jeffrey Heer - 通讯作者:
Jeffrey Heer
VizioMetrix: A Platform for Analyzing the Visual Information in Big Scholarly Data
VizioMetrix:分析学术大数据中的视觉信息的平台
- DOI:
- 发表时间:
2016 - 期刊:
- 影响因子:0
- 作者:
Po;Jevin D. West;Bill Howe - 通讯作者:
Bill Howe
SQLShare : Scientific Workflow via Relational View Sharing
SQLShare:通过关系视图共享的科学工作流程
- DOI:
- 发表时间:
2013 - 期刊:
- 影响因子:0
- 作者:
Bill Howe;F. Ribalet;D. Halperin;Sagar Chitnis;E. Armbrust - 通讯作者:
E. Armbrust
MusicDB: A Platform for Longitudinal Music Analytics
MusicDB:纵向音乐分析平台
- DOI:
- 发表时间:
2016 - 期刊:
- 影响因子:0
- 作者:
Jeremy Hyrkas;Bill Howe - 通讯作者:
Bill Howe
Bill Howe的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Bill Howe', 18)}}的其他基金
Workshop on Foundations of Responsible Data Science (FoRDS)
负责任数据科学基础研讨会 (FoRDS)
- 批准号:
1902959 - 财政年份:2019
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: Framework for Integrative Data Equity Systems
协作研究:综合数据公平系统框架
- 批准号:
1934405 - 财政年份:2019
- 资助金额:
$ 34.3万 - 项目类别:
Continuing Grant
BIGDATA: F: Collaborative Research: Foundations of Responsible Data Management
大数据:F:协作研究:负责任的数据管理的基础
- 批准号:
1740996 - 财政年份:2017
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: Conceptualizing An Institute for Empowering Long Tail Research
合作研究:构想一个促进长尾研究的研究所
- 批准号:
1216879 - 财政年份:2012
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
CIC: EAGER: Scalable Algebraic Visualization in the Cloud
CIC:EAGER:云中的可扩展代数可视化
- 批准号:
1060213 - 财政年份:2010
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Where the Ocean Meets the Cloud: Ad Hoc Longitudinal Analysis and Collaboration Over Massive Mesh Data
海洋与云的交汇:海量网格数据的临时纵向分析和协作
- 批准号:
0844572 - 财政年份:2009
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
相似海外基金
III : Medium: Collaborative Research: From Open Data to Open Data Curation
III:媒介:协作研究:从开放数据到开放数据管理
- 批准号:
2420691 - 财政年份:2024
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: Designing AI Systems with Steerable Long-Term Dynamics
合作研究:III:中:设计具有可操纵长期动态的人工智能系统
- 批准号:
2312865 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
合作研究:III:媒介:算法排序器的负责任设计和验证
- 批准号:
2312932 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
III: Medium: Collaborative Research: Integrating Large-Scale Machine Learning and Edge Computing for Collaborative Autonomous Vehicles
III:媒介:协作研究:集成大规模机器学习和边缘计算以实现协作自动驾驶汽车
- 批准号:
2348169 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Continuing Grant
Collaborative Research: III: Medium: Algorithms for scalable inference and phylodynamic analysis of tumor haplotypes using low-coverage single cell sequencing data
合作研究:III:中:使用低覆盖率单细胞测序数据对肿瘤单倍型进行可扩展推理和系统动力学分析的算法
- 批准号:
2415562 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: New Machine Learning Empowered Nanoinformatics System for Advancing Nanomaterial Design
合作研究:III:媒介:新的机器学习赋能纳米信息学系统,促进纳米材料设计
- 批准号:
2347592 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: Knowledge discovery from highly heterogeneous, sparse and private data in biomedical informatics
合作研究:III:中:生物医学信息学中高度异构、稀疏和私有数据的知识发现
- 批准号:
2312862 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
合作研究:III:媒介:算法排序器的负责任设计和验证
- 批准号:
2312930 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: VirtualLab: Integrating Deep Graph Learning and Causal Inference for Multi-Agent Dynamical Systems
协作研究:III:媒介:VirtualLab:集成多智能体动态系统的深度图学习和因果推理
- 批准号:
2312501 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: Graph Neural Networks for Heterophilous Data: Advancing the Theory, Models, and Applications
合作研究:III:媒介:异质数据的图神经网络:推进理论、模型和应用
- 批准号:
2406648 - 财政年份:2023
- 资助金额:
$ 34.3万 - 项目类别:
Standard Grant