BD Spokes: SPOKE: NORTHEAST: Collaborative: A Licensing Model and Ecosystem for Data Sharing
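Basic Information
- Award number: 1947440
- Principal investigator:
- Amount: $270,900
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2019
- Funding country: United States
- Project period: 2019-09-01 to 2021-08-31
- Project status: Completed
- Source:
- Keywords:
Project Abstract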
Sharing data sets can provide tremendous mutual benefits for industry, researchers, and nonprofit organizations. For example, companies can profit when university researchers explore their data sets and make discoveries that help the company improve its business. At the same time, researchers are always searching for real-world data sets to show that their newly developed techniques work in practice. Unfortunately, many attempts to share relevant data sets between stakeholders in industry and academia fail or require a large investment to make data sharing possible. A major obstacle is that data often comes with prohibitive restrictions on how it can be used (e.g., requiring the enforcement of legal terms or other policies, or the handling of data privacy issues). To enforce these requirements today, lawyers are usually involved in negotiating the terms of each contract. It is not atypical for this process of creating an individual data sharing contract to end in protracted negotiations, which are both disconnected from what the actual stakeholders aim to do and fraught, as both sides struggle with the implications and possibilities of modern security, privacy, and data sharing techniques. Worse, fear of missing a loophole in how the data might be (mis)used often prevents data sharing efforts from even getting off the ground. To address these challenges, our new data sharing spoke will enable data providers to easily share data while enforcing constraints on the use of the data. This effort has two key components: (1) creating a licensing model for data that facilitates sharing data that is not necessarily open or free between different organizations, and (2) developing a prototype data sharing software platform, ShareDB, which enforces the terms and restrictions of the developed licenses. We believe these efforts will have a transformative impact on how data sharing takes place. By moving data out of the silos of individuals and single organizations and into the hands of broader society, we can tackle many societally significant problems.
This new data sharing spoke will enable data providers to easily share data while enforcing constraints on the use of the data. Many services and platforms that provide access to data sets already exist today. However, these platforms generally promote completely open access and do not address the aforementioned issues that arise when dealing with proprietary data. Thus, the effort has three key components: (1) creating a licensing model for data that facilitates sharing data that is not necessarily open or free between different organizations, (2) developing a prototype data sharing software platform, ShareDB, which enforces the terms and restrictions of the developed licenses, and (3) developing and integrating relevant metadata that will accompany the data sets shared under the different licenses, making them easily searchable and interpretable. To ensure that the developed tools and licenses are useful, the project will form the Northeast Data Sharing Group, comprising many different stakeholders, to make the licensing model widely accepted and usable in many application domains (e.g., health and finance). The intellectual merit of this proposal is to design a licensing model and a data sharing platform that are widely accepted and usable as a template in many different domains. While other efforts to enable data sharing exist (e.g., Creative Commons), they focus on the case where the data owner is willing to openly share the data on the Internet. This licensing model and ecosystem are different since they allow data owners to enforce certain requirements stated in a data sharing agreement (e.g., on who is allowed to access the data) and also provide tools to make data sharing of sensitive information safe. The licenses and software we propose to investigate will make it easier for organizations to open up their data to the appropriate organizations while maintaining the ability to ensure that it is protected, that access is revocable, and that access controls and audit logs are maintained.
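The abstract does not describe ShareDB's actual interface, so the following is only an illustrative sketch of the three properties it names: access control tied to a license, revocable access, and an audit log. Every class, method, and field name here (`License`, `SharedDataset`, `grant`, `revoke`, `read`) is a hypothetical stand-in, not part of ShareDB.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class License:
    """Hypothetical license: who may use a dataset, for what, and whether it was revoked."""
    licensee: str
    allowed_purposes: frozenset
    revoked: bool = False


@dataclass
class SharedDataset:
    """Hypothetical dataset wrapper that checks a license before every read."""
    name: str
    rows: list
    licenses: dict = field(default_factory=dict)   # licensee -> License
    audit_log: list = field(default_factory=list)  # one entry per access attempt

    def grant(self, licensee, purposes):
        """Issue (or replace) a license for one licensee."""
        self.licenses[licensee] = License(licensee, frozenset(purposes))

    def revoke(self, licensee):
        """Mark an existing license as revoked; later reads are refused."""
        if licensee in self.licenses:
            self.licenses[licensee].revoked = True

    def read(self, licensee, purpose):
        """Return a copy of the rows if the license permits this purpose."""
        lic = self.licenses.get(licensee)
        allowed = (
            lic is not None
            and not lic.revoked
            and purpose in lic.allowed_purposes
        )
        # Record every attempt, allowed or not, so the provider can audit use.
        self.audit_log.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "licensee": licensee,
            "purpose": purpose,
            "allowed": allowed,
        })
        if not allowed:
            raise PermissionError(
                f"{licensee} may not read {self.name} for {purpose}"
            )
        return list(self.rows)
```

In this sketch a provider would call `grant("university-lab", {"research"})`, the licensee would call `read("university-lab", "research")`, and a later `revoke("university-lab")` would cause further reads to raise `PermissionError` while still being recorded in `audit_log`. A real platform would also need authentication and persistent storage, which are out of scope here.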
Project Outcomes
Journal articles (3)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Towards instance-optimized data systems
- DOI: 10.14778/3476311.3476392
- Published: 2021-07
- Journal:
- Impact factor: 0
- Authors: Tim Kraska
- Corresponding author: Tim Kraska
Poly'19 Workshop Summary: GDPR
- DOI: 10.1145/3444831.3444842
- Published: 2020
- Journal:
- Impact factor: 0
- Authors: Stonebraker, Michael; Mattson, Timothy; Kraska, Tim; Gadepally, Vijay
- Corresponding author: Gadepally, Vijay
Other publications by Tim Kraska
Building Database Applications in the Cloud
- DOI: 10.3929/ethz-a-006007449
- Published: 2010
- Journal:
- Impact factor: 0
- Authors: Tim Kraska
- Corresponding author: Tim Kraska
Towards a Benchmark for the Cloud
- DOI:
- Published: 2018
- Journal:
- Impact factor: 0
- Authors: Carsten Binnig; Donald Kossmann; Tim Kraska; Simon Losing
- Corresponding author: Simon Losing
Safe Visual Data Exploration
- DOI:
- Published: 2017
- Journal:
- Impact factor: 0
- Authors: Zheguang Zhao; Emanuel Zgraggen; L. Stefani; Carsten Binnig; E. Upfal; Tim Kraska
- Corresponding author: Tim Kraska
Self-Organizing Data Containers
- DOI:
- Published: 2022
- Journal:
- Impact factor: 0
- Authors: S. Madden; Jialin Ding; Tim Kraska; Sivaprasad Sudhir; David Cohen; T. Mattson; Nesime Tatbul
- Corresponding author: Nesime Tatbul
Making the Case for Query-by-Voice with EchoQuery
- DOI: 10.1145/2882903.2899394
- Published: 2016
- Journal:
- Impact factor: 0
- Authors: Gabriel Lyons; Vinh Q. Tran; Carsten Binnig; U. Çetintemel; Tim Kraska
- Corresponding author: Tim Kraska
Other grants by Tim Kraska
III: Medium: Quantifying the Unknown Unknowns for Data Integration
- Award number: 2033792
- Fiscal year: 2020
- Funding amount: $270,900
- Award type: Continuing Grant
III: Medium: Learning-based Synthesis of Data Processing Engines
- Award number: 1900933
- Fiscal year: 2019
- Funding amount: $270,900
- Award type: Continuing Grant
III: Medium: Quantifying the Unknown Unknowns for Data Integration
- Award number: 1562657
- Fiscal year: 2016
- Funding amount: $270,900
- Award type: Continuing Grant
BD Spokes: SPOKE: NORTHEAST: Collaborative: A Licensing Model and Ecosystem for Data Sharing
- Award number: 1636698
- Fiscal year: 2016
- Funding amount: $270,900
- Award type: Standard Grant
CAREER: Query Compilation Techniques for Complex Analytics on Enterprise Clusters
- Award number: 1453171
- Fiscal year: 2015
- Funding amount: $270,900
- Award type: Continuing Grant
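Similar NSFC Grants
Formation mechanism of rotating spoke modes in magnetron sputtering plasmas and their influence on electron and ion transport properties
- Award number: 12305221
- Award year: 2023
- Funding amount: 300,000 CNY
- Award type: Young Scientists Fund
Systematic study of rotating spokes in partially magnetized plasmas
- Award number: 62201238
- Award year: 2022
- Funding amount: 300,000 CNY
- Award type: Young Scientists Fund
Systematic study of rotating spokes in partially magnetized plasmas
- Award number:
- Award year: 2022
- Funding amount: 300,000 CNY
- Award type: Young Scientists Fund
Identification and functional study of novel ciliary spoke proteins
- Award number: 31772456
- Award year: 2017
- Funding amount: 590,000 CNY
- Award type: General Program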
Similar Overseas Grants
BD Spokes: SPOKE: MIDWEST: Collaborative: Advanced Computational Neuroscience Network (ACNN)
- Award number: 2148729
- Fiscal year: 2021
- Funding amount: $270,900
- Award type: Standard Grant
BD Spokes: SPOKE: NORTHEAST: Collaborative Research: Integration of Environmental Factors and Causal Reasoning Approaches for Large-Scale Observational Health Research
- Award number: 1636786
- Fiscal year: 2017
- Funding amount: $270,900
- Award type: Standard Grant
BD Spokes: SPOKE: NORTHEAST: Collaborative Research: Integration of Environmental Factors and Causal Reasoning Approaches for Large-Scale Observational Health Research
- Award number: 1636795
- Fiscal year: 2017
- Funding amount: $270,900
- Award type: Standard Grant
BD Spokes: SPOKE: NORTHEAST: Collaborative Research: Integration of Environmental Factors and Causal Reasoning Approaches for Large-Scale Observational Health Research
- Award number: 1636832
- Fiscal year: 2017
- Funding amount: $270,900
- Award type: Standard Grant
BD Spokes: SPOKE: MIDWEST: Collaborative: Integrative Materials Design (IMaD): Leverage, Innovate, and Disseminate
- Award number: 1636950
- Fiscal year: 2017
- Funding amount: $270,900
- Award type: Standard Grant