Integrated resource for reproducibility in macromolecular crystallography

大分子晶体学重现性的综合资源

基本信息

  • 批准号:
    8875830
  • 负责人:
  • 金额:
    $ 46万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2015
  • 资助国家:
    美国
  • 起止时间:
    2015-06-01 至 2018-05-31
  • 项目状态:
    已结题

项目摘要

 DESCRIPTION (provided by applicant): We propose the development of a collection of data wrangling tools to store, parse, manipulate, validate, curate, analyze, and disseminate macromolecular diffraction images together with all associated relevant metadata. The proposed system will have several benefits, by (1) creating a means to improve existing structures as technology for processing diffraction image advances, (2) detecting errors (and potentially, fraud) in existing structures to ensure structure quality and reproducibility, (3) preventing the loss of data collected by structural genomics and other programs that have closed or will close, (4) providing data for analysis of diffuse diffraction effects, and (5) buildng a "training set" for new diffraction analysis algorithms and hardware. Biologists, bioinformaticians, and software and hardware developers will all be beneficiaries of these tools. The proposed research is designed for semantic rather than syntactic analysis of diffraction images, and has several specific goals. First, we will develop tools for automatically extracting and curating diffraction images and associated metadata, as well as producing descriptions of all data needed for reprocessing when methods for structure determination improve. Second, we will create a web-based system for organizing, searching, analyzing, and data mining of appropriate subsets of diffraction images and associated metadata in machine- readable formats. This will include a comprehensive API for programmatic access, the ability to link multiple instances into a distributed federation, and state-of-the-art compression and transfer technologies. Third, we will develop tools to automatically validate, preprocess, and score diffraction images, and to detect potential issues and errors. These tools will make use of new and existing programs for image and data analysis, contain heuristics to identify possible errors, and provide statistics to correlate errors with specific metadata. Fourth, we will create a mechanism to discover diffraction data that have not yielded X-ray structures with currently available methods. Fifth, we will set up a pilot resource incorporating all the developed tools, and collect a test data set for the development of new tools for validation and error detection. We will work closely with multiple collaborators. Most important is the RCSB Protein Data Bank (PDB), who will help us ensure the accuracy and completeness of the diffraction metadata. Other partners will include the diffuse X-ray scattering community, detector vendors, synchrotron beamline managers, members of the IUCr Diffraction Data Deposition Working Group (DDDWG) and the crystallographic community in general. Together with the RCSB PDB, we will organize workshop(s) with these communities in order to (a) improve metadata extraction and (b) better define subsets of diffraction images. By addressing the currently common, irreversible and unnecessary loss of raw diffraction data during the data reduction process, our project helps ensure that the discipline of macromolecular crystallography is capable of continuous self-improvement.
 描述(由申请人提供):我们提出开发一系列数据整理工具,以存储、解析、操作、验证、策划、分析和传播大分子衍射图像以及所有相关的相关元数据。所提出的系统将有几个好处,通过(1)创建一种手段,以改善现有的结构,作为技术处理衍射图像的进步,(2)检测错误(以及潜在的欺诈),以确保结构质量和可再现性,(3)防止结构基因组学和其他已经关闭或即将关闭的项目收集的数据丢失,(4)为漫衍射效应的分析提供数据;(5)为新的衍射分析算法和硬件建立“训练集”。生物学家,生物信息学家, 软件和硬件开发人员都将成为这些工具的受益者。拟议的研究是专为语义而不是句法分析的衍射图像,并有几个具体的目标。首先,我们将开发用于自动提取和管理衍射图像和相关元数据的工具,以及在结构确定方法改进时生成再处理所需的所有数据的描述。其次,我们将建立一个基于网络的系统,用于组织、搜索、分析和数据挖掘衍射图像的适当子集和机器可读格式的相关元数据。这将包括用于编程访问的全面API、将多个实例链接到分布式联合的能力,以及最先进的压缩和传输技术。第三,我们将开发工具来自动验证,预处理和评分衍射图像,并检测潜在的问题和错误。这些工具将利用新的和现有的程序进行图像和数据分析,包含识别可能错误的统计数据,并提供统计数据以将错误与特定元数据相关联。第四,我们将创建一个机制来发现衍射数据,这些数据还没有用目前可用的方法产生X射线结构。五是 将建立一个试点资源,包括所有开发的工具,并收集测试数据集, 开发用于验证和错误检测的新工具。我们将与多个合作伙伴密切合作。最重要的是RCSB蛋白质数据库(PDB),它将帮助我们确保衍射元数据的准确性和完整性。其他合作伙伴将包括扩散X射线散射社区,探测器供应商,同步加速器光束线管理人员,IUCr衍射数据沉积工作组(DDDWG)的成员和晶体学社区。与RCS B PD B一起,我们将与这些社区组织研讨会,以便(a)改进元数据提取和(B)更好地定义衍射图像的子集。通过解决当前常见的,不可逆的和不必要的原始衍射数据在数据处理过程中的损失,我们的项目有助于确保大分子晶体学学科能够不断自我完善。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

WLADEK MINOR其他文献

WLADEK MINOR的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('WLADEK MINOR', 18)}}的其他基金

Reproducible, Unbiased Ligand Identification Assisted by Artificial Intelligence and Development of Ligand Reference Libraries
人工智能辅助的可重复、公正的配体鉴定和配体参考文库的开发
  • 批准号:
    10019572
  • 财政年份:
    2019
  • 资助金额:
    $ 46万
  • 项目类别:
Reproducible, Unbiased Ligand Identification Assisted by Artificial Intelligence and Development of Ligand Reference Libraries
人工智能辅助的可重复、公正的配体鉴定和配体参考文库的开发
  • 批准号:
    10432049
  • 财政年份:
    2019
  • 资助金额:
    $ 46万
  • 项目类别:
Reproducible, Unbiased Ligand Identification Assisted by Artificial Intelligence and Development of Ligand Reference Libraries
人工智能辅助的可重复、公正的配体鉴定和配体参考文库的开发
  • 批准号:
    10200091
  • 财政年份:
    2019
  • 资助金额:
    $ 46万
  • 项目类别:
Metal binding sites in macromolecular structures
大分子结构中的金属结合位点
  • 批准号:
    9233159
  • 财政年份:
    2016
  • 资助金额:
    $ 46万
  • 项目类别:
Metal binding sites in macromolecular structures
大分子结构中的金属结合位点
  • 批准号:
    9008644
  • 财政年份:
    2016
  • 资助金额:
    $ 46万
  • 项目类别:
Integrated resource for reproducibility in macromolecular crystallography
大分子晶体学重现性的综合资源
  • 批准号:
    9280987
  • 财政年份:
    2015
  • 资助金额:
    $ 46万
  • 项目类别:
X-ray data analysis in the presence of structural variability
存在结构变异时的 X 射线数据分析
  • 批准号:
    9147618
  • 财政年份:
    2015
  • 资助金额:
    $ 46万
  • 项目类别:
X-ray data analysis in the presence of structural variability
存在结构变异时的 X 射线数据分析
  • 批准号:
    9552204
  • 财政年份:
    2015
  • 资助金额:
    $ 46万
  • 项目类别:
Integrated resource for reproducibility in macromolecular crystallography
大分子晶体学重现性的综合资源
  • 批准号:
    9069902
  • 财政年份:
    2015
  • 资助金额:
    $ 46万
  • 项目类别:
Centers for High-Throughput Structure Determination
高通量结构测定中心
  • 批准号:
    8152878
  • 财政年份:
    2010
  • 资助金额:
    $ 46万
  • 项目类别:

相似海外基金

Rational design of rapidly translatable, highly antigenic and novel recombinant immunogens to address deficiencies of current snakebite treatments
合理设计可快速翻译、高抗原性和新型重组免疫原,以解决当前蛇咬伤治疗的缺陷
  • 批准号:
    MR/S03398X/2
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Fellowship
CAREER: FEAST (Food Ecosystems And circularity for Sustainable Transformation) framework to address Hidden Hunger
职业:FEAST(食品生态系统和可持续转型循环)框架解决隐性饥饿
  • 批准号:
    2338423
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Continuing Grant
Re-thinking drug nanocrystals as highly loaded vectors to address key unmet therapeutic challenges
重新思考药物纳米晶体作为高负载载体以解决关键的未满足的治疗挑战
  • 批准号:
    EP/Y001486/1
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Research Grant
Metrology to address ion suppression in multimodal mass spectrometry imaging with application in oncology
计量学解决多模态质谱成像中的离子抑制问题及其在肿瘤学中的应用
  • 批准号:
    MR/X03657X/1
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Fellowship
CRII: SHF: A Novel Address Translation Architecture for Virtualized Clouds
CRII:SHF:一种用于虚拟化云的新型地址转换架构
  • 批准号:
    2348066
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Standard Grant
The Abundance Project: Enhancing Cultural & Green Inclusion in Social Prescribing in Southwest London to Address Ethnic Inequalities in Mental Health
丰富项目:增强文化
  • 批准号:
    AH/Z505481/1
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Research Grant
ERAMET - Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
ERAMET - 快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
  • 批准号:
    10107647
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    EU-Funded
BIORETS: Convergence Research Experiences for Teachers in Synthetic and Systems Biology to Address Challenges in Food, Health, Energy, and Environment
BIORETS:合成和系统生物学教师的融合研究经验,以应对食品、健康、能源和环境方面的挑战
  • 批准号:
    2341402
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Standard Grant
Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
  • 批准号:
    10106221
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    EU-Funded
Recite: Building Research by Communities to Address Inequities through Expression
背诵:社区开展研究,通过表达解决不平等问题
  • 批准号:
    AH/Z505341/1
  • 财政年份:
    2024
  • 资助金额:
    $ 46万
  • 项目类别:
    Research Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了