Development Of Theoretical Methods For Studying Biological Macromolecules
生物大分子研究理论方法的发展
基本信息
- 批准号:10929079
- 负责人:
- 金额:$ 111.78万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:
- 资助国家:美国
- 起止时间:至
- 项目状态:未结题
- 来源:
- 关键词:AblationActive SitesAlgorithmsAmberArchitectureAreaBasic ScienceBehaviorBindingBinding SitesBiochemical PathwayBiochemical ProcessBiochemical ReactionBiologicalBiological ProcessBiological ProductsBiophysicsBreathingCatalysisCellsChargeChemicalsChemistryCodeCommunicationComplementComputational BiologyComputational TechniqueComputer AssistedComputersComputing MethodologiesConsumptionCoupledCrystallographyDataData SetDatabasesDecision TreesDescriptorDevelopmentDrug DesignElasticityElectron MicroscopyElectronsElectrostaticsEnzymesEvaluationExhibitsFourier TransformFree EnergyFutureGene Expression Microarray AnalysisGene Expression ProfilingGoalsGrainGraphHardnessHealthHumanHybridsHydrophobicityImageImage AnalysisIonsLaboratoriesLearningLibrariesLigand BindingLigandsLightLiteratureMachine LearningMapsMeasurementMembrane ProteinsMetalloproteinsMetalsMethodologyMethodsModelingModernizationMolecularNational Heart, Lung, and Blood InstituteOxygenPerformancePharmaceutical PreparationsPhasePhysiologicalPlayPotential EnergyProcessPropertyProteinsQuantum MechanicsRadialRadiation induced damageReactionResearchResearch Project GrantsRoentgen RaysRoleRunningSamplingSchemeScienceScientistSpecificitySpeedStructureSurfaceSystemTechniquesTestingTherapeutic InterventionTimeTrainingTreesWorkbiological systemschemical propertycofactorcomputing resourcescostdensitydesigndrug discoveryeffectiveness validationelectronic structureevaluation/testingexperimental studygene functiongradient boostinggraph neural networkhigh dimensionalityhuman diseaseinterestmachine learning modelmachine learning predictionmacromoleculematerials sciencemetallicitymetalloenzymemodels and simulationmolecular dynamicsmolecular mechanicsmolecular modelingmulti-scale modelingneuralneural networkneural network classifiernoveloxidationparallelizationphotosystem IIpredictive toolsprocess optimizationprogramsquantumrandom forestsimulationsmall moleculesoftware developmenttheoriestherapeutic lead compoundtherapy design/developmenttool
项目摘要
pKa prediction by machine learning
Machine learning techniques are developing rapidly in recent years and have been applied to numerous scientific fields. Previously, we presented four tree-based machine learning models for protein pKa prediction. The four models, Random Forest, Extra Trees, eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), were trained on three experimental PDB and pKa datasets, two of which included a notable portion of internal residues. The best model trained on the largest dataset performs 37% better than the widely used empirical pKa prediction tool PROPKA. Error analyses were performed and showed that coupled ionizable residues are the most difficult ones in pKa prediction. To curate a more decent pKa database, we collected detailed information for the PDB structures (e.g., co-factors, whether it is membrane protein or not, etc.), which could be new features for our next version of pKa predictor. In addition, the calculated pKa's based on continuum electrostatic, will be added to the model. This feature will assure the inclusion of long-range electrostatics and the hydrophobic effect. Additional experimental pKa data were found in literatures and added to the new database. We also examined experimental conditions for both PDB structure measurements and pKa measurements to check if the structure-pKa pairs are compatible.
Conceptual DFT for studying catalytic reactions in metalloproteins
Metalloenzymes play a crucial role in maintaining human health by participating in essential biochemical processes that are vital for various physiological functions. These specialized enzymes require metal ions as cofactors to catalyze specific reactions with remarkable efficiency and specificity. Understanding the electronic structure of the active site helps reveal reaction mechanisms. In this project we use Conceptual DFT (CDFT) to study the electronic structure of metallic active sites in Photosystem II. CDFT uses quantum mechanical calculations for understanding and predicting the electronic structure and the inter-molecular and intra-molecular reactivity of molecules. It extends DFT by introducing the concept of quantum descriptors, which describe the tendency of a system to donate or accept electrons based on properties that include electronegativity, hardness, softness, the Fukui functions, and the dual descriptor. Furthermore, the local version of these descriptors is used to predict the reactive sites within the molecule. For example, the condensed Fukui functions and dual descriptors have been used to identify the atoms within the molecule that participate in nucleophilic/electrophilic attack. Our results show that metal cluster in Photosystem II divide the oxygen ligands into nucleophilic and electrophilic ligands to reduce the barrier of the O=O bond formation.
Machine Learning models for predicting oxidation states of metals in proteins
Imaging metalloenzymes with X-ray causes radiation damage, which changes the physical/chemical properties of the metal active site. However, the catalytic reactions in metalloproteins are usually coupled to the metal oxidation states. Thus, identifying the correct oxidation states of the metals is essential for understanding the reaction mechanisms. We built a huge data set of small coordination compounds collected from Cambridge Crystallographic Database (CCDC) for different metals (Fe, Mn, Cu, Co) at different oxidation states and used it to build a machine learning models (Decision Tree and Neural Networks classifiers) to predict the oxidation states of the metals in proteins structures imaged with X-ray and XFEL crystallography.
Equivariant graph neural based electrostatic embedding in QM/MM simulations
We present a novel methodology for efficiently and accurately integrating classical force fields with Quantum Mechanical (QM) theory in additive Quantum Mechanics/Molecular Mechanics (QM/MM) simulations. We design sparse E(3) equivariant neural networks, incorporating a sparse connection scheme that emulates the one-electron integrals in the ab-initio QM and MM regions of the calculations. Specifically, point charges from the MM engage in messaging solely with the QM region, eliminating direct interactions among themselves. To validate the effectiveness of our methodology, we conduct ablation studies and compare it to prominent alternatives, including High-Dimensional Neural Networks (HDNNs) and graph convolutional networks. Our findings demonstrate that our approach not only exhibits superior data efficiency but also outperforms HDNNs and graph convolutional networks by an order of magnitude in terms of computational efficiency. These advancements hold significant promise for diverse applications in areas such as materials science, chemistry, and drug discovery.
bEDS for evaluation of free energies
Free energies of binding and solvation are key quantities in modern drug-design, and being able to accurately and rapidly evaluate them remains an ongoing challenge. Taking inspiration from the Enveloping Distribution Sampling method (EDS), we developed the bridge-EDS (bEDS) method. EDS defines a smoothed potential energy function to overcome barriers on the potential energy surface. bridge-EDS applies this to all alchemical states typically used for solvation or ligand-binding processes. This samples a phase space with better overlap amongst alchemical states, freed from PES energy barriers, thus requiring shorter overall simulations. Thanks to our apoCHARMM code architecture and its multistate handling capability, energies of all different alchemical states are computed on the fly at no extra cost, and the post-processing effort becomes negligible, making bEDS a valuable asset for free energy prediction.
Implementation of the Spherical Grids and Treecode Summation Algorithm
Evaluating pair-wise electrostatic interactions is the most time consuming part when running molec- ular dynamics (MD) simulations, because of the slow decay of the Coulomb operator. The Spherical Grid and Treecode (SGT) summation algorithm speeds up the calculation by factoring the Coulomb operator into short and long ranged terms, where the short ranged term is handled by computing only the pair-wise interactions within a cutoff radius and the long ranged term is approximated using nu- merical cubature techniques. The SGT algorithms primary advantage over traditional methods is that it does not use the fast Fourier transform (FFT) which requires multiple all-to-all communications on multi-node systems. Avoiding the FFT makes the SGT algorithm well suited to be implemented for use on large CPU clusters and multi-GPU systems. In this work, we plan to develop a standalone library which implements the SGT algorithm. We have currently developed pilot code implementations and are in the process of developing high-performance versions for CPU and GPU based platforms. The CPU version is parallelized using a hybrid OpenMPI/OpenMP approach and the GPU version utilizes the CUDA API. We are currently in the process of optimizing the CPU and single GPU versions and plan to develop a multi-GPU version in the future.
Other active projects:
Rotatable Grids for Grid Inhomogeneous Solvation Theory Calculations
Free energy calculation with Double Exponential Potential
A spatial Self-Guided Molecular Simulation Method
Free Energy Profile Decomposition Analysis
Machine Learning Potentials (MLPs) for Reactive Systems
Conceptual Density Functional Theory (DFT) Analysis for Enzyme Catalysis
通过机器学习进行pKa预测
项目成果
期刊论文数量(46)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Global organization of a binding site network gives insight into evolution and structure-function relationships of proteins.
- DOI:10.1038/s41598-017-10412-z
- 发表时间:2017-09-14
- 期刊:
- 影响因子:4.6
- 作者:Lee J;Konc J;Janežič D;Brooks BR
- 通讯作者:Brooks BR
The Extended Eighth-Shell method for periodic boundary conditions with rotational symmetry.
用于旋转对称周期性边界条件的扩展八壳法。
- DOI:10.1002/jcc.26545
- 发表时间:2021
- 期刊:
- 影响因子:3
- 作者:Prasad,Samarjeet;Simmonett,AndrewC;Meana-Pañeda,Rubén;Brooks,BernardR
- 通讯作者:Brooks,BernardR
Reformulation of the self-guided molecular simulation method.
- DOI:10.1063/5.0019086
- 发表时间:2020-09
- 期刊:
- 影响因子:0
- 作者:Xiongwu Wu;B. Brooks
- 通讯作者:Xiongwu Wu;B. Brooks
Predicting hydration free energies with a hybrid QM/MM approach: an evaluation of implicit and explicit solvation models in SAMPL4.
使用混合 QM/MM 方法预测水合自由能:SAMPL4 中隐式和显式溶剂化模型的评估
- DOI:10.1007/s10822-014-9708-4
- 发表时间:2014-03
- 期刊:
- 影响因子:3.5
- 作者:Koenig, Gerhard;Pickard, Frank C.;Mei, Ye;Brooks, Bernard R.
- 通讯作者:Brooks, Bernard R.
A replica exchange umbrella sampling (REUS) approach to predict host-guest binding free energies in SAMPL8 challenge.
- DOI:10.1007/s10822-021-00385-7
- 发表时间:2021-05
- 期刊:
- 影响因子:3.5
- 作者:Ghorbani M;Hudson PS;Jones MR;Aviat F;Meana-Pañeda R;Klauda JB;Brooks BR
- 通讯作者:Brooks BR
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Bernard R Brooks其他文献
Bernard R Brooks的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Bernard R Brooks', 18)}}的其他基金
Development Of Theoretical Methods For Studying Biological Macromolecules
生物大分子研究理论方法的发展
- 批准号:
8557904 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Molecular Dynamics Simulations Of Biological Macromolecules
生物大分子的分子动力学模拟
- 批准号:
7968988 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Molecular Dynamics Simulations Of Biological Macromolecules
生物大分子的分子动力学模拟
- 批准号:
8939759 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Three-dimensional Structures Of Biological Macromolecules
生物大分子的三维结构
- 批准号:
7594372 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Molecular Dynamics Simulations Of Biological Macromolecules
生物大分子的分子动力学模拟
- 批准号:
10262664 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Development Of Advanced Computer Hardware And Software
先进计算机硬件和软件的开发
- 批准号:
10706226 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Development Of Theoretical Methods For Studying Biological Macromolecules
生物大分子研究理论方法的发展
- 批准号:
7734954 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Development Of Theoretical Methods For Studying Biological Macromolecules
生物大分子研究理论方法的发展
- 批准号:
8158018 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Molecular Dynamics Simulations of Biological Macromolecules
生物大分子的分子动力学模拟
- 批准号:
6109190 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
Development of Advanced Computer Hardware and Software
先进计算机硬件和软件的开发
- 批准号:
6109192 - 财政年份:
- 资助金额:
$ 111.78万 - 项目类别:
相似海外基金
NSF-BSF: Towards a Molecular Understanding of Dynamic Active Sites in Advanced Alkaline Water Oxidation Catalysts
NSF-BSF:高级碱性水氧化催化剂动态活性位点的分子理解
- 批准号:
2400195 - 财政年份:2024
- 资助金额:
$ 111.78万 - 项目类别:
Standard Grant
Collaborative Research: Beyond the Single-Atom Paradigm: A Priori Design of Dual-Atom Alloy Active Sites for Efficient and Selective Chemical Conversions
合作研究:超越单原子范式:双原子合金活性位点的先验设计,用于高效和选择性化学转化
- 批准号:
2334970 - 财政年份:2024
- 资助金额:
$ 111.78万 - 项目类别:
Standard Grant
Collaborative Research: Beyond the Single-Atom Paradigm: A Priori Design of Dual-Atom Alloy Active Sites for Efficient and Selective Chemical Conversions
合作研究:超越单原子范式:双原子合金活性位点的先验设计,用于高效和选择性化学转化
- 批准号:
2334969 - 财政年份:2024
- 资助金额:
$ 111.78万 - 项目类别:
Standard Grant
Mechanochemical synthesis of nanocarbon and design of active sites for oxygen reducton/evolution reactions
纳米碳的机械化学合成和氧还原/演化反应活性位点的设计
- 批准号:
23K04919 - 财政年份:2023
- 资助金额:
$ 111.78万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Creation of porous inorganic frameworks with controlled structure of metal active sites by the building block method.
通过积木法创建具有金属活性位点受控结构的多孔无机框架。
- 批准号:
22KJ2957 - 财政年份:2023
- 资助金额:
$ 111.78万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Catalysis of Juxaposed Active Sites Created in Nanospaces and Their Applications
纳米空间中并置活性位点的催化及其应用
- 批准号:
23K04494 - 财政年份:2023
- 资助金额:
$ 111.78万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Generation of carbon active sites by modifying the oxygen containing functional groups and structures of carbons for utilizing to various catalytic reactions.
通过修饰碳的含氧官能团和结构来产生碳活性位点,用于各种催化反应。
- 批准号:
23K13831 - 财政年份:2023
- 资助金额:
$ 111.78万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
CAREER: CAS: Understanding the Chemistry of Palladium and Silyl Compounds to Design Catalyst Active Sites
职业:CAS:了解钯和甲硅烷基化合物的化学性质以设计催化剂活性位点
- 批准号:
2238379 - 财政年份:2023
- 资助金额:
$ 111.78万 - 项目类别:
Continuing Grant
CAS: Collaborative Research: Tailoring the Distribution of Transient vs. Dynamic Active Sites in Solid-Acid Catalysts and Their Impacts on Chemical Conversions
CAS:合作研究:定制固体酸催化剂中瞬时活性位点与动态活性位点的分布及其对化学转化的影响
- 批准号:
2154399 - 财政年份:2022
- 资助金额:
$ 111.78万 - 项目类别:
Standard Grant
Engineering of Active Sites in Heterogeneous Catalysts for Sustainable Chemical and Fuel Production.
用于可持续化学和燃料生产的多相催化剂活性位点工程。
- 批准号:
RGPIN-2019-06633 - 财政年份:2022
- 资助金额:
$ 111.78万 - 项目类别:
Discovery Grants Program - Individual