Development Of Theoretical Methods For Studying Biological Macromolecules
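Basic information
- Grant number: 10706160
- Principal investigator:
- Amount: $1.0046 million
- Host institution:
- Host institution country: United States
- Project category:
- Fiscal year:
- Funding country: United States
- Project period:
- Project status: Ongoing
- Source: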
- Keywords: Active Sites; Affect; Algorithms; Amber; Attention; Basic Science; Behavior; Binding; Biochemical Pathway; Biochemical Reaction; Biological; Biological Process; Biological Products; Biophysics; Catalysis; Cells; Charge; Chemicals; Communities; Complement; Computational Biology; Computational Technique; Computer Assisted; Computers; Computing Methodologies; Consumption; Coupled; Data; Data Reporting; Data Set; Development; Dimensions; Drug Design; Electron Microscopy; Electrostatics; Ensure; Entropy; Enzymes; Evaluation; Free Energy; Future; Gene Expression Profiling; Goals; Grain; Graph; Hot Spot; Hydrophobicity; Image Analysis; Ions; Kinetics; Laboratories; Learning; Light; Machine Learning; Maps; Markov Chains; Mechanics; Membrane Proteins; Methodology; Methods; Modeling; Molecular; National Heart, Lung, and Blood Institute; Natural Language Processing; Performance; Pharmacotherapy; Play; Process; Property; Proteins; Publishing; Quantum Mechanics; Research; Research Project Grants; Resolution; Role; Rotation; Running; Sampling; Scheme; Science; Scientist; Solvents; Speed; Structure; System; Techniques; Testing; Therapeutic Intervention; Thermodynamics; Time; Training; Translations; Trees; Ursidae Family; Variant; Water; Work; antimicrobial peptide; autoencoder; base; biological systems; computing resources; deep learning; design; drug of abuse; enthalpy; experimental study; gene function; gradient boosting; graph neural network; human disease; improved; insight; interest; kinetic model; machine learning algorithm; machine learning model; machine learning prediction; macromolecule; markov model; models and simulation; molecular dynamics; molecular mechanics; molecular modeling; multi-scale modeling; neural; neural network; novel; operation; particle; pathogen; programs; protein folding; quantum; random forest; restraint; simulation; small molecule; software development; solute; theories; therapeutic lead compound; therapy design; tool; vector
Project abstract
GraphVAMPNet, using graph neural networks and variational approach to Markov processes for dynamical modeling of biomolecules
Finding a low-dimensional representation of data from long-timescale trajectories of biomolecular processes, such as protein folding or ligand-receptor binding, is of fundamental importance, and kinetic models, such as Markov models, have proven useful in describing the kinetics of these systems. We combine VAMPNet and graph neural networks to generate an end-to-end framework that efficiently learns high-level dynamics and metastable states from long-timescale molecular dynamics trajectories. This method bears the advantages of graph representation learning and uses graph message passing operations to generate an embedding for each datapoint, which is used in the VAMPNet to generate a coarse-grained dynamical model. This type of molecular representation results in a higher-resolution and more interpretable Markov model than the standard VAMPNet, enabling a more detailed kinetic study of biomolecular processes.
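As a rough illustration of the kind of Markov model mentioned above, the sketch below estimates a transition matrix from a discretized trajectory and derives an implied timescale for a two-state system. It is not GraphVAMPNet itself: the function names and the toy state sequence are assumptions made for illustration, and the state assignment that the network would learn is simply taken as given.

```python
import math

def transition_matrix(dtraj, n_states, lag):
    """Row-normalized transition counts between frames separated by `lag`."""
    counts = [[0] * n_states for _ in range(n_states)]
    for i, j in zip(dtraj[:-lag], dtraj[lag:]):
        counts[i][j] += 1
    matrix = []
    for row in counts:
        total = sum(row)
        matrix.append([c / total if total else 0.0 for c in row])
    return matrix

def two_state_timescale(T, lag):
    """Implied timescale of a 2x2 row-stochastic matrix.

    Its eigenvalues are 1 and (trace - 1); the second one must lie
    in (0, 1) for the logarithm to give a positive timescale.
    """
    lam2 = T[0][0] + T[1][1] - 1.0
    return -lag / math.log(lam2)

# Toy state sequence (made up): 0 and 1 label two metastable states.
dtraj = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 1, 1, 1, 1, 0, 0]
T = transition_matrix(dtraj, n_states=2, lag=1)
timescale = two_state_timescale(T, lag=1)
```

Here `timescale` is expressed in units of trajectory frames; with real data one would repeat the estimate at several lag times and check that it levels off.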
Deep attention based variational autoencoder for antimicrobial peptide discovery
Antimicrobial peptides (AMPs) have been proposed as a potential solution against multiresistant pathogens. Designing novel AMPs requires exploring a vast chemical space, which makes it a challenging problem. Recently, natural language processing and generative deep learning have shown great promise in exploring this space and generating new chemicals with desired properties. We leverage a variational attention mechanism in the generative variational autoencoder, where the attention vector is also modeled as a latent vector. Variational attention helps with the diversity and quality of the generated AMPs. The AMPs generated by this model are novel, have high statistical fidelity, and have physicochemical properties, such as charge, hydrophobicity, and hydrophobic moment, similar to those of real antimicrobial peptides.
pKa prediction by machine learning
Machine learning techniques have developed rapidly in recent years and have been applied to numerous scientific fields. We have presented four tree-based machine learning models for protein pKa prediction. The four models, Random Forest, Extra Trees, eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), were trained on three experimental PDB and pKa datasets, two of which included a notable portion of internal residues. We observed similar performance among the four machine learning algorithms. The best model trained on the largest dataset performs 37% better than the widely used empirical pKa prediction tool PROPKA and 15% better than the published result from the pKa prediction method DelPhiPKa.
Equivariant graph neural network based electrostatic embedding in QM/MM simulations
MM empirical force field calculations on GPUs are much faster than QM calculations on even a small region. One solution to this problem is to replace the expensive QM calculations with neural networks. However, this is challenging because the model is not transferable and has to be trained on the bulk region data for every simulation. The main contribution of this work is to develop a novel equivariant neural network where the intra-QM interactions are modeled with a GNN and the MM-to-QM interaction is modeled with a sparse network. The E(3)-equivariant convolutions over higher-order tensors ensure that the features at each layer remain equivariant. Hence, this gives up to two orders of magnitude higher sampling efficiency during training, thus making an NN-based QM/MM simulation feasible.
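To make the symmetry argument concrete, the sketch below computes the classical electrostatic potential that MM point charges create at each QM atom and checks that it is unchanged when the whole system is rotated. This is not the network described above; it only shows an invariant (scalar) quantity of the kind such a model could take as input. The geometry, charges, and function names are made up for illustration, and atomic units are assumed.

```python
import math

def potential_at_qm_atoms(qm_xyz, mm_xyz, mm_charges):
    """Coulomb potential (atomic units) from MM point charges at each QM atom."""
    return [
        sum(q / math.dist(r_qm, r_mm) for r_mm, q in zip(mm_xyz, mm_charges))
        for r_qm in qm_xyz
    ]

def rotate_z(xyz, angle):
    """Rotate every point about the z axis by `angle` radians."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y, s * x + c * y, z) for x, y, z in xyz]

# Made-up geometry in bohr: two QM atoms and three MM point charges.
qm_xyz = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
mm_xyz = [(5.0, 1.0, 0.0), (-4.0, 2.0, 1.0), (0.0, -6.0, 3.0)]
mm_charges = [-0.8, 0.4, 0.4]

v_ref = potential_at_qm_atoms(qm_xyz, mm_xyz, mm_charges)
# Rotating QM and MM coordinates together leaves all distances unchanged,
# so the potential at each QM atom is the same up to round-off.
v_rot = potential_at_qm_atoms(rotate_z(qm_xyz, 0.7), rotate_z(mm_xyz, 0.7), mm_charges)
```

Vector quantities such as the electric field would instead rotate along with the system, which is the behavior an equivariant layer is built to preserve.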
Automatic differentiation for Particle Mesh Ewald
There has been recent interest in simulations built on end-to-end differentiable implementations of molecular dynamics, in packages such as JaxMD, TorchMD, and DiffTaichi. These methods apply the progress made in automatic differentiation for neural networks to empirical and machine-learned force fields. However, they lack long-range electrostatics and rely only on cutoff schemes. In this work, we are adding long-range Ewald terms to the JaxMD package. We also plan to extend the approach to higher-order multipoles in the future, since automatic differentiation will provide the complicated Hessian terms automatically.
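The idea of obtaining derivatives of an Ewald term automatically can be sketched with a small forward-mode automatic differentiation based on dual numbers. This is plain Python, not the JaxMD API, and it covers only the real-space pair term erfc(alpha*r)/r; the reciprocal-space and self-interaction terms are omitted. The `Dual` class, the function names, and the parameter values are assumptions made for illustration.

```python
import math

class Dual:
    """Number val + der*eps with eps**2 == 0; `der` carries the derivative."""
    def __init__(self, val, der=0.0):
        self.val, self.der = val, der

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val * other.val,
                    self.val * other.der + self.der * other.val)
    __rmul__ = __mul__

    def __truediv__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val / other.val,
                    (self.der * other.val - self.val * other.der) / other.val ** 2)

def erfc(x):
    """erfc on dual numbers, using d/dx erfc(x) = -2/sqrt(pi) * exp(-x**2)."""
    slope = -2.0 / math.sqrt(math.pi) * math.exp(-x.val ** 2)
    return Dual(math.erfc(x.val), slope * x.der)

def ewald_real_pair(r, qi, qj, alpha):
    """Real-space Ewald pair energy qi*qj*erfc(alpha*r)/r (Gaussian units)."""
    return qi * qj * erfc(alpha * r) / r

# Seed der=1.0 so that the result carries dE/dr alongside E.
qi, qj, alpha, r0 = 1.0, -1.0, 0.35, 3.0
e = ewald_real_pair(Dual(r0, 1.0), qi, qj, alpha)
radial_force = -e.der

# Hand-derived derivative, used only to check the automatic one.
analytic = qi * qj * (-math.erfc(alpha * r0) / r0 ** 2
                      - 2.0 * alpha / math.sqrt(math.pi)
                      * math.exp(-(alpha * r0) ** 2) / r0)
```

The energy function is written once and the derivative falls out of the arithmetic rules, which is the property that makes higher-order terms less laborious to obtain than by hand.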
Evaluation of Binding Free Energies
The ability to accurately predict binding free energies is a cornerstone of rational drug design. Each year, the SAMPL challenges propose several sets of host-guest pairs for which no experimental data are available yet. They are designed as a test for the molecular simulation community to assess the capability and robustness of free energy estimation methods. Our submission to the drugs-of-abuse section of the SAMPL8 challenge yielded excellent performance by bridging the gap between Quantum Mechanical (QM) and Molecular Mechanical (MM) methods. We further analyzed the results to offer deeper insights into the methods, demonstrating, for example, the adequacy of semi-empirical methods for this type of work.
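For readers less familiar with how such predictions are scored, the sketch below converts an association constant into a standard binding free energy and computes a root-mean-square error between predicted and reference values. The function names are assumptions, the numbers in the usage lines are arbitrary, and nothing here reproduces the SAMPL8 submission or its results.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)

def binding_free_energy(ka, temperature=298.15):
    """Standard binding free energy (kcal/mol) from an association constant in 1/M."""
    return -R_KCAL * temperature * math.log(ka)

def rmse(predicted, reference):
    """Root-mean-square error between two equally long sequences."""
    squared = [(p - r) ** 2 for p, r in zip(predicted, reference)]
    return math.sqrt(sum(squared) / len(squared))

# Arbitrary illustrative values, not measured or computed data.
dg = binding_free_energy(1.0e6)      # roughly -8.2 kcal/mol at 298.15 K
error = rmse([1.0, 2.0], [1.0, 4.0])
```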
Hessians for Permanent Electrostatics and Polarizability models
Building on an implementation of the first derivatives of the electrostatic interaction terms for multipoles, we continued our efforts by implementing the second derivatives, namely the Hessian terms. These second-order terms are at the core of tools such as Normal Mode Analysis and are key to predicting structural properties, among others.
Hessian terms for polarizable force fields (classical force fields including a representation of the electronic mobility) have also been implemented.
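The link between Hessian terms and Normal Mode Analysis can be shown on a deliberately tiny system: two particles on a line joined by a harmonic bond. The sketch builds the Hessian by finite differences, mass-weights it, and reads off the squared mode frequencies as its eigenvalues. The potential, masses, and function names are assumptions chosen for illustration; electrostatic or polarizable terms are not included.

```python
import math

def energy(x, k=1.0, r0=1.0):
    """Harmonic bond between two particles on a line."""
    return 0.5 * k * (x[1] - x[0] - r0) ** 2

def numerical_hessian(f, x, h=1e-4):
    """Central finite-difference second derivatives of f at x."""
    n = len(x)
    hess = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            def shifted(di, dj):
                y = list(x)
                y[i] += di
                y[j] += dj
                return f(y)
            hess[i][j] = (shifted(h, h) - shifted(h, -h)
                          - shifted(-h, h) + shifted(-h, -h)) / (4.0 * h * h)
    return hess

def symmetric_2x2_eigenvalues(m):
    """Closed-form eigenvalues of a symmetric 2x2 matrix, smallest first."""
    a, b, d = m[0][0], m[0][1], m[1][1]
    mean, radius = 0.5 * (a + d), math.hypot(0.5 * (a - d), b)
    return mean - radius, mean + radius

masses = [1.0, 2.0]
hess = numerical_hessian(energy, [0.0, 1.0])
mass_weighted = [[hess[i][j] / math.sqrt(masses[i] * masses[j])
                  for j in range(2)] for i in range(2)]
low, high = symmetric_2x2_eigenvalues(mass_weighted)
```

The zero eigenvalue corresponds to free translation of the pair, and the other equals k*(1/m1 + 1/m2), the squared angular frequency of the bond vibration. Analytic Hessians avoid the step-size sensitivity of the finite-difference route used here.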
Enhancements and Extensions of Grid Inhomogeneous Solvation Theory Calculations
Grid Inhomogeneous Solvation Theory (GIST) provides a statistical mechanical formalism for determining the thermodynamics of water in a region of interest by mapping various solvation properties onto a grid. This helps to identify thermodynamic hot spots for solvation, indicating where solvent binding may be favorable or unfavorable, which can help, for example, to guide rational drug design. Enthalpy-related properties require the calculation of water-water and water-solute energies on this grid, which is usually the most time-consuming part of the calculation. We have greatly increased the speed of this calculation by parallelizing it with MPI. In addition, we have improved both the speed of the entropy calculation and its convergence by introducing a correction term for the "reference volume" in the nearest neighbor entropy and a new version of the nearest neighbor search. The new search allows a user to choose how many layers of neighboring voxels should be used, with higher values improving convergence of the entropy of bulk solvent. Finally, we have extended GIST to handle solvents other than water, including ions.
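The grid bookkeeping described above can be sketched in a few lines: assigning a solvent position to a voxel, and listing the voxels within a chosen number of layers around it for a neighbor search. This is only an illustration under simple assumptions (an axis-aligned cubic grid, hypothetical function names), not the actual GIST implementation.

```python
import math
from itertools import product

def voxel_index(point, origin, spacing):
    """Integer (ix, iy, iz) index of the voxel containing `point`."""
    return tuple(math.floor((p - o) / spacing) for p, o in zip(point, origin))

def neighbor_voxels(index, layers, dims):
    """Voxels within `layers` shells of `index`, clipped to the grid dimensions."""
    result = []
    for offset in product(range(-layers, layers + 1), repeat=3):
        candidate = tuple(i + d for i, d in zip(index, offset))
        if all(0 <= c < n for c, n in zip(candidate, dims)):
            result.append(candidate)
    return result

# A water oxygen at (1.3, 0.2, 2.6) on a grid with 0.5-unit voxels.
idx = voxel_index((1.3, 0.2, 2.6), (0.0, 0.0, 0.0), 0.5)
one_layer = neighbor_voxels((5, 5, 5), 1, (10, 10, 10))
two_layers = neighbor_voxels((5, 5, 5), 2, (10, 10, 10))
```

Away from the grid edges, n layers cover (2n + 1)^3 voxels, so each extra layer widens the pool of candidate neighbors at a steeply growing cost.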
Evaluating the Effect of Positional Restraints on GIST Calculations
To ensure that global rotations and translations of the solute of interest are removed, previous GIST calculations have required either rotating the entire system to fit the grid or running simulations with positional restraints on the solute to keep it oriented in the grid. The former makes it difficult to do energy calculations with a long cutoff (since the coordinates are rotated out of the system reference unit cell), and the latter can introduce some bias into the results from the use of restraints. We have developed a third option in which the grid itself is rotated to match the solute of interest. This allows GIST to be used on simulations with no positional restraints and facilitates the use of GIST with the particle mesh Ewald method for calculating electrostatics, which improves its convergence. Work is currently underway to evaluate what effect the past methods had on the results.
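Rotating the grid rather than the system amounts to expressing each solvent position in the grid's own frame before binning it. The sketch below does this for a grid defined by an origin and three orthonormal axes; the function names and the example orientation are assumptions for illustration, and how the axes are fitted to the solute in each frame is left out.

```python
import math

def to_grid_frame(point, origin, axes):
    """Project (point - origin) onto the grid's orthonormal axes."""
    shifted = [p - o for p, o in zip(point, origin)]
    return tuple(sum(a * s for a, s in zip(axis, shifted)) for axis in axes)

def rotated_voxel_index(point, origin, axes, spacing):
    """Voxel index of a lab-frame point on a grid that has been rotated."""
    return tuple(math.floor(c / spacing)
                 for c in to_grid_frame(point, origin, axes))

# Grid rotated by 90 degrees about z: its x axis points along lab y,
# and its y axis points along lab -x.
axes = [(0.0, 1.0, 0.0), (-1.0, 0.0, 0.0), (0.0, 0.0, 1.0)]
idx = rotated_voxel_index((-0.7, 1.2, 0.3), (0.0, 0.0, 0.0), axes, 0.5)
```

Because only the binning step changes, the simulation coordinates stay in the original unit cell.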
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants by Bernard R Brooks
Development Of Theoretical Methods For Studying Biological Macromolecules
- Grant number: 8557904
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Molecular Dynamics Simulations Of Biological Macromolecules
- Grant number: 7968988
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Molecular Dynamics Simulations Of Biological Macromolecules
- Grant number: 8939759
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Three-dimensional Structures Of Biological Macromolecules
- Grant number: 7594372
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Molecular Dynamics Simulations Of Biological Macromolecules
- Grant number: 10262664
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Development Of Advanced Computer Hardware And Software
- Grant number: 10706226
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Development Of Theoretical Methods For Studying Biological Macromolecules
- Grant number: 7734954
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Development Of Theoretical Methods For Studying Biological Macromolecules
- Grant number: 10929079
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Development Of Theoretical Methods For Studying Biological Macromolecules
- Grant number: 8158018
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Molecular Dynamics Simulations of Biological Macromolecules
- Grant number: 6109190
- Fiscal year:
- Amount: $1.0046 million
- Project category:
Similar overseas grants
RII Track-4:NSF: From the Ground Up to the Air Above Coastal Dunes: How Groundwater and Evaporation Affect the Mechanism of Wind Erosion
- Grant number: 2327346
- Fiscal year: 2024
- Amount: $1.0046 million
- Project category: Standard Grant
BRC-BIO: Establishing Astrangia poculata as a study system to understand how multi-partner symbiotic interactions affect pathogen response in cnidarians
- Grant number: 2312555
- Fiscal year: 2024
- Amount: $1.0046 million
- Project category: Standard Grant
How Does Particle Material Properties Insoluble and Partially Soluble Affect Sensory Perception Of Fat based Products
- Grant number: BB/Z514391/1
- Fiscal year: 2024
- Amount: $1.0046 million
- Project category: Training Grant
Graduating in Austerity: Do Welfare Cuts Affect the Career Path of University Students?
- Grant number: ES/Z502595/1
- Fiscal year: 2024
- Amount: $1.0046 million
- Project category: Fellowship
Construction of Affect-X, an index of individual differences in sensibility, and establishment of a foundation for bespoke AI services
- Grant number: 23K24936
- Fiscal year: 2024
- Amount: $1.0046 million
- Project category: Grant-in-Aid for Scientific Research (B)
Insecure lives and the policy disconnect: How multiple insecurities affect Levelling Up and what joined-up policy can do to help
- Grant number: ES/Z000149/1
- Fiscal year: 2024
- Amount: $1.0046 million
- Project category: Research Grant
How does metal binding affect the function of proteins targeted by a devastating pathogen of cereal crops?
- Grant number: 2901648
- Fiscal year: 2024
- Amount: $1.0046 million
- Project category: Studentship
ERI: Developing a Trust-supporting Design Framework with Affect for Human-AI Collaboration
- Grant number: 2301846
- Fiscal year: 2023
- Amount: $1.0046 million
- Project category: Standard Grant
Investigating how double-negative T cells affect anti-leukemic and GvHD-inducing activities of conventional T cells
- Grant number: 488039
- Fiscal year: 2023
- Amount: $1.0046 million
- Project category: Operating Grants
How motor impairments due to neurodegenerative diseases affect masticatory movements
- Grant number: 23K16076
- Fiscal year: 2023
- Amount: $1.0046 million
- Project category: Grant-in-Aid for Early-Career Scientists