What’s in the Lab?...CONFIDENTIAL © Cresset > Collaboration with Julien Michel at University of...

Post on 13-Oct-2020

1 views 0 download

Transcript of What’s in the Lab?...CONFIDENTIAL © Cresset > Collaboration with Julien Michel at University of...

What’s in the Lab?Dr Mark Mackey

CSO, Cresset

© CressetCONFIDENTIAL

Free Energy Perturbation (FEP)

© CressetCONFIDENTIAL

> Evaluation via Thermodynamic Integration (TI) or Multistate Bennett Acceptance Ratio (MBAR)

Thermodynamic cycle

Cl

N+

O

N

O

N

N+

O

N

O

N

Shirts, M. et al.,J. Chem. Phys. 2008, 129, 124105

λ = 0

λ = 1

∆Gbind, A

∆Gbind, B

∆Gsolvated

∆𝐺𝐺𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏,𝐴𝐴−𝐵𝐵= ∆𝐺𝐺𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 − ∆𝐺𝐺𝑠𝑠𝑏𝑏𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑏𝑏

∆Gbound

© CressetCONFIDENTIAL

Example: Two ligands in PTP1B

> Number of intermediate states depends on how difficult is the modification from A to B> Need to ensure that any adjacent pair of systems are similar enough

A 10% C90% B

90% A10% C

90% A10% C BC 90% C

10% B

λ = 0

λ = 1

λ = 0.5

λ = 1

λ = 0

λ = 0.5

A 5% A95% B

90% A10% B

95% A5% B B

λ =0 λ =0.05 λ =0.1 λ =0.95 λ =1

© CressetCONFIDENTIAL

> Collaboration with Julien Michel at University of Edinburgh

> Building on top of best open-source software> AmberTools> OpenMM> Improved and extended LOMAP> SIRE> BioSimSpace

> Combines them with Cresset expertise in delivering easy-to-use software

FEP at Cresset

© CressetCONFIDENTIAL

FEP workflow in Flare

Assessment of ∆∆G

Production run (SOMD using OpenMM as the MD engine, 4.0 ns each)

Generation of SOMD input files (BioSimSpace)

Multi-step equilibration (OpenMM, 0.25ns each)

Generation of input files (AmberTools, OFF toolkit)

Generation of perturbation network (LOMAP and/or manually)

© CressetCONFIDENTIAL

Results of Flare final implementation on standard data sets

DatasetCresset/ UoE Schrödinger

(Wang et al.)AMBER TI

(Song et al.)GROMACS

(Gapsys et al.), GAFFR2 (95%CI) MUE R2 (95%CI) MUE R2 (95%CI) MUE R2 (95%CI) MUE

Thrombin 0.82 (0.67-0.91) 0.20 ± 0.05 0.50

(0.23-0.71) 0.42 ± 0.11 0.57(0.31-0.76) 0.37 ± 0.11 0.06

(0.01-0.29) 0.66 ± 0.11

TYK2 0.53(0.32-0.69) 0.70 ± 0.14 0.79

(0.67-0.88) 0.44 ± 0.09 0.33(0.13-0.53) 0.89 ± 0.18 0.47

(0.26-0.65) 0.87 ± 0.11

JNK1 0.52(0.35-0.67) 0.76 ± 0.13 0.71

(0.58-0.81) 1.05 ± 0.10 0.22(0.07-0.40) 0.90 ± 0.15 0.64

(0.49-0.76) 0.80 ± 0.12

CDK2 0.54(0.34-0.71) 0.69 ± 0.14 0.23

(0.06-0.44) 0.88 ± 0.14 0.22(0.07-0.40) 0.91 ± 0.15 0.72

(0.56-0.83) 0.47 ± 0.12

MCL1 0.55(0.43-0.65) 1.18 ± 0.15 0.60

(0.49-0.69) 0.84 ± 0.10 0.42(0.30-0.54) 1.24 ± 0.12 0.59

(0.48-0.68) 0.86 ± 0.11

p38 0.40(0.26-0.53) 0.94 ± 0.14 0.42

(0.29-0.55) 0.86 ± 0.08 0.15(0.05-0.28) 1.28 ± 0.18 0.46

(0.33-0.59) 0.62 ± 0.08

BACE 0.19(0.08-0.32) 0.76 ± 0.10 0.61

(0.50-0.71) 0.67 ± 0.09 0.19(0.08-0.32) 1.03 ± 0.14 0.21

(0.09-0.34) 0.74 ± 0.08

PTP1B 0.48(0.31-0.63) 0.72 ± 0.13 0.65

(0.50-0.76) 0.60 ± 0.11 0.50(0.33-0.64) 0.76 ± 0.14 0.52

(0.35-0.66) 0.77 ± 0.10

MUE = per compound MUE [kcal/mol]

© CressetCONFIDENTIAL

Results of Flare final implementation on standard data sets

© CressetCONFIDENTIAL

Results of Flare final implementation on standard data sets

© CressetCONFIDENTIAL

Limitations

> Ring size changes

> Aromaticity changes

© CressetCONFIDENTIAL

Link direction

> In default mode, we compute both A→B and B→A> For fast mode, we only do one of the two. Which one?

Target R², all R², S2L R², L2S MUEc, all MUEc, S2L MUEc, L2S

Thrombin 0.82 0.91 0.27 0.20 0.14 0.40CDK2 0.54 0.53 0.52 0.69 0.70 0.70TYK2 0.53 0.19 0.51 0.70 1.10 0.72JNK1 0.52 0.49 0.54 0.76 0.77 0.84

PTP1b 0.48 0.49 0.11 0.72 0.75 1.10MCL1 0.55 0.51 0.57 1.18 1.24 1.14BACE 0.19 0.12 0.25 0.76 0.80 0.74

© CressetCONFIDENTIAL

> How reproducible are FEP calculations?

> Repeated several transformations multiple times to get stats> Cl ↔ Br> O ↔ CH2

> H ↔ cyclopropyl> H ↔ cycloheptyl

> Unsurprisingly, variance is larger for larger transformations

Repeatability of calculations

© CressetCONFIDENTIAL

AI charging models

© CressetCONFIDENTIAL

> DNN-based method for computing electrostatic potentials> Builds an underlying charge model that

is similar to XED

> Can we utilise this to replace/augment XED charging?

Improved charge models for field-based similarity

© CressetCONFIDENTIAL

Field patterns and surface potentials are similar

Additional charge points on Cl giveneutral electrostatics along bond axis XED polarizes the system slightly more

Astex AstexXED XED

© CressetCONFIDENTIAL

> ‘Proper’ electrostatics on cations/anions are useless

> Heavily influenced by (unknown) protein environment

> Heuristic: Scale formal charges down (equivalent to a higher dielectric)> XED uses 0.125 or 0.25> Astex use 0.4

> Issues with unrealistic charge distributions – not solved yet

Charged systems are an issue for both methods

Astex XED

© CressetCONFIDENTIAL

> 119 proteins> ~1450 ligands> Generate unbiased confs for each

ligand, align using every other ligand as a reference

> Preliminary numbers (chirality problems)

Tested on AZ/CCDC alignment test data set

Formal charge scale factor 0.4

Formal charge scale factor 0.25

© CressetCONFIDENTIAL

Future research plans

© CressetCONFIDENTIAL

Continue Astex charge model investigations

> Does the Astex charge model improve electrostatic complementarity calculations?> Does it give sensible results for proteins?

> Is it worth re-training the model on the XED geometries?

© CressetCONFIDENTIAL

> HPC-Europa 3 grant> Hosted by Dr. Piero Procacci of the

Department of Chemistry, University of Florence

> 250K hours of GPU time on MARCONI100 cluster at CINECA

> 980 nodes, 4 x Tesla V100 cards per node> Difficulty is the PowerPC architecture…

FEP experiments

© CressetCONFIDENTIAL

> Running several large-scale experiments> What is the reproducibility of FEP calculations?> Can we speed the calculations up without sacrificing accuracy?

> Water box buffer size> Hydrogen mass repartitioning> Improved OpenMM integrator

> Improved handling of ‘difficult’ transformations> Split steric/electronic perturbations

> Adaptive lambda schedules

FEP experiments

© CressetCONFIDENTIAL

More

Docking in Spark (see Giovanna’s talk)

GIST / GCT as alternative water analysis methods

Improvements to ligand alignment scoring function

Improvements to docking algorithm (protein flexibility, waters)

Very high-throughput (billions) 3D virtual

screening of libraries

© CressetCONFIDENTIAL

cressetgroup

mark@cresset-group.com

Mark Mackey

Thanks to Paolo Tosco, Giovanna Tedesco, Stuart Firth-Clark, Nathan Kidley, Peter Cherry, Max Kuhnand Julien Michel, Antonia Mey at the University of Edinburgh