kdeep: protein-ligand absolute binding affinity prediction ... · dissociation constants (kd)...

18
K DEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural Networks Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis Computational Biophysics Laboratory, Universitat Pompeu Fabra, Barcelona, Spain Presenter: Eli Draizen https://qdata.github.io/deep2Read Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biom` K DEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural N Presenter: Eli Draizen https://qdata.githu

Upload: others

Post on 09-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

KDEEP : Protein-Ligand Absolute Binding AffinityPrediction via 3D-Convolutional Neural Networks

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni DeFabritiis

Computational Biophysics Laboratory, Universitat Pompeu Fabra, Barcelona, Spain

Presenter: Eli Draizenhttps://qdata.github.io/deep2Read

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 1 / 18

Page 2: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

Outline

1 Introduction

2 Method

3 Comparison

4 Results

5 Conclusion

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 2 / 18

Page 3: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

IntroductionComputational Drug Design

How will small molecules (drugs) bind to a protein?

Where will the drug bind to the protein? DeepSite

How strong of an interaction will it be? This paper

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 3 / 18

Page 4: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

IntroductionStructure Representations

Figure: Left: Protein Structure. Right: Voxel Space (from David Koes)

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 4 / 18

Page 5: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

MethodVoxelization: Lennard-Jones Potential

Extract a 24A Volume Around Known BindingSite

Calculate the contribution, n, of every atom toevery voxel in the 24A Volume using therepulsive force for L-J Potential

For every voxel, take the maximum featurevalue from all atoms that contribute to it

n(r) = 1 − exp

(−(rvdwr

)12)

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 5 / 18

Page 6: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

MethodVoxelization: Lennard-Jones Potential

The Lennard-Jones potential describes the energy potential between twonon-bonded atoms based on their distance to each other.

V = 4ε

[(σr

)12

−(

σr

)6]V = intermolecularpotentialσ = distance where V is 0r = distance between atoms,measured from one center tothe otherε = interaction strength

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 6 / 18

Page 7: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

MethodFeaturization

16 Features/Channels for every voxel

8 descriptors doubled for protein and ligand

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 7 / 18

Page 8: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

Method3D-CNN

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 8 / 18

Page 9: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

MethodData Sources

PDBbind (v.2016), (n=13,308) complexes with experimental:

dissociation constants (Kd)inhibition constants (Ki)half-concentration values (IC50)

A smaller refined subset (nr = 4057) by exluding complexes with:

a resolution higher than 2.5 Aan R-factor higher than 0.25ligands bound through covalent bondsternary complexes or steric clashesaffinity not reported either in Kd or Kifalling out of a desired range (Kd ¡ 1pM)

A core set (nc = 290) is a representative non-redundant subset

But there was no difference in performance: ”We tested this by usingthe PDBbind full minus core set as training and saw no significantperformance difference in any test set.”

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 9 / 18

Page 10: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

MethodData Sources – Target Set

A target set (n=58) of thecore clustered using a 90%sequence similarity

average standard deviation of1.77 pK units.

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 10 / 18

Page 11: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

ComparisonOther Protein:Ligand Binding Affinity Predictors and Datasets

Predictors

RF-Score: Random forestX-Score: Linear RegressionCy-Score: Linear Regression using a curvature feature

Datasets

CSAR NRC-HiQ (4 data sets: 55, 55, 49, 49CSAR2012 57 proteinligand complexes from the D3R challengeCSAR2014, with 47 complexes from by the D3R consortium

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 11 / 18

Page 12: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

ResultsPearson Correlations coefficients for each target

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 12 / 18

Page 13: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

Results (Core Set)Predicted vs. Truth

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 13 / 18

Page 14: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

Results (Core Set)KDEEP prediction for CSAR Datasets

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 14 / 18

Page 15: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

Results (Core Set)RF-Score prediction for CSAR Datasets

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 15 / 18

Page 16: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

Results (Core Set)Correlations for all programs against CSAR Datasets

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 16 / 18

Page 17: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

ConclusionProblems

Input Volumes (24A) seems too small

It may ignore too much of the binding site and/or other importantsurface residues.They say it is because of memory constraints, but do not discuss sparse3DCNNS.

L-J Voxelization method is confusing and will be slow for full proteins

No alternate poses considered

No evolutionary information was included

How do small changes around the binding site effect it?

Small datasets

Binding Site and Pose Predictions not incorporated nor re-added backinto the training set (David Koes’ researchhttps://www.youtube.com/watch?v=jPg-4rkjZKc)

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 17 / 18

Page 18: KDEEP: Protein-Ligand Absolute Binding Affinity Prediction ... · dissociation constants (Kd) inhibition constants (Ki) half-concentration values (IC50) A smaller re ned subset (nr

ConclusionSummary

They created a new binding affinity predictor using 3DCNNs on 24Avolumes

KDEEP performs well and is competitive against other binding sitepredictors when compared just against PDBbind

However, no method outperforms the others on the other datasets

Jose Jimenez, Miha Skalic, Gerard Martinez-Rosell, and Gianni De Fabritiis (Universitat Pompeu Fabra, Parc de Recerca Biomedica de Barcelona, Carrer del Dr. Aiguader 88, Barcelona 08003, Spain)KDEEP : Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural NetworksPresenter: Eli Draizen https://qdata.github.io/deep2Read 18 / 18