How can we get better at this? Pat Walters – D3R Workshop February 23, 2018

New Walters D3R 2018 - D3R | Welcome... · 2018. 3. 5. · Pat Walters – D3R Workshop February 23, 2018. I’ve Done This a Few Times ... Final Comparisons. 5 The Literature Makes


How can we get better at this?

Pat Walters – D3R Workshop, February 23, 2018

I’ve Done This a Few Times

2012 2013 2014 2015 2016 2017 2018

How I Spend My Time On Challenges

Confidential | © 2017 Relay Therapeutics

• Dealing with poorly formatted submissions
• Validating evaluations
• Making slides

The Evaluation Process

• Pat Evaluate
• Connor and Zied Evaluate
• Final Comparisons

The Literature Makes It Look Like Activity Prediction is a Solved Problem

[Figure: reported Pearson r values of 0.82, 0.80, 0.66 and 0.65]

Scoring Performance From GC2 and GC3

Guidelines For Reviewing “Scoring Function” Papers

We need to agree on
• What constitutes a reasonable dataset
• How data should be reported
• Evaluation metrics
• Statistics for comparison
• What constitutes a null model
• Format of supporting material
• Criteria for reproducibility


Datasets Should Span a Reasonable Dynamic Range

When evaluating a regression model, the dataset should have a dynamic range similar to those observed in drug discovery projects (typically 4-6 logs).

This dataset (PDBbind v.2016 core set) spans 10 logs and doesn’t provide an appropriate representation of correlation.

Correlations Can Change Dramatically With Dynamic Range

R² = 0.22, MAE = 0.69

R² = 0.76, MAE = 0.55

This is the same dataset. On the left we consider the entire set, which has an unrealistically large (~10 log) dynamic range. On the right we consider a more realistic subset with a 3 log dynamic range. Note the change in correlation.
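A small synthetic illustration of this effect, not the data behind the slide. The value ranges, the noise level, the 3 log window and all names are assumptions chosen for the sketch. The same "predictions" are scored once over the full range and once over a narrow window; the per-compound error is the same, but R² is not.

```python
import random


def pearson_r(x, y):
    """Pearson correlation coefficient, computed from its definition."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def mae(x, y):
    """Mean absolute error between two equal-length lists."""
    return sum(abs(a - b) for a, b in zip(x, y)) / len(x)


rng = random.Random(0)

# Assumed synthetic "experimental" values spread over about 10 log units.
experimental = [rng.uniform(2.0, 12.0) for _ in range(500)]
# Assumed synthetic "predictions": truth plus Gaussian noise of 1 log unit.
predicted = [v + rng.gauss(0.0, 1.0) for v in experimental]

# Keep only compounds inside an assumed 3 log window.
subset = [(e, p) for e, p in zip(experimental, predicted) if 6.0 <= e <= 9.0]
sub_exp = [e for e, _ in subset]
sub_pred = [p for _, p in subset]

r2_full = pearson_r(experimental, predicted) ** 2
r2_subset = pearson_r(sub_exp, sub_pred) ** 2
mae_full = mae(experimental, predicted)
mae_subset = mae(sub_exp, sub_pred)

print(f"full range: R2={r2_full:.2f} MAE={mae_full:.2f}")
print(f"3 log window: R2={r2_subset:.2f} MAE={mae_subset:.2f}")
```

In this sketch the error model is identical for every compound, so any difference in R² between the two lines comes from the dynamic range alone.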

GC3 CatS Dataset Spans a Realistic Dynamic Range


Don’t Cram Multiple Datasets onto the Same Plot

http://pubs.acs.org/doi/abs/10.1021/acs.jpcb.7b07224
http://pubs.acs.org/doi/abs/10.1021/ja512751q

Even My Friends Are Guilty

Mill and Neysa (Yesterday)

Trellising provides a much more effective means of comparing datasets


Always report correlations appropriately

• Report Pearson, Spearman and Kendall correlations
• Favor R² over R when reporting a Pearson correlation coefficient
• Report MAE and/or RMSE

I have no idea what this means

http://pubs.acs.org/doi/abs/10.1021/acs.jpcb.7b07224
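One way to compute the metrics listed above with only the Python standard library. This is a sketch, not the evaluation code used for the challenges. The function names and the five example values are assumptions, and Kendall's tau is the simple tau-a variant with no tie correction.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def ranks(values):
    """1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson_r(ranks(x), ranks(y))


def kendall_tau(x, y):
    """Kendall tau-a: (concordant - discordant) / number of pairs."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            s = (x[i] - x[j]) * (y[i] - y[j])
            score += (s > 0) - (s < 0)
    return score / (n * (n - 1) / 2)


def mae(x, y):
    """Mean absolute error."""
    return sum(abs(a - b) for a, b in zip(x, y)) / len(x)


def rmse(x, y):
    """Root mean squared error."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


# Hypothetical experimental and predicted values (log units).
experimental = [5.0, 6.0, 7.0, 8.0, 9.0]
predicted = [5.5, 6.8, 6.4, 7.9, 8.6]

r = pearson_r(experimental, predicted)
summary = {
    "pearson_r": r,
    "r2": r ** 2,
    "spearman": spearman_rho(experimental, predicted),
    "kendall": kendall_tau(experimental, predicted),
    "mae": mae(experimental, predicted),
    "rmse": rmse(experimental, predicted),
}
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```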

Maximum Achievable Correlation

• Start with experimental data
• Add Gaussian error
  § Mean = 0.0
  § Standard deviation = 0.3 log
• Calculate correlation
• Repeat 1000 times

Brown, Scott P., Steven W. Muchmore, and Philip J. Hajduk. "Healthy skepticism: assessing realistic model performance." Drug Discovery Today 14.7 (2009): 420-427.
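A minimal sketch of the simulation steps listed on this slide, assuming the correlation is taken between the original values and their noise-perturbed copies and that the result is summarized as the mean over trials. The function name, the seed and the two example datasets are hypothetical.

```python
import random


def pearson_r(x, y):
    """Pearson correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def max_achievable_r(experimental, error_sd=0.3, n_trials=1000, seed=0):
    """Mean Pearson r between the data and copies with added Gaussian error."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trials):
        # Gaussian error with mean 0.0 and the given standard deviation.
        noisy = [v + rng.gauss(0.0, error_sd) for v in experimental]
        total += pearson_r(experimental, noisy)
    return total / n_trials


# Hypothetical datasets: one spanning ~6 logs, one spanning <1 log.
wide = [4.0 + 0.2 * i for i in range(30)]
narrow = [6.0 + 0.03 * i for i in range(30)]

max_r_wide = max_achievable_r(wide)
max_r_narrow = max_achievable_r(narrow)
print(f"wide range:   max r ~ {max_r_wide:.2f}")
print(f"narrow range: max r ~ {max_r_narrow:.2f}")
```

With the same assumed 0.3 log error, the narrow dataset has a much lower ceiling, which ties this slide back to the dynamic range discussion.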

Maximum Achievable Correlation - HSP90, D3R 1

Open Source Evaluation Code (More to Come)

https://github.com/PatWalters/metk


Ensure That Differences in Correlation Are Significant

“In particular, both MM-PB/SA and MM-GB/SA produced better results by using a representative structure (R = 0.72-0.79) rather than averaging over the conformational ensemble of each given complex (R = 0.61-0.74)”
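One common way to check such a claim is Fisher's z-test for two correlation coefficients. The sketch below assumes the two correlations come from independent samples, which is a simplification when both methods are scored on the same compounds. The sample sizes are hypothetical, since the quoted passage does not give them.

```python
import math
from statistics import NormalDist


def fisher_z_difference(r1, n1, r2, n2):
    """Two-sided z-test for the difference of two independent Pearson r values."""
    z1, z2 = math.atanh(r1), math.atanh(r2)
    # Standard error of the difference of two Fisher-transformed correlations.
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return z, p


# Upper ends of the two quoted ranges, with an assumed n of 50 compounds each.
z_close, p_close = fisher_z_difference(0.79, 50, 0.74, 50)
# A clearly different pair of correlations, for contrast (assumed values).
z_far, p_far = fisher_z_difference(0.90, 100, 0.10, 100)

print(f"0.79 vs 0.74 (n=50):  z={z_close:.2f}, p={p_close:.3f}")
print(f"0.90 vs 0.10 (n=100): z={z_far:.2f}, p={p_far:.3g}")
```

Under these assumptions the first comparison is far from significant at the usual 0.05 level.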

A literature comparison of 7 methods for scoring protein-ligand interactions

[Figure: abs(Pearson r), axis 0.0 to 0.7, for M1_dynamic, M1_static, M2_static, M3_dynamic, M3_static, M4_dynamic, M4_static; Table L2]

Remember that correlations have confidence intervals and report these intervals
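A minimal sketch of one standard way to get such an interval, the Fisher z-transformation. It assumes roughly bivariate normal data; the example r and n are hypothetical, and a bootstrap would be a reasonable alternative.

```python
import math
from statistics import NormalDist


def pearson_confidence_interval(r, n, confidence=0.95):
    """Approximate confidence interval for Pearson r via Fisher's z-transform."""
    z = math.atanh(r)
    se = 1.0 / math.sqrt(n - 3)
    # Two-sided critical value of the standard normal distribution.
    crit = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    return math.tanh(z - crit * se), math.tanh(z + crit * se)


# Hypothetical result: r = 0.70 measured on 30 compounds.
ci_low, ci_high = pearson_confidence_interval(0.70, 30)
print(f"r = 0.70, n = 30: 95% CI = [{ci_low:.2f}, {ci_high:.2f}]")
```

The interval narrows as n grows, so the same r carries very different weight on 30 compounds than on 300.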

It’s All the Same!

[Figure: abs(Pearson r), axis 0.0 to 0.8, for M1_dynamic, M1_static, M2_static, M3_dynamic, M3_static, M4_dynamic, M4_static; Table L2]


Molecular weight and calculated LogP are poor null models

Simple QSAR as a Null Model

• Generate RDKit fingerprints for ligands
• Train on PDBbind refined set (n=4057)
• Test on PDBbind core set (n=290)
• Wall clock time < 5 min

What Constitutes an Appropriate Null Model

[Figure panels: Molecular Weight, XLogP, Simple QSAR]

A Null Model for RMSE

1. Sample N observed values
2. Calculate RMS
3. Repeat 1 and 2 1000 times
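A sketch of one reading of these three steps. It assumes the N sampled values are drawn with replacement from the observed values and treated as the "predictions", and that the null value is the mean RMSE over the trials. The function names and the example values are hypothetical.

```python
import math
import random


def rmse(x, y):
    """Root mean squared error between two equal-length lists."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


def null_rmse(observed, n_trials=1000, seed=0):
    """Mean RMSE obtained when random draws of the observed values are used as predictions."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trials):
        # Step 1: sample N observed values (assumption: with replacement).
        sampled = rng.choices(observed, k=len(observed))
        # Step 2: RMS error of the random "predictions" against the observations.
        total += rmse(observed, sampled)
    # Step 3: the loop repeats steps 1 and 2 n_trials times.
    return total / n_trials


# Hypothetical binding free energies in kcal/mol.
observed = [-12.0 + 0.25 * i for i in range(20)]
baseline = null_rmse(observed)
print(f"null-model RMSE: {baseline:.2f} kcal/mol")
```

A submitted method would then be compared against `baseline`; an RMSE at or above it suggests the method does no better than guessing from the distribution of observed values.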

Null Model for GC1 HSP90 Free Energy Challenge

[Figure: y-axis RMSE (kcal/mol)]

Comparing RMS vs Null for GC1 HSP90 Challenge

Dashed line indicates the null model


Include appropriate supporting information

• Always provide a machine readable table (e.g. csv) of predicted and experimental values
• A table in a paper is not sufficient; it is often very difficult to extract tables from pdf files
• Chemical structures should be included as SDF or, where appropriate, SMILES to facilitate comparison with other methods
• Need to enable readers to evaluate correlations and errors
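A minimal sketch of what such a machine readable table could look like, written and read back with Python's csv module. The column names, compound names, SMILES and values are placeholders, not a format prescribed in the talk.

```python
import csv
import io

# Assumed column layout for the supporting-information table.
FIELDS = ["name", "smiles", "experimental", "predicted"]

# Placeholder records; real entries would come from the study.
records = [
    {"name": "cmpd_1", "smiles": "CCO", "experimental": 6.2, "predicted": 5.9},
    {"name": "cmpd_2", "smiles": "c1ccccc1", "experimental": 7.4, "predicted": 7.8},
    {"name": "cmpd_3", "smiles": "CC(=O)O", "experimental": 5.1, "predicted": 5.5},
]


def write_results(rows, handle):
    """Write one row per compound, with a header line."""
    writer = csv.DictWriter(handle, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


def read_results(handle):
    """Read the table back, converting the numeric columns to float."""
    rows = []
    for row in csv.DictReader(handle):
        row["experimental"] = float(row["experimental"])
        row["predicted"] = float(row["predicted"])
        rows.append(row)
    return rows


# Round trip through an in-memory buffer instead of a file on disk.
buffer = io.StringIO()
write_results(records, buffer)
buffer.seek(0)
round_trip = read_results(buffer)
print(buffer.getvalue())
```

With a file like this, a reader can recompute correlations and errors without retyping a table from a PDF.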


Can I Reproduce Your Method?

What Constitutes Reproducibility?

• Code!!!
• A thorough description of your method
• A web implementation
• None of the above


How Can You Help?

Docking Challenges Have Become More Challenging

Questions on Docking Challenges

Are we spending enough time understanding compounds that docked poorly?
• Insufficient conformational sampling
• Insufficient pose sampling
• Inadequate scoring
• Ligand poses with limited density

Is everyone missing the same compounds?

Can groups work together to improve their methods?

Acknowledgements

• D3R Participants
• CSAR Participants
• TDT Participants
• SAMPL Participants

• Rommie Amaro
• Mike Gilson
• Mill Lambert
• Neysa Nevins
• Connor Parks
• Zied Gaieb
• Shuai Liu


BACKUP

Looks Like Activity Prediction is a Solved Problem

[Figure: reported Pearson r values of 0.82, 0.80, 0.66 and 0.65]


Evaluate maximum possible correlation for a dataset given experimental error

https://www.sciencedirect.com/science/article/pii/S1359644609000403
