library compound design methods for custom ... - chemaxon
TRANSCRIPT
![Page 1: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/1.jpg)
Solutions for Cheminformatics
21-25 Nov, 2010, Hyderabad
Library Compound Design Methods for Custom
Library Synthesis
![Page 2: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/2.jpg)
Offers
![Page 3: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/3.jpg)
Usage
![Page 4: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/4.jpg)
Library design by ChemAxon
DB
DB
Databases
Reactions
Molecules
Markush structures
Queries
Compound selection
Similarity searches
Substructure searches
Enumeration
Fuse fragments
R-group composition
Reaction enumeration
Markush enumeration
Library analysis
Clustering
2D similarity screen
3D Shape similarity
screen
Fragmentation
R-group decomposition
Fragmentation
Reagent clipping
![Page 5: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/5.jpg)
Library design by ChemAxon
ChemAxon Technology
Chemical data storage JChemBase
JChem Cartridge for Oracle
Chemical data search JChem search technology
Chemical data visualization JChem for Excel – Marvin
Instant Jchem - Marvin
Chemical data characterization Calculator plugins – logP, pKa ...
Enumeration Reactor – reaction enumeration
Markush enumeration
R-group composition
Fragment fusion
Fragmentation Fragmenter
R-group decomposition
Analysis JKlustor
Screen 3D - Screen 2D
![Page 6: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/6.jpg)
Databases: displaying content on your desktop
JChem for Excel
Instant JChem
![Page 7: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/7.jpg)
Databases: displaying content: JSP application
• Search technology
• Descriptors
• Alignments
• Chemical Terms filter
• Import / Export /Edit
• AJAX in JChem Webservices
ONLINE TRYOUT
https://www.chemaxon.com
![Page 8: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/8.jpg)
Building blocks for library enumeration
Instant JChem
- Fragmentation
JChem for Excel
- R-group decomposition
Command line
- Fragmentation
- R-group decomposition
![Page 9: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/9.jpg)
0.47 0.55
0.57
0.28
0.20
0.06
Which fragments?
regular Tanimoto
optimized Tanimoto
Optimization of Similarity search metrics:
ECFP/FPCP/Chemical FP/ Pharmacophore FP
![Page 10: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/10.jpg)
Similarity searching statistics
1
10
100
1000
10000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
Number of Active Hits
Num
ber
of
Hits
Tanimto Euclidean Optimized Ideal
![Page 11: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/11.jpg)
Enumeration
Output files ChemAxon technology
Fragments Fragment fusion
Markush structures Markush enumeration
(search without enumeration)
Reactants – generic
reactions
Reaction enumeartion
R-tables R-group composition
![Page 12: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/12.jpg)
Enumeration
R-table Markush structure
![Page 13: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/13.jpg)
ChemAxon in Knime
![Page 14: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/14.jpg)
Reaction Enumeration
EXCLUDE: match(reactant(1), "[Cl,Br,I]C(=[O,S])C=C") or
match(reactant(0), "[H][O,S]C=[O,S]") or
match(reactant(0), "[P][H]") or
(max(pka(reactant(0), filter(reactant(0),
"match('[O,S;H1]')"), "acidic")) > 14.5) or
(max(pka(reactant(0), filter(reactant(0),
"match('[#7:1][H]', 1)"), "basic")) > 0)
![Page 15: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/15.jpg)
ChemAxon in Knime
![Page 16: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/16.jpg)
Library analysis
• Characterisation of library:
– Fragments - Fragmenter
– Molecular descriptors – Calculator plugins
![Page 17: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/17.jpg)
Library analysis – 3D shape similarity search
Test on DUD
1% Enrichment
0
5
10
15
20
25
30
35
40
ADA CDK2 DHFR ER FXA HIVRT NA P38 thrombin TK trypsin
Perc
en
t o
f th
e a
cti
ves f
ou
nd
Surflex-sim
ROCS
FlexS
ICMsim
CXN-H
Giganti et al. J. Chem. Inf. Model. 2010, 50, 992
![Page 18: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/18.jpg)
Library analysis
Wide range of methods• Unsupervised, agglomerative
clustering
• Hierarchical and non-hierarchical
methods
• Similarity based and structure
based techniques
Flexible search options• Tanimoto and Euclidean metrics,
weighting
• Maximum common substructure
identification
• chemical property matching
including atom type, bond type,
hybridization, charge
![Page 19: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/19.jpg)
Use cases
![Page 20: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/20.jpg)
Use cases
![Page 21: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/21.jpg)
Use cases
![Page 22: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/22.jpg)
Use cases
![Page 23: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/23.jpg)
Use cases
![Page 24: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/24.jpg)
Use cases
![Page 25: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/25.jpg)
Use cases
![Page 26: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/26.jpg)
Use cases
![Page 27: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/27.jpg)
Use cases
![Page 28: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/28.jpg)
Target-focused libraries:
rapid selection of potential PDE inhibitors from
multi-million compounds’ repositories
Why do we need rapid selection of target- focused libraries?
Design inputs
2D similarity searching strategy
Property-based filtering
Seed/ chemotype representation (diversity)
Conclusion/ Proposals
TargetEx Ltd.,
György Dormán
![Page 29: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/29.jpg)
Target -focused libraries via Virtual Screening
H-bond Acceptor
AromaticH-bond donor
Cation
Docking Target structure
Source CompoundsCommercial Samples
Combinatorial Libraries/Historical collectionsDe Novo Compounds
Known Active Compounds
2DSubstructure-
Similarity SearchingPartitioningData FusionClustering
KernelsSVM
3D Pharmacophore
Shape Similarity3D/4D-QSAR
Final Visual InspectionAcquisition
Plating
FilteringADMET
Lead-likeness
Biological testing2D fingerprint
Focused library
TargetEx Ltd.,
György Dormán
![Page 30: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/30.jpg)
2D similarity selection
January 5, 2011
![Page 31: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/31.jpg)
Similarity searching strategy: execution
• Setting the starting similarity level (dependent on the
fingerprint S/W, T= 60-75 % for ChemAxon)
• Iteration based on the results (scenarios):
• the number of virtual hits are between 50 and 500,
OK
• the number of virtual hits are <50 or >500
– if <50 lower the similarity treshold with 5 %
– if >500 increase the similarity treshold with 5 %
– This can be continued until the optimal range achieved
– If 5 % decrease results in >500 compounds the search can
be refined by 2% (alternatively a diversity selection would
be needed, but that is not available)
– Duplications can be removed when merged the resulting
DBs
![Page 32: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/32.jpg)
How to reduce the number of the hits?
Normally screening companies would like to buy 100-1000
compounds
• Since from the various vendor DBs we can obtain 2000-
10.000 virtual hits their number can be reduced
• 1. Applying the reference property space (Lipinski and Veber
rules) (IJchem OK)
• 2. There are overrepresented seeds thus virtual hits coming
from those seeds can be reduced (IJchem OK)
• 3. Applying an optimal distribution of the resulting chemotypes
(removing the overrepresented compounds) (Limited with
Jklustor)
• 4. Simple diversity analysis (JKlustor)
![Page 33: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/33.jpg)
TargetEx Ltd., György Dormán
1. Applying the reference property space: Structural determinants: H-bond donor/ acceptor, hydrophobic interactions
(property space determination)
Pharmacophore fingerprints requires more computation and time consuming
In simple similarity search pharmacophore features can only be
considered as statistical features (not connected to structures)
The similarity search results can be filtered based on the physico-chemical
parameter space of the seed compounds (+10/-10 % range applied)
![Page 34: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/34.jpg)
Results and further reduction
• Similarity search results: 8655
• After property filtering: 2009
• 2. There are overrepresented seeds thus virtual
hits coming from those seeds can be reduced
• When combining the similarity search the
contribution of the seeds can be controlled (or set
the number of analogues derived from certain
seeds)
![Page 35: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/35.jpg)
2. Overrepresented seeds
Seeds leading to highest number of similar hits
HN
NN
N
O
CH3
CH3
CH3
S
N
N
H3C
O
O
O
#4 (Sildenafil)
238 analogues
(60 % similarity or above)
H3C
O
S
O
O
N
O
N
O
HN
CH3
#13
328 analogues
(60 % similarity or above)
H3C
O
S
O
O
N
O
N
O
HN
CH3
#18 (desantafil)
4494 analogues
(60- 80 % similarity)
![Page 36: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/36.jpg)
2. Overrepresented seeds
Seeds leading to highest number of similar hits
O
CH3
SO O
N
N
H3C
HN
N
N
O
N
CH3
CH3
#27
237 analogues
(60 % similarity or above)
N
N
HN
O
CH3
O
CH3
NN
#28
272 analogues
(60 % similarity or above)
N
N
Cl
HN
N
OH
O
O
O
#30
466 analogues
(60 % similarity or
above)NH
N
O
O
N
O
OCH
3
#44
2726 analogues
(60 % similarity or above)
![Page 37: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/37.jpg)
Recurring structural motifs in the seed structures
![Page 38: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/38.jpg)
Recurring structural motifs in the similarity search results
![Page 39: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/39.jpg)
3. Applying an optimal distribution of the resulting chemotypes
Proposed application of JKlustor/LibMCS
• Taking into consideration of the substructure
where the maximum number of connection
(bond) is found
– it can be an option
– Maybe difficult to define
• Using such option the „real” core structure can be
found easier
![Page 40: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/40.jpg)
Use caseIan Berry
Evotec
![Page 41: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/41.jpg)
Evotec Library Profiler
• Aim is to be able to select from a large virtual library either:
– A combinatorial subset
• Typically small focussed libraries
– A non-combinatorial subset
• Medicinal chemistry projects
• Desirable to allow access to all scientists
– Creativity
– Share ideas
– Security aspect
• Interactive
• Subsets need to satisfy multiple criteria
![Page 42: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/42.jpg)
Workflow
Enumerate
Virtual
Library
Export to fileImport into
Esma
Property
calculation
Filtering /
analysisExport to file
Import into
Jchem for
Excel
Filtering /
analysis
Import into
Spotfire
![Page 43: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/43.jpg)
Workflow using the Library Profiler
Enumerate
Virtual
Library
Select
properties
to calculate
Filtering /
analysis
Export to fileFurther
analysis
![Page 44: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/44.jpg)
5-Jan-1144
![Page 45: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/45.jpg)
Charting – Scatter plot
![Page 46: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/46.jpg)
Pivot View - Properties
![Page 47: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/47.jpg)
Using ChemAxon tools
• High usage of Marvin View and Sketch
– Easy to integrate
• JChem cartridge for filtering
– Experience in using cartridge
• JChem tools for many of the property calculations
– HBD, HBA, ROT, AMW, TPSA, Veber Bioavailability, BBB
distribution, undesirable functional groups, Andrews
AVERAGE energy, Bioavailability score, Ligand binding
efficiency, PGP Substrate prediction, pKa, protonated
atom count, non-H atom count
![Page 48: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/48.jpg)
WORKSHOP AT 14:00HANDS-ON SESSION
Focused and diverse library
generation by ChemAxon
technology
![Page 49: Library Compound Design Methods for Custom ... - ChemAxon](https://reader031.vdocument.in/reader031/viewer/2022012502/617b6a2dd4ba6053e913b802/html5/thumbnails/49.jpg)
Visit other technical presentations
www.chemaxon.com