scientific & technical presentation - wordpress.com · 2011-02-14 · structural overview web...
TRANSCRIPT
![Page 1: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/1.jpg)
version 5.3, February 2010
Scientific & technical presentation
JChem Base
![Page 2: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/2.jpg)
Introduction to JChem Base
High performance Java based tools for:
storage, search and retrieval of chemical
structures and associated data
The components can be integrated into
web-based or standalone applications
in association with other ChemAxon tools
![Page 3: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/3.jpg)
Structural overview
Web
browserApplication Web application
JChem Base API:Chemical logicStructure cache
JDBC driver: Standard interface to the RDBMS
RDBMS (e.g. Oracle, MySQL, etc.) :
Storage and security
![Page 4: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/4.jpg)
Compatibility and integration
File formats:• SMILES• MDL molfile (v2000 and v3000)
• MDL SDF• RXN• RDF• MRV• IUPAC name, InChI• Markush DARC• CDX
Integration:extensive API for• Java• .NET• JChem Cartridge for Oracle
Database engines:• Oracle• MySQL• MS SQL Server• PostgreSQL• MS Access• IBM DB2• Derby• etc.
Operating systems:• Windows• Linux• Mac OS X• Solaris• etc.
![Page 5: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/5.jpg)
JSP example application
Features:• Substructure, Superstructure, Full,
Exact fragment, Similarity and Perfect search
• Molecular Descriptor similarity search with descriptor coloring
• Substructure hit alignment and coloring, inverse hit list
• Chemical Terms filter• Import / Export• Export of hits• Insert / Modify / Delete structures• AJAX in JChem Webservices
![Page 6: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/6.jpg)
Structure search features
See detailed information on structure search: www.chemaxon.com/conf/Structural_Search.ppt
• Wide range of query atoms• Query properties• R-group queries• Full SMARTS support• Coordination compounds• Link nodes• Pseudo atoms, lone pairs• Relative stereo• Reaction search features• Hit coloring, position variation• Polymers
![Page 7: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/7.jpg)
Search options
Some selected structure search options:•Stereo on/off•Ignore charge/isotope/radical/valence/polymers, etc.•Vague bond matching options•Chemical Terms filter•Tautomer search•Inverse hit list•Maximum search time / number of hits•Combine with non-structure conditions•Ordering of results•etc.
![Page 8: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/8.jpg)
JChem Base 5.2.2, Intel Quad Q6600 2.4GHz, 8 GB RAM;Oracle 10.2.0.3
Performance (1)
Number of compounds
Elapsed timeDuplicates not
checkedDuplicates checked
10,000 21 s 26 s
100,000 2 min 4 s 2 min 34 s
200,000 4 min 24 s 5 min 13 s
Query Number of hits Search time
2 0.91 s
93 0.98 s
6,001 1.30 s
146,256 5,66 s
Compound registration:
Substructure search inPubChem (19.5 millioncompounds):
![Page 9: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/9.jpg)
Performance (2)
Similarity search:Tanimoto >0.9
JChem Base 5.2.2, Intel Quad Q6600 2.4GHz, 8 GB RAM; Oracle 10.2.0.
Query Number of hits Search time
0 3.39 s
0 3.82 s
0 3.33 s
![Page 10: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/10.jpg)
Markush structures
Markush structure registration and search• Markush features
• R-groups
• Atom lists, bond lists
• Position variation bond
• Link nodes and repeating units
• Homology variation (alkyl, aryl, etc.)
• Compatible Markush enumeration plugin
![Page 11: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/11.jpg)
Administration with JChemManager
User interface for• creating tables• import• export• deleting rows• dropping tables
Most functions are also available from command line.
![Page 12: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/12.jpg)
Standardization
• Default standardization includes:
– Hydrogen removal
– Aromatization
• Custom standardizationcan be specified for each table by specifying an XMLconfiguration file at table creation or in the “Table Options” dialog of JChem Manager (jcman)
![Page 13: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/13.jpg)
Custom Standardization Example
afterbefore
Standardizer http://www.chemaxon.com/conf/Standardizer.ppt
![Page 14: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/14.jpg)
The property table
The property table stores information about JChem structure tables, including:
• Fingerprint parameters
• Custom standardization rules
• Other table options and information
More than one property table can be used, each property table represents a particular JChem environment.
![Page 15: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/15.jpg)
Table types
Control allowed chemical structures and available operations
• Molecule
• Reaction
• Markush
• Query
• Any structure
![Page 16: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/16.jpg)
The structure of JChem tables
Column name Explanationcd_id unique numeric identifier in the tablecd_structure the imported structure in the original format, without
modifications (except for the removal of data fields)cd_smiles; cd_smarts; cd_markush
the standardized structure format dependig on the different table types, used by the search process
cd_formula the formula of the standardized structurecd_sortable_formula formula representation for alphanumerical sortingcd_molweight the molecular weight of the standardized structurecd_hash; cd_flags;cd_fp…
fields used internally for structure searching
cd_timestamp the date and time of the insertion of the row[user fields] custom data fields can be added by the user
![Page 17: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/17.jpg)
Structural search in database
Two stage method provides optimal performance:
1. Rapid pre-screening reduces the number ofpossible hit candidates
• Chemical Hashed Fingerprints are used forsubstructure and superstructure searches
• Hash code is used for duplicate filtering(usually during compound registration)
2. Graph search algorithm is used to determine the final hit list
![Page 18: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/18.jpg)
Structure Cache
• Contains Fingerprints for screening and ChemAxon Extended SMILES for ABAS
• Instant access to the structures for the search process
• Reduced load on the database server
• Incremental update ensures minimum overhead after changes in the table
• Small memory footprint due to – SMILES compression– Optimized storage technique
• Approximately 100MB memory needed for 1 million typical drug-like structures (using default, 512 bit long fingerprints)
![Page 19: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/19.jpg)
Future plans
• Graphical user interface for R-group decomposition
• Arbitrary table structure (Java and .NET API for JChem index)
• Maximum common substructure search type
• Additional layer: JChem Server (later also as grid)
• Compound registration system API
![Page 20: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/20.jpg)
Summary
ChemAxon’s JChem Base API provides sophisticated high performance tools for the developer to deal with chemical structures and associated data.
Building on the JChem API is convenient, because:
• Our various tools integrate seamlessly
• Both high and low level API classes are available
• Responsive developer-to-developer support
![Page 21: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/21.jpg)
Links
• JChem home page:http://www.chemaxon.com/products/jchem-base
• Online tryout:http://www.chemaxon.com/jchem/examples.html
• API documentation:http://www.chemaxon.com/jchem/doc/api/index.html
• Brochure:www.chemaxon.com/brochures/JChemBase.pdf
![Page 22: Scientific & technical presentation - WordPress.com · 2011-02-14 · Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC](https://reader035.vdocument.in/reader035/viewer/2022063001/5f1c5f24eda3cc5c5c5b8e46/html5/thumbnails/22.jpg)
Visit other technical presentations
MarvinSketch/View http://www.chemaxon.com/MarvinSketch_View.ppt
MarvinSpace http://www.chemaxon.com/MarvinSpace.ppt
Calculator Plugins http://www.chemaxon.com/Calculator_Plugins.ppt
JChem Base http://www.chemaxon.com/JChem_Base.ppt
JChem Cartridge http://www.chemaxon.com/JChem_Cartridge.ppt
Standardizer http://www.chemaxon.com/Standardizer.ppt
Screen http://www.chemaxon.com/Screen.ppt
JKlustor http://www.chemaxon.com/JKlustor.ppt
Fragmenter http://www.chemaxon.com/Fragmenter.ppt
Reactor http://www.chemaxon.com/Reactor.ppt