serving the medicinal chemistry community with royal society of chemistry cheminformatics platforms
DESCRIPTION
The Royal Society of Chemistry (RSC) is a major participant in providing access to chemistry related data via the web. As an internationally renowned society for the chemical sciences, a scientific publisher and the host of the ChemSpider database for the community, RSC continues to make dramatic strides in providing online access to data. ChemSpider provides access to over 30 million chemicals sourced from over 500 data suppliers and linked out to related information on the web. The platform is a crowdsourcing environment whereby members of the community can participate in validating and expanding the content of the database. With a set of application programming interfaces ChemSpider is used by various organizations and projects to serve up data for various purposes. These include structure identification for mass spectrometry instrument vendors, RSC databases such as the Marinlit natural products database and a European grant-based project from the Innovative Medicines Initiative fund. This presentation will provide an overview of various cheminformatics activities and projects that RSC is involved with to serve the medicinal chemistry community. This will include the Open PHACTS semantic web project, the PharmaSea project to identify new pharmaceutical leads from the ocean and the UK National Compound Collection to identify new lead compounds contained within PhD theses.TRANSCRIPT
![Page 1: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/1.jpg)
Serving the Medicinal Chemistry Community with RSC
Cheminformatics Platforms
Antony WilliamsBrazilian Medicinal Chemistry Conference,
November 11th 2014
![Page 3: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/3.jpg)
Chemistry for the Community
• The Royal Society of Chemistry as a provider of chemistry for the community:• As a charity • As a scientific publisher• As a host of commercial databases• As a partner in grant-based projects• As the host of ChemSpider• New: the RSC Data Repository for Chemistry
![Page 4: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/4.jpg)
Overwhelmed with data…
![Page 5: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/5.jpg)
So much online data…
![Page 6: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/6.jpg)
Organizations releasing data
![Page 7: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/7.jpg)
Funders encourage openness
![Page 8: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/8.jpg)
We model data – then lose it
• What if we could share models and the underlying data via a central repository?
• This is MOSTLY not a technology issue!!!
![Page 9: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/9.jpg)
Pharma Companies Repeat Work
Pre-competitive Informatics:Pharma are all accessing, processing, storing & re-processing external research data
LiteraturePubChem
GenbankPatents
DatabasesDownloads
Data Integration Data AnalysisFirewalled Databases
Repeat at each
companyx
![Page 10: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/10.jpg)
Publications lock up data
![Page 11: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/11.jpg)
When I finish this article…
![Page 12: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/12.jpg)
The data will be locked up..
![Page 13: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/13.jpg)
But what if we could navigate?
What’s the structure?
Are they in our file?
What’s similar?
What’s the target?Pharmacology
data?
Known Pathways?
Working On Now?Connections
to disease?
Expressed in right cell type?
Competitors?
IP?
![Page 14: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/14.jpg)
![Page 15: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/15.jpg)
• ~30 million chemicals and growing• Data sourced from >500 different sources• Crowd sourced curation and annotation• Ongoing deposition of data from our
journals and our collaborators• Structure centric hub for web-searching• It’s a really big dictionary!!!
![Page 16: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/16.jpg)
ChemSpider
![Page 17: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/17.jpg)
ChemSpider
![Page 18: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/18.jpg)
Experimental/Predicted Properties
![Page 19: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/19.jpg)
Literature references
![Page 20: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/20.jpg)
Google Books
![Page 21: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/21.jpg)
Patents references
![Page 22: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/22.jpg)
RSC Databases
![Page 23: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/23.jpg)
Vendors and data sources
![Page 24: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/24.jpg)
With structures and names…
![Page 25: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/25.jpg)
Name Searching
![Page 26: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/26.jpg)
Standards for Integration
![Page 27: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/27.jpg)
Structure Searching
![Page 28: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/28.jpg)
What did we learn???
• Data Quality is an enormous challenge• Crowd sourced annotation can help!
![Page 29: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/29.jpg)
Crowdsourced Enhancement
• The community can clean and enhance the database by providing Feedback and direct curation
• Tens of thousands of edits made
![Page 30: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/30.jpg)
But Software Can Help
• SRS as guidance for standardization rules
![Page 32: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/32.jpg)
ChemSpider is a building block
![Page 33: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/33.jpg)
…for grant-based services
• Use ChemSpider data slices and API/Web services to support grant-based projects
• Multiple European consortium-based grants• PharmaSea (FP7 funded)• Open PHACTS (IMI funded)
• Support Open Drug Discovery projects
![Page 34: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/34.jpg)
![Page 35: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/35.jpg)
Over half of all drugs introduced between 1940 and 2006 were of natural origin or inspired by natural compounds
![Page 36: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/36.jpg)
![Page 38: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/38.jpg)
PharmaSea
![Page 39: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/39.jpg)
..as a dereplication platform
![Page 41: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/41.jpg)
• 3-year Innovative Medicines Initiative project
• Integrating chemistry and biology data using semantic web technologies
• Open code, open data, open standards
• Academics, Pharma Companies, Publishers…
![Page 42: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/42.jpg)
The Open PHACTS community ecosystem
![Page 44: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/44.jpg)
Chemistry Searching…
![Page 45: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/45.jpg)
But what about Biology???
![Page 47: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/47.jpg)
Open PHACTS Explorer
![Page 48: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/48.jpg)
Pharmacology by Compound
![Page 49: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/49.jpg)
Compounds and enzymes
![Page 50: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/50.jpg)
Compounds and enzymes
![Page 51: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/51.jpg)
Pharmacology by Target
![Page 52: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/52.jpg)
![Page 53: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/53.jpg)
Facilitated by ChemSpider RDF
![Page 54: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/54.jpg)
Open Sourcing Data and Code
• Open PHACTS data licensed as Open Data – approx. 2 Million chemicals
• Open Source code to release to community (from Open PHACTS github site)• Chemical Registration Service• Chemical Validation and Standardization Platform
![Page 55: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/55.jpg)
ChemSpider as a “dictionary”
• Systematic name(s)• Trivial Name(s)• SMILES• InChI Strings• InChIKeys• Database IDs• Registry Number
![Page 56: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/56.jpg)
Valium on ChemSpider
With strong dictionaries connections can be made…
![Page 57: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/57.jpg)
Semantic Mark-up of Articles
![Page 58: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/58.jpg)
Linking Names to Structures
![Page 59: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/59.jpg)
MedChemComm
…and providing more links…
![Page 60: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/60.jpg)
Mark-up of MedChemComm
![Page 61: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/61.jpg)
Mark-up of MedChemComm
![Page 62: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/62.jpg)
Mark-up of MedChemComm
![Page 63: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/63.jpg)
Links out from MedChemComm
![Page 64: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/64.jpg)
Links out from MedChemComm
![Page 65: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/65.jpg)
What about old publications?
• We would LOVE to bring data out of our archive• Find chemical names and generate structures• Find chemical images and generate structures• Find reactions – and make a database!• Find data (MP, BP, LogP) and host. Build models!• Find figures and database them• Find spectra (and link to structures)• Validate the data algorithmically
![Page 66: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/66.jpg)
RSC Archive – since 1841
![Page 67: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/67.jpg)
SO MANY reactions!
![Page 68: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/68.jpg)
Text Mining
The N-(β-hydroxyethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiadiazol-5-yl)urea prepared in Example 6 , thionyl chloride ( 5 ml ) and benzene ( 50 ml ) were charged into a glass reaction vessel equipped with a mechanical stirrer , thermometer and reflux condenser .
The reaction mixture was heated at reflux with stirring , for a period of about one-half hour .
After this time the benzene and unreacted thionyl chloride were stripped from the reaction mixture under reduced pressure to yield the desired product N-(β-chloroethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiaidazol-5-yl)urea as a solid residue
![Page 69: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/69.jpg)
Text Mining
The N-(β-hydroxyethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiadiazol-5-yl)urea prepared in Example 6 , thionyl chloride ( 5 ml ) and benzene ( 50 ml ) were charged into a glass reaction vessel equipped with a mechanical stirrer , thermometer and reflux condenser .
The reaction mixture was heated at reflux with stirring , for a period of about one-half hour .
After this time the benzene and unreacted thionyl chloride were stripped from the reaction mixture under reduced pressure to yield the desired product N-(β-chloroethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiaidazol-5-yl)urea as a solid residue
![Page 70: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/70.jpg)
But names = structures
• Systematic names can be generated FROM chemical structures algorithmically
![Page 71: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/71.jpg)
..and Context Gives Reactions
The N-(β-hydroxyethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiadiazol-5-yl)urea prepared in Example 6 , thionyl chloride ( 5 ml ) and benzene ( 50 ml ) were charged into a glass reaction vessel equipped with a mechanical stirrer , thermometer and reflux condenser .
The reaction mixture was heated at reflux with stirring , for a period of about one-half hour .
After this time the benzene and unreacted thionyl chloride were stripped from the reaction mixture under reduced pressure to yield the desired product N-(β-chloroethyl)-N-methyl-N'-(2-trifluoromethyl-1,3,4-thiaidazol-5-yl)urea as a solid residue
![Page 72: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/72.jpg)
ChemSpider Reactions
![Page 73: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/73.jpg)
Text spectra?
• 13C NMR (CDCl3, 100 MHz): δ = 14.12 (CH3), 30.11 (CH, benzylic methane), 30.77 (CH, benzylic methane), 66.12 (CH2), 68.49 (CH2), 117.72, 118.19, 120.29, 122.67, 123.37, 125.69, 125.84, 129.03, 130.00, 130.53 (ArCH), 99.42, 123.60, 134.69, 139.23, 147.21, 147.61, 149.41, 152.62, 154.88 (ArC)
![Page 74: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/74.jpg)
1H NMR (CDCl3, 400 MHz): δ = 2.57 (m, 4H, Me, C(5a)H), 4.24 (d, 1H, J = 4.8 Hz, C(11b)H), 4.35 (t, 1H, Jb = 10.8 Hz, C(6)H), 4.47 (m, 2H, C(5)H), 4.57 (dd, 1H, J = 2.8 Hz, C(6)H), 6.95 (d, 1H, J = 8.4 Hz, ArH), 7.18–7.94 (m, 11H, ArH)
![Page 75: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/75.jpg)
Turn “Figures” Into Data
FIGURE
EXTRACTED
![Page 76: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/76.jpg)
Models published from data
![Page 77: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/77.jpg)
Text-mining Data to compare
Presently extracting property data from
Google Patents as test
![Page 78: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/78.jpg)
National Compound Collection
• Unlock chemistry data in PhD theses• Discover novel molecules for biosciences• Working together with industry and the
academic community• Build the data into RSC online platforms• Perform virtual screening/modeling and access
physical samples to screen
![Page 79: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/79.jpg)
We should make sure thesis data are available in consumable formats – compounds, reactions, analytical data etc.
![Page 80: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/80.jpg)
What are we building?
• We are building the “RSC Data Repository”• Containers for compounds, reactions, analytical
data, tabular data• Algorithms for data validation and standardization • Flexible indexing and search technologies• A platform for modeling data and hosting existing
models and predictive algorithms• Chemistry RESEARCH DATA MANAGEMENT
![Page 81: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/81.jpg)
Compounds
![Page 82: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/82.jpg)
Reactions
![Page 83: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/83.jpg)
Analytical data
![Page 84: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/84.jpg)
Crystallography data
![Page 85: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/85.jpg)
What’s the structure?
Are they in our file?
What’s similar?
What’s the target?Pharmacology
data?
Known Pathways?
Working On Now?Connections
to disease?
Expressed in right cell type?
Competitors?
IP?
With data in hand maybe it’s time
![Page 86: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/86.jpg)
Conclusions
• We are building platforms that can support multiple communities, including MedChem
• We are working hard to extend our data and improve quality of online data
• Opening APIs to the platforms and data provides a powerful building block
• We are Open Sourcing components of our platforms to the community
![Page 87: Serving the medicinal chemistry community with Royal Society of Chemistry cheminformatics platforms](https://reader036.vdocument.in/reader036/viewer/2022081513/5584af49d8b42a045c8b5071/html5/thumbnails/87.jpg)
Thank youEmail: [email protected]: 0000-0002-2668-4821 Twitter: @ChemConnectorPersonal Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams