an introduction to das
DESCRIPTION
An Introduction to DAS. Andy Jenkinson , EBI. Summary of Topics. What is Data Integration? Problems in Data Integration An architectural overview of DAS Brief History of DAS. What is Data Integration. All These are Data Integration. Reading some papers so you can write a report - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/1.jpg)
Andy Jenkinson, EBI
An Introduction to DAS
![Page 2: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/2.jpg)
Summary of Topics
• What is Data Integration?
• Problems in Data Integration
• An architectural overview of DAS
• Brief History of DAS
![Page 3: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/3.jpg)
What is Data Integration
![Page 4: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/4.jpg)
All These are Data Integration
• Reading some papers so you can write a report
• Exploring some database websites so you can learn about a topic
• Downloading some data from different databases so you can analyse it
• Downloading some data from different databases so you can combine it with your own
![Page 5: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/5.jpg)
All These are Data Integration
• Reading some papers so you can write a report
• Exploring some database websites so you can learn about a topic
• Downloading some data from different databases so you can analyse it
• Downloading some data from different databases so you can combine it with your own
![Page 6: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/6.jpg)
Data Integration
• “Automatic” data integration• pulling in data from different
locations• processing it• creating a resource derived from
the data• done via computers, not humans
• e.g. creating/updating a data warehouse
Warehouse
PDB
EnsemblUniProt
![Page 7: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/7.jpg)
Warehouse model
![Page 8: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/8.jpg)
Data Integration:like herding cats
![Page 9: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/9.jpg)
Databases are all different
![Page 10: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/10.jpg)
Databases evolve
![Page 11: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/11.jpg)
Data ages
![Page 12: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/12.jpg)
Databases are big
![Page 13: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/13.jpg)
Distributed Annotation System
• Distributed
• Client-Server architecture
• Federation
• RESTful web services
![Page 14: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/14.jpg)
Warehouse model
![Page 15: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/15.jpg)
DAS model
![Page 16: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/16.jpg)
Architectural Overview
![Page 17: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/17.jpg)
DAS
• Databases are all different• DAS is a uniform facet of a database – always the same
• Databases change their structure• when the database changes, DAS stays the same
• Databases are updated• DAS data comes directly from the provider so is always fresh
• Databases are big• DAS uses real-time targeted queries
![Page 18: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/18.jpg)
History
Developed circa 1999 for sharing genome annotations
Expanded 2004 onwards• more data types• better metadata• addition of Registry
DAS/2 project• split from DAS, not backwards compatible• inspired some DAS developments
![Page 19: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/19.jpg)
To Summarise…
The Distributed Annotation System is…• A network of biological data sources• An example of federation• A collection of REST web services
The DAS Protocol is…• An integration platform• A client-server protocol• An agreed standard
![Page 20: An Introduction to DAS](https://reader036.vdocument.in/reader036/viewer/2022062323/56815b49550346895dc929cc/html5/thumbnails/20.jpg)
Image Credits
• Flickr/muir.ceardach• Flickr/Horia Varlan• Flickr/Alessandro Pinna• Fotopedia/Jean-Marie Hullot• listicles.com/?p=3485• Google Earth/Cnes/Spot Image• Olivier H. Beauchesne