building while flying
TRANSCRIPT
what I’ll talk about
• What is Graphdb anyway?
• Graph Tech
• What’s under the hood?
• building while flying
• demo
Bob AliceFriends With
id Name Age Location
1 Bob 32 Boston
2 Alice 21 San Diego
User 1 User 2
1 2
id Post
1 Graphs are cool!
User Post
1 1
2 1
Graphs are cool
Post
s
Posts
Users
Users PostsPosts
Friends
Bob Alice
Friends With
Post
s
Posts
Graphs are cool
Social Network
The paper written by Leonhard Euler on the Seven Bridges of Königsberg and published in 1736 is regarded as the first paper in the history of graph theory.
History
Node Node
Edge
Edge
EdgeNode
Graphs
A graph is a representation of a set of nodes where some pairs of nodes are connected by edges.
Node Node
Edge
Edge
EdgeNode
Property Graphs
Property 1Property 2Property 3 Property 1
Property 2
Property 4Property 5Property 6
Property 1Property 2Property 8
Property 4Property 5Property 6
Property 4Property 6
Bob
Graph Team
Visualization Group
ERB Team
Analytics Department
Team Outing
“Intro to Graph”
Document
Server A
Works at
Part of
Leads Logged to
Creates
Organizes
Alice
QA Team
Toast Masters
Part Of
Part of
Leads
Logged to
Manages
Edits
Works at
Manager Of
And this one as well
Graph Use Cases
Recommendation enginesAnomaly/Fraud DetectionNetwork Analysis/Route planningSocial NetworksIdentity/Access ManagementGraph-based searchMaster Data Management
Graph is the ‘Natural’ way to represent and query connected
Car
Alaska
Fred
Chips
Boston
Jane
Salsa
Soda
NYC
John
Sold in
Bought
BoughtBought
Bought
BoughtLives in
Lives in
Knows
Knows
Made in
Recommendations
Suggest to Fred that he might like some salsa or some Soda
Patient E
Alaska
Dr A
Patient D
Boston
Dr B
Patient B
Patient A
NYC
Patient F
Lives in
Files Claim
Files ClaimFiles Claim
Files a Claim
Files a ClaimLives in
Lives in
Files Claim
Knows
Lives
Fraud Detection
Why would a Dr that lives in Boston be filing claims against patients in Alaska and CA?
Lives in
CALives in
Lives in
Complex Data
Graphs are built for complex or highly networked/connected data.
RDBMSs are designed for ordered data or data that can be easily structured
Apache TinkerPop™ is a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP).
graph.traversal().V().has('name','Alaa').outE('posts').values('comment')
Gremlin is a graph traversal language and virtual machine developed by Apache TinkerPop of the Apache Software Foundation.
Gremlin is based on Groovy and uses the same syntax.
Deliver a highly available, fault-tolerant graph database service that application developers can use to store, query, and visualize data points, their connections, and properties.
Service-First Approach to Development
* We, the devOps team, own all the pieces that are critical to our success: dev, qa, ops, advocacy, support…etc.
* Team is focused on operational integrity
* Deploying code in the cloud does not make a service
Multiple users operating inside a single environment
Conquering hurdles:* Cassandra keyspace* Groovy sandbox: blacklists, “def”* Forced indexing* noisy neighbor* OSS support
#Multi-tenancy
#DevOpsEngine/APIs/UI dev
Contribute to open Source Pipeline Logging
Monitoring Security
Compliance Alerts - 24/7 on call - PagerDuty
Support tickets did someone say all VMs need to restart and we don’t know
schedule for each?? HW - desk failure
BareMetal backup battery is busted Cloud Platform is not stable today, or it the service?