for the social graph - · center for research in intelligent storage 1. complex distributed write...
TRANSCRIPT
![Page 1: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/1.jpg)
Center for Research in Intelligent Storage
Authors: Nathan Bronson, et. al
Presentors: Zhichao Cao, Fenggang Wu
10/01/2018
TAO: Facebook’s Distributed Data Store
for the Social Graph
![Page 2: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/2.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 3: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/3.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 4: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/4.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 5: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/5.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 6: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/6.jpg)
Center for Research in Intelligent Storage
Stage 1: Relational Database Supported
MySQL
WebServers
……
1. Low write latency
2. High read latency, difficult
to achieve load balancing
3. Low scalability
![Page 7: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/7.jpg)
Center for Research in Intelligent Storage
Stage 2: Cache Supported (memcache)
Source: https://www.usenix.org/node/174510
![Page 8: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/8.jpg)
Center for Research in Intelligent Storage
Stage 2: Cache Supported (cache in between)
MySQL
WebServers
……
1. Higher write latency (clean
the cache)
2. Lower read latency
3. Higher scalability
Cache Layer
![Page 9: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/9.jpg)
Center for Research in Intelligent Storage
Stage 3: TAO (The Associations and Objects)
Source: https://www.usenix.org/node/174510
![Page 10: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/10.jpg)
Center for Research in Intelligent Storage
TAO Design 1: One Tier with Sharding
Source: https://www.usenix.org/node/174510
![Page 11: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/11.jpg)
Center for Research in Intelligent Storage
TAO Design 2: Multiple Caching Tiers
Source: https://www.usenix.org/node/174510
![Page 12: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/12.jpg)
Center for Research in Intelligent Storage
1. Complex distributed write
logic to handle to ensure
data correctness,
consistency
2. A lot of concurrent read
or write go to the
MySQL server, for the
same data
3. Make tradeoffs between
limiting the maximum
tier size and scaling the
cache
Source: https://www.usenix.org/node/174510
![Page 13: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/13.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 14: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/14.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 15: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/15.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 16: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/16.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 17: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/17.jpg)
Center for Research in Intelligent Storage
Master regionSlave region 1
Slave region 2
Master DBSlave DB
Slave DB
How to Scale out
![Page 18: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/18.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 19: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/19.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 20: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/20.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 21: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/21.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 22: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/22.jpg)
Center for Research in Intelligent Storage
MySQL Architecture
Source: https://techsoftcompute.wordpress.com/2013/08/19/mysql-internal-architecture/
![Page 23: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/23.jpg)
Center for Research in Intelligent Storage Source: http://data-base-recovery.blogspot.com/2015/10/storage-engines-in-mysql.html
![Page 24: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/24.jpg)
Center for Research in Intelligent Storage Source: https://www.slideshare.net/morgo/inno-db-presentation
![Page 25: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/25.jpg)
Center for Research in Intelligent Storage
![Page 26: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/26.jpg)
Center for Research in Intelligent Storage
RocksDB
MySQL Architecture in FaceBook
Source: https://techsoftcompute.wordpress.com/2013/08/19/mysql-internal-architecture/
![Page 27: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/27.jpg)
Center for Research in Intelligent Storage Source: https://www.slideshare.net/meeeejin/rocksdb-detail
![Page 28: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/28.jpg)
Center for Research in Intelligent Storage Source: https://www.usenix.org/node/174510
![Page 29: for the Social Graph - · Center for Research in Intelligent Storage 1. Complex distributed write logic to handle to ensure data correctness, consistency 2.A lot of concurrent read](https://reader030.vdocument.in/reader030/viewer/2022041015/5ec6c151e7e47a60df11c4a6/html5/thumbnails/29.jpg)
Center for Research in Intelligent Storage
Thanks
Q/A