lessons in moving from physical hosts to mesos

16
Lessons in moving from physical hosts to Mesos Raj Shekhar, Senior Site Reliability Engineer @ilunate ch

Upload: raj-shekhar

Post on 13-Apr-2017

195 views

Category:

Engineering


2 download

TRANSCRIPT

Page 1: Lessons in moving from physical hosts to mesos

Lessons in moving from physical hosts to Mesos

Raj Shekhar, Senior Site Reliability Engineer

@ilunatech

Page 2: Lessons in moving from physical hosts to mesos

Mesos

WHATWHYHOW

NOW WHAT

Page 3: Lessons in moving from physical hosts to mesos

How most Ops teams run clusters today

Page 4: Lessons in moving from physical hosts to mesos

Static partitioning has problemsUnequal load distribution on machinesSlower to add capacityNot fault tolerant

Page 5: Lessons in moving from physical hosts to mesos

Is there a better way?Do we want machines or do we want resources?

Page 6: Lessons in moving from physical hosts to mesos

MesosResource manager - the datacenter is one big poolCan run multi-tenant workloadsFailure detectionServices are isolated from one another

Page 7: Lessons in moving from physical hosts to mesos

Why Mesos - Better resource utilizationRun multi-tenant workload on machines

Dynamic partitioning - no dedicated machines for tasks

Less resource hungry than virtual machines

Page 8: Lessons in moving from physical hosts to mesos

Why Mesos - all the other good things

Fault tolerant - automatically restart failed jobs

Elasticity - grow and shrink on demand

Faster deploys

Page 9: Lessons in moving from physical hosts to mesos

T.co - URL shortening

http://example.com/example http://t.co/examp

Page 10: Lessons in moving from physical hosts to mesos

How

Package Deploy Test Go Live!

Page 11: Lessons in moving from physical hosts to mesos

Life after Go LiveLowered operating expenseFewer routine operational tasksFaster deploys

Page 12: Lessons in moving from physical hosts to mesos

Job throttling

Sudden spikes in latencies

What we learned

cgroups and cpu quotas

Page 13: Lessons in moving from physical hosts to mesos

Capacity planning

Max traffic of the cluster was lower than our expectationWhat we learned

Different CPU variants have different throughput

Page 14: Lessons in moving from physical hosts to mesos

Rethink service discovery

Services get hosts and ports assigned dynamically

What we learned

Use static proxies to forward connections

Page 15: Lessons in moving from physical hosts to mesos

No perfect isolation

Sudden spike in latency

What we learned

Async ops where possible, noisy neighbours still affect us

Page 16: Lessons in moving from physical hosts to mesos

Questions?

[email protected]

@ilunatech