Expediting the transport of Data Center Flows (DAQ: Deadline‐Aware Queue)

Roberto Rojas‐Cessa
Networking Research Laboratory, ECE Dept.
New Jersey Institute of Technology
Newark, NJ 07102
[email protected]

Research Day ‐ CWCSPR





Examples of DC topologies

Topologies shown: BCube, Fat‐Tree, VL2, FiConn


What is unique in Data Center Traffic? The Partition‐Aggregate Model

Simplification of traffic load.


Data Aggregation

• Flows may be associated with response deadlines.

• Deadlines are inherited by partial processes.

• For all flows, short flow completion times (FCTs) are desirable.

• For deadline‐sensitive flows, high application throughput is desirable.

• Data aggregation → connection‐oriented transport → Transmission Control Protocol (TCP)


Expected requirements of a Data Center (DC) Transport Protocol

• Maximize the number of flows completing transmission before their deadlines.
• Guarantee a high throughput for long flows.
• Allow high, if not 100%, link utilization.
• Achieve lossless transmissions.
• Minimize the amount of state information at switches.


Why is TCP not good enough?

Incast throughput collapse: retransmission timeouts (RTOs) + retransmissions choke bandwidth

Flow control mechanisms are not transmission‐speed aware → long FCTs!

• Data center flows: long + short flows

• Congestion: multiple flows converge at aggregation switches

• Lack of a centralized scheduler


Existing Solutions

• Explicit Congestion Notification (ECN): DCTCP
• Rate Control: D2TCP, D3, PDQ (deadline aware)
• Congestion Control: RCP
• Pacing Schemes: HULL
• Load Balancing Schemes: DeTail, CONGA, RepFlow
• Switch Modification: DAQ


Deadline‐Applicable Schemes

• RCP [Dukkipati05] assigns rate according to available bandwidth. Parameters must be tuned.

• DCTCP [Alizadeh10]:  ECN + congestion window modification. Agnostic to deadlines.

• D3 [Wilson11]: reserves transmission rates on a first‐come, first‐served (FCFS) basis.

• PDQ [Hong12]: selects flows by earliest deadline first (EDF) and shortest job first (SJF). High complexity.
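
The EDF and SJF selection mentioned for PDQ can be illustrated with a small ordering sketch. This is not PDQ's implementation; the Flow fields, the criticality_order helper, and the choice to use SJF only as a tie‐breaker among equal deadlines are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Flow:
    # Hypothetical flow descriptor; field names are illustrative only.
    name: str
    deadline_ms: float      # absolute deadline of the flow
    remaining_bytes: int    # data still to be transmitted


def criticality_order(flows):
    """Order flows earliest deadline first (EDF).

    Flows with equal deadlines are ordered shortest job first (SJF),
    i.e. by the fewest remaining bytes.
    """
    return sorted(flows, key=lambda f: (f.deadline_ms, f.remaining_bytes))


flows = [
    Flow("a", deadline_ms=40.0, remaining_bytes=10_000),
    Flow("b", deadline_ms=20.0, remaining_bytes=30_000),
    Flow("c", deadline_ms=20.0, remaining_bytes=5_000),
]
order = [f.name for f in criticality_order(flows)]
print(order)  # ['c', 'b', 'a']
```

Flows b and c share the earliest deadline, so the shorter flow c goes first. Maintaining such an ordering per flow is part of what requires flow state at the switches.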


Proposed Scheme: Deadline Aware Queue (DAQ) at DC Switches

• Objectives:
– Maximize application throughput
– Ensure minimum bandwidth for long flows
– Minimize flow‐state information at switches
– Minimize modification to layered protocols


Switch Architecture

• Use three queues: Urgent, Non‐urgent, Long
• Short flows: Urgent or Non‐urgent queue
• Long flows: long‐flow queue + weighted service scheduling
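
A minimal sketch of how one output port with these three queues might behave is given below. The slides do not give the classification rule or the weights, so the following are assumptions: a short‐flow packet counts as urgent when its remaining time to deadline is at or below an urgent threshold, urgent packets are served before non‐urgent ones, and long flows get long_weight out of every short_weight + long_weight service slots. The class and parameter names are hypothetical.

```python
from collections import deque


class DaqPortSketch:
    """Illustrative three-queue output port (not the authors' design)."""

    def __init__(self, urgent_threshold_ms, short_weight=3, long_weight=1):
        self.urgent = deque()
        self.non_urgent = deque()
        self.long = deque()
        self.urgent_threshold_ms = urgent_threshold_ms  # assumed rule
        self.short_weight = short_weight                # assumed weights
        self.long_weight = long_weight
        self._slot = 0

    def enqueue(self, packet, now_ms, deadline_ms=None, is_long=False):
        # Long flows go to their own queue; short flows are split by
        # how close they are to their deadline.
        if is_long:
            self.long.append(packet)
        elif (deadline_ms is not None
              and deadline_ms - now_ms <= self.urgent_threshold_ms):
            self.urgent.append(packet)
        else:
            self.non_urgent.append(packet)

    def _pop_short(self):
        # Urgent packets always precede non-urgent ones.
        if self.urgent:
            return self.urgent.popleft()
        if self.non_urgent:
            return self.non_urgent.popleft()
        return None

    def _pop_long(self):
        return self.long.popleft() if self.long else None

    def dequeue(self):
        # Weighted round-robin between short and long traffic; an idle
        # slot is handed to the other class (work-conserving).
        cycle = self.short_weight + self.long_weight
        long_turn = (self._slot % cycle) >= self.short_weight
        self._slot += 1
        if long_turn:
            packet = self._pop_long()
            return packet if packet is not None else self._pop_short()
        packet = self._pop_short()
        return packet if packet is not None else self._pop_long()


port = DaqPortSketch(urgent_threshold_ms=5)
port.enqueue("n1", now_ms=0, deadline_ms=30)
port.enqueue("u1", now_ms=0, deadline_ms=4)
port.enqueue("L1", now_ms=0, is_long=True)
port.enqueue("L2", now_ms=0, is_long=True)
port.enqueue("n2", now_ms=0, deadline_ms=40)
port.enqueue("n3", now_ms=0, deadline_ms=50)
served = [port.dequeue() for _ in range(6)]
print(served)  # ['u1', 'n1', 'n2', 'L1', 'n3', 'L2']
```

The urgent packet u1 overtakes n1 even though it arrived later, and the long flow still receives one slot in four, which is the minimum‐bandwidth guarantee in miniature. The port keeps only three queues and a slot counter, with no per‐flow state.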


Test setup

• Loss‐less flow control between
– Senders and switch
– Switch and receiver (aggregator)

• Large congestion window size instead of slow start


Comparison: RCP and D3


Impact of Urgent Threshold Value

Flow size: 30 KB, rate: 3600 flows/s
Number of long flows: 5

Application throughput = number of on‐time flows / number of all arrived flows
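
The application throughput metric defined above can be computed as in the following sketch. The input format, a list of (completion time, deadline) pairs with None for flows that never finished, and the function name are assumptions for illustration; the sample values are made up and are not results from the talk.

```python
def application_throughput(flows):
    """Return the fraction of arrived flows that finished on time.

    `flows` is an iterable of (completion_ms, deadline_ms) pairs.
    A completion time of None marks a flow that never finished.
    """
    flows = list(flows)
    if not flows:
        raise ValueError("at least one arrived flow is required")
    on_time = sum(
        1
        for completion_ms, deadline_ms in flows
        if completion_ms is not None and completion_ms <= deadline_ms
    )
    return on_time / len(flows)


# Made-up sample: two of the four flows meet their deadlines.
sample = [(12.0, 20.0), (25.0, 20.0), (None, 40.0), (39.5, 40.0)]
ratio = application_throughput(sample)
print(ratio)  # 0.5
```

Note that the denominator counts every arrived flow, so a flow that is dropped or never completes lowers the metric just like a late one.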


Supported number of senders

Number of concurrent senders supported while achieving 99% application throughput, with a mean flow size of 10 Kbytes and deadlines in [20, 40] ms.


Application and Average Throughput

Short flow size: 15 Kbyte, long flow size: 100 Mbyte (2).
Short flow load: 0.3 %
No. of senders: [5, 40]

Figure panels: short flows, long flows


Performance under short and long flows

Short flow size: 15 KB

Conclusions

• Deadline‐oriented approach with small modification to transport layer.

• Urgent flows receive preferential service.
• Few urgent flows speed up transmission.
• DAQ achieves high application throughput.
• Long flows receive minimum throughput through Weighted Round‐Robin.


Thank you

[email protected]
