Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4....

87
1 Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput and Heavy-traffic Delay Optimality Xingyu Zhou

Transcript of Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4....

Page 1: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

1

Flexible Load Balancing with Multi-dimensionalState-space Collapse: Throughput and

Heavy-traffic Delay Optimality

Xingyu Zhou

Page 2: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

2

Joint work with...

Jian Tan, OSU Ness Shroff, OSU

Page 3: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

3

I Discrete-time system, i.e, time-slotted.

I Arrival rate at each time slot is λΣ, arbitrary distribution .

I Service rate at each server k is µk , arbitrary distribution.

I Arrival and service are independent.

Page 4: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

3

I Discrete-time system, i.e, time-slotted.

I Arrival rate at each time slot is λΣ, arbitrary distribution .

I Service rate at each server k is µk , arbitrary distribution.

I Arrival and service are independent.

Page 5: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

3

I Discrete-time system, i.e, time-slotted.

I Arrival rate at each time slot is λΣ, arbitrary distribution 1 .

I Service rate at each server k is µk , arbitrary distribution.

I Arrival and service are independent.

1with exponential decay tail

Page 6: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

3

I Discrete-time system, i.e, time-slotted.

I Arrival rate at each time slot is λΣ, arbitrary distribution 1 .

I Service rate at each server k is µk , arbitrary distribution.

I Arrival and service are independent.

1with exponential decay tail

Page 7: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

3

I Discrete-time system, i.e, time-slotted.

I Arrival rate at each time slot is λΣ, arbitrary distribution 1 .

I Service rate at each server k is µk , arbitrary distribution.

I Arrival and service are independent.

1with exponential decay tail

Page 8: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

4

The goal of load balancing:

choose the right server(s) for each request.

What does right mean?

Page 9: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

4

The goal of load balancing:

choose the right server(s) for each request.

What does right mean?

Page 10: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

5

Throughput Optimality

DefinitionIt can stabilize the system for any arrival rate in capacity region, i.e, forany ε > 0 where ε =

∑µn − λΣ.

Page 11: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

6

Heavy-traffic Delay Optimality

DefinitionIt can achieve the lower bound on delay when ε→ 0, that is,limε↓0 εE [

∑Qn] = limε↓0 εE [q] (since the queue length is order O(1/ε))

Fact: E [∑

Qn] ≥ E [q], since packet remains in the queue until finished.

Page 12: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

6

Heavy-traffic Delay Optimality

DefinitionIt can achieve the lower bound on delay when ε→ 0, that is,limε↓0 εE [

∑Qn] = limε↓0 εE [q] (since the queue length is order O(1/ε))

Fact: E [∑

Qn] ≥ E [q], since packet remains in the queue until finished.

Page 13: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

7

Some ‘optimal’ policies...

I Join-Shortest-Queue (JSQ): Sample all the queue lengths, jointhe shortest one. [Foschini and Salz’78], [Eryilmaz and Srikant’12]

I Power-of-d choices (Pod): Randomly sample d queues, join theshortest one. [Chen and Ye’12], [Maguluri, et al’14]

I A general class of optimal policies: Any policy that statisticallyprefers shorter queues is heavy-traffic optimal. [Zhou, et al’18]

All of them share one thing in common: state-space collapse to the line.

All the queue lengths are nearly equal in heavy traffic.

Page 14: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

7

Some ‘optimal’ policies...I Join-Shortest-Queue (JSQ): Sample all the queue lengths, join

the shortest one. [Foschini and Salz’78], [Eryilmaz and Srikant’12]

I Power-of-d choices (Pod): Randomly sample d queues, join theshortest one. [Chen and Ye’12], [Maguluri, et al’14]

I A general class of optimal policies: Any policy that statisticallyprefers shorter queues is heavy-traffic optimal. [Zhou, et al’18]

All of them share one thing in common: state-space collapse to the line.

All the queue lengths are nearly equal in heavy traffic.

Page 15: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

7

Some ‘optimal’ policies...I Join-Shortest-Queue (JSQ): Sample all the queue lengths, join

the shortest one. [Foschini and Salz’78], [Eryilmaz and Srikant’12]

I Power-of-d choices (Pod): Randomly sample d queues, join theshortest one. [Chen and Ye’12], [Maguluri, et al’14]

I A general class of optimal policies: Any policy that statisticallyprefers shorter queues is heavy-traffic optimal. [Zhou, et al’18]

All of them share one thing in common: state-space collapse to the line.

All the queue lengths are nearly equal in heavy traffic.

Page 16: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

7

Some ‘optimal’ policies...I Join-Shortest-Queue (JSQ): Sample all the queue lengths, join

the shortest one. [Foschini and Salz’78], [Eryilmaz and Srikant’12]

I Power-of-d choices (Pod): Randomly sample d queues, join theshortest one. [Chen and Ye’12], [Maguluri, et al’14]

I A general class of optimal policies: Any policy that statisticallyprefers shorter queues is heavy-traffic optimal. [Zhou, et al’18]

All of them share one thing in common: state-space collapse to the line.

All the queue lengths are nearly equal in heavy traffic.

Page 17: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

7

Some ‘optimal’ policies...I Join-Shortest-Queue (JSQ): Sample all the queue lengths, join

the shortest one. [Foschini and Salz’78], [Eryilmaz and Srikant’12]

I Power-of-d choices (Pod): Randomly sample d queues, join theshortest one. [Chen and Ye’12], [Maguluri, et al’14]

I A general class of optimal policies: Any policy that statisticallyprefers shorter queues is heavy-traffic optimal. [Zhou, et al’18]

All of them share one thing in common: state-space collapse to the line.

All the queue lengths are nearly equal in heavy traffic.

Page 18: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

8

Warm-up...

Is it possible to achieve delay optimality in heavy traffic with thefollowing state-space collapse?

(A). Yes (B). No

The answer is Yes!

Page 19: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

8

Warm-up...

Is it possible to achieve delay optimality in heavy traffic with thefollowing state-space collapse?

(A). Yes (B). No

The answer is Yes!

Page 20: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

9

Part I: From single to multi-dimension state-space collapse.

Page 21: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

10

Multi-dimensional cone...

I Consider the following finitely generated cone:

Kα =

x ∈ RN : x =

∑n∈N

wnb(n),wn ≥ 0 for all n ∈ N, (1)

where b(n) is an N-dimensional vector with the nth component being1 and α everywhere, α ∈ [0, 1].

Page 22: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

10

Multi-dimensional cone...

I Consider the following finitely generated cone:

Kα =

x ∈ RN : x =

∑n∈N

wnb(n),wn ≥ 0 for all n ∈ N, (1)

where b(n) is an N-dimensional vector with the nth component being1 and α everywhere, α ∈ [0, 1].

I Example: b(1) = (1, 0.5) and b(2) = (0.5, 1)

Q1 = Q2

Q1

Q2

Page 23: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

10

Multi-dimensional cone...

I Consider the following finitely generated cone:

Kα =

x ∈ RN : x =

∑n∈N

wnb(n),wn ≥ 0 for all n ∈ N, (1)

where b(n) is an N-dimensional vector with the nth component being1 and α everywhere, α ∈ [0, 1].

I Example: b(1) = (1, 0.1) and b(2) = (0.1, 1)

Q1 = Q2

Q1

Q2

Page 24: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

10

Multi-dimensional cone...I Consider the following finitely generated cone:

Kα =

x ∈ RN : x =

∑n∈N

wnb(n),wn ≥ 0 for all n ∈ N, (1)

where b(n) is an N-dimensional vector with the nth component being1 and α everywhere, α ∈ [0, 1].

I Example: b(1) = (1, 0.1) and b(2) = (0.1, 1)

Q1 = Q2

Q1

Q2

Smaller α, bigger cone.

Page 25: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

11

State-space collapse to the cone...

I We can decompose the queue length vector as follows.

Q = Q‖ + Q⊥,

as shown in

Q1

Q2

Q

K↵

Q?

Qk

Q1

Q2

Q

K↵

Q?

Qk

Page 26: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

12

State-space collapse to the cone...DefinitionLet Q be the steady-state, we say state-space collapses to Kα if

E[∥∥∥Q

(ε)

∥∥∥r] ≤ Mr (2)

for all ε ∈ (0, ε0), ε0 > 0 and for each r = 1, 2, · · · , Mr are constants thatare independent of ε. (recall that ε is the heavy-traffic parameter.)

Q1

Q2

Q

K↵

Q?

Qk

Q1

Q2

Q

K↵

Q?

Qk

Page 27: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

13

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)

Given a throughput optimal load balancing policy, if there exists anα ∈ (0, 1] such that the state-space collapses to the cone Kα

,

then thispolicy is heavy-traffic delay optimal in steady-state.

Page 28: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

13

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)Given a throughput optimal load balancing policy,

if there exists anα ∈ (0, 1] such that the state-space collapses to the cone Kα

,

then thispolicy is heavy-traffic delay optimal in steady-state.

Page 29: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

13

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)Given a throughput optimal load balancing policy, if there exists anα ∈ (0, 1] such that the state-space collapses to the cone Kα,

then thispolicy is heavy-traffic delay optimal in steady-state.

Page 30: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

13

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)Given a throughput optimal load balancing policy, if there exists anα ∈ (0, 1] such that the state-space collapses to the cone Kα, then thispolicy is heavy-traffic delay optimal in steady-state.

Page 31: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

14

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)

Key implications:

I If α = 1, the cone Kα reduces to previous single dimensional line.

I Delay optimality in heavy traffic does not require queue lengthsbeing equal.

I The actual state-space collapse region R could even be non-convex.

Q1

Q2

K↵

Q1

Q2

Q

K↵

Q?

Qk

R

Page 32: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

14

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)

Key implications:

I If α = 1, the cone Kα reduces to previous single dimensional line.

I Delay optimality in heavy traffic does not require queue lengthsbeing equal.

I The actual state-space collapse region R could even be non-convex.

Q1

Q2

K↵

Q1

Q2

Q

K↵

Q?

Qk

R

Page 33: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

14

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)

Key implications:

I If α = 1, the cone Kα reduces to previous single dimensional line.

I Delay optimality in heavy traffic does not require queue lengthsbeing equal.

I The actual state-space collapse region R could even be non-convex.

Q1

Q2

K↵

Q1

Q2

Q

K↵

Q?

Qk

R

Page 34: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

14

Main Result...

Theorem (Stability + Collapse to cone =⇒ Optimality)

Key implications:

I If α = 1, the cone Kα reduces to previous single dimensional line.

I Delay optimality in heavy traffic does not require queue lengthsbeing equal.

I The actual state-space collapse region R could even be non-convex.

Q1

Q2

K↵

Q1

Q2

Q

K↵

Q?

Qk

R

Page 35: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

15

Umm...it seems a little counter-intuitive, any intuitions?

Page 36: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

16

The ‘King’ equation...

The sufficient and necessary condition for HT-optimality:

limε↓0

E[∥∥Q

(ε)(t + 1)

∥∥1

∥∥U(ε)

(t)∥∥

1

]= 0.

where the unused service vector U(t) = maxS(t)−Q(t)− A(t), 0.

I Note that Qn(t + 1)Un(t) = 0 for all n and t.

IMPLICATIONS: No server is idle while others with high loads.

Page 37: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

16

The ‘King’ equation...

The sufficient and necessary condition for HT-optimality:

limε↓0

E[∥∥Q

(ε)(t + 1)

∥∥1

∥∥U(ε)

(t)∥∥

1

]= 0.

where the unused service vector U(t) = maxS(t)−Q(t)− A(t), 0.I Note that Qn(t + 1)Un(t) = 0 for all n and t.

IMPLICATIONS: No server is idle while others with high loads.

Page 38: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

16

The ‘King’ equation...

The sufficient and necessary condition for HT-optimality:

limε↓0

E[∥∥Q

(ε)(t + 1)

∥∥1

∥∥U(ε)

(t)∥∥

1

]= 0.

where the unused service vector U(t) = maxS(t)−Q(t)− A(t), 0.I Note that Qn(t + 1)Un(t) = 0 for all n and t.

IMPLICATIONS: No server is idle while others with high loads.

Page 39: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

16

The ‘King’ equation...

The sufficient and necessary condition for HT-optimality:

limε↓0

E[∥∥Q

(ε)(t + 1)

∥∥1

∥∥U(ε)

(t)∥∥

1

]= 0.

where the unused service vector U(t) = maxS(t)−Q(t)− A(t), 0.I Note that Qn(t + 1)Un(t) = 0 for all n and t.

IMPLICATIONS: No server is idle while others with high loads.

“Probability theory is nothing but common sense reduced to calculation.”

— Pierre Laplace

Page 40: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

16

The ‘King’ equation...

The sufficient and necessary condition for HT-optimality:

limε↓0

E[∥∥Q

(ε)(t + 1)

∥∥1

∥∥U(ε)

(t)∥∥

1

]= 0.

where the unused service vector U(t) = maxS(t)−Q(t)− A(t), 0.I Note that Qn(t + 1)Un(t) = 0 for all n and t.

IMPLICATIONS: No server is idle while others with high loads.

Q1 = Q2

Q1

Q2Q1 = Q2

Q1

Q2

(1/) (1/)

(1/)

(1/)

Page 41: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

17

The problem with ‘ice-cream’ cone...Consider the following cone given by

Kθ =

x ∈ RN :‖x(1)

‖ ‖‖x‖ ≥ cos(θ)

,

where x(1)‖ is the projection of x onto the line 1 = (1, 1, . . . , 1).

Page 42: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

18

The problem with ‘ice-cream’ cone...

Consider the following cone given by

Kθ =

x ∈ RN :‖x(1)

‖ ‖‖x‖ ≥ cos(θ)

,

where x(1)‖ is the projection of x onto the line 1 = (1, 1, . . . , 1).

Requirements: avoid one queue is empty while others are not.

I To exclude points on axes, e.g., (1,0,0), θ < arccos(1/√

3).

I To exclude points such as (1,1,0), θ < arccos(√

2/√

3).

I In general, θ < arccos(√N − 1/

√N), which reduces to

1 = (1, 1, . . . , 1) for large N.

Page 43: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

18

The problem with ‘ice-cream’ cone...

Consider the following cone given by

Kθ =

x ∈ RN :‖x(1)

‖ ‖‖x‖ ≥ cos(θ)

,

where x(1)‖ is the projection of x onto the line 1 = (1, 1, . . . , 1).

Requirements: avoid one queue is empty while others are not.

I To exclude points on axes, e.g., (1,0,0), θ < arccos(1/√

3).

I To exclude points such as (1,1,0), θ < arccos(√

2/√

3).

I In general, θ < arccos(√N − 1/

√N), which reduces to

1 = (1, 1, . . . , 1) for large N.

Page 44: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

18

The problem with ‘ice-cream’ cone...

Consider the following cone given by

Kθ =

x ∈ RN :‖x(1)

‖ ‖‖x‖ ≥ cos(θ)

,

where x(1)‖ is the projection of x onto the line 1 = (1, 1, . . . , 1).

Requirements: avoid one queue is empty while others are not.

I To exclude points on axes, e.g., (1,0,0), θ < arccos(1/√

3).

I To exclude points such as (1,1,0), θ < arccos(√

2/√

3).

I In general, θ < arccos(√N − 1/

√N), which reduces to

1 = (1, 1, . . . , 1) for large N.

Page 45: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

18

The problem with ‘ice-cream’ cone...

Consider the following cone given by

Kθ =

x ∈ RN :‖x(1)

‖ ‖‖x‖ ≥ cos(θ)

,

where x(1)‖ is the projection of x onto the line 1 = (1, 1, . . . , 1).

Requirements: avoid one queue is empty while others are not.

I To exclude points on axes, e.g., (1,0,0), θ < arccos(1/√

3).

I To exclude points such as (1,1,0), θ < arccos(√

2/√

3).

I In general, θ < arccos(√N − 1/

√N), which reduces to

1 = (1, 1, . . . , 1) for large N.

Page 46: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

19

Umm...wait, how can we achieve this type of collapse?

Page 47: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

20

Part II: Flexible load balancing

Page 48: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

21

A general view...

The nth component of dispatching distribution P(t) is the probabilityof dispatching arrival to the nth shortest queue.

I let σt(·) be the permutation of queues in increasing order.

I Pn(t) is then the probability for dispatching to the server σt(n).

We also define dispatching preference

∆(t) , P(t)− Prand(t)

where Prand(t) is the dispatching distribution under random routing.

I homogeneous servers: the nth component of Prand(t) is 1/N.

I heterogeneous servers: the nth component of Prand(t) is µσt(n)/µΣ.

Page 49: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

21

A general view...

The nth component of dispatching distribution P(t) is the probabilityof dispatching arrival to the nth shortest queue.

I let σt(·) be the permutation of queues in increasing order.

I Pn(t) is then the probability for dispatching to the server σt(n).

We also define dispatching preference

∆(t) , P(t)− Prand(t)

where Prand(t) is the dispatching distribution under random routing.

I homogeneous servers: the nth component of Prand(t) is 1/N.

I heterogeneous servers: the nth component of Prand(t) is µσt(n)/µΣ.

Page 50: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

21

A general view...

The nth component of dispatching distribution P(t) is the probabilityof dispatching arrival to the nth shortest queue.

I let σt(·) be the permutation of queues in increasing order.

I Pn(t) is then the probability for dispatching to the server σt(n).

We also define dispatching preference

∆(t) , P(t)− Prand(t)

where Prand(t) is the dispatching distribution under random routing.

I homogeneous servers: the nth component of Prand(t) is 1/N.

I heterogeneous servers: the nth component of Prand(t) is µσt(n)/µΣ.

Page 51: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

22

Examples...

Consider a system with 4 homogeneous servers.

I Random: randomly joins oneI Prand(t) = (1/4, 1/4, 1/4, 1/4)I ∆rand(t) = (0, 0, 0, 0)

I JSQ: always join the shortest oneI PJSQ(t) = (1, 0, 0, 0)I ∆JSQ(t) = (3/4,−1/4,−1/4,−1/4)

I Power of 2: randomly picks two and joins the shorter oneI PPo2(t) = (1/2, 1/3, 1/6, 0)I ∆Po2(t) = (1/4, 1/12,−1/12,−1/4)

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 52: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

22

Examples...

Consider a system with 4 homogeneous servers.

I Random: randomly joins oneI Prand(t) = (1/4, 1/4, 1/4, 1/4)I ∆rand(t) = (0, 0, 0, 0)

I JSQ: always join the shortest oneI PJSQ(t) = (1, 0, 0, 0)I ∆JSQ(t) = (3/4,−1/4,−1/4,−1/4)

I Power of 2: randomly picks two and joins the shorter oneI PPo2(t) = (1/2, 1/3, 1/6, 0)I ∆Po2(t) = (1/4, 1/12,−1/12,−1/4)

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 53: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

22

Examples...

Consider a system with 4 homogeneous servers.

I Random: randomly joins oneI Prand(t) = (1/4, 1/4, 1/4, 1/4)I ∆rand(t) = (0, 0, 0, 0)

I JSQ: always join the shortest oneI PJSQ(t) = (1, 0, 0, 0)I ∆JSQ(t) = (3/4,−1/4,−1/4,−1/4)

I Power of 2: randomly picks two and joins the shorter oneI PPo2(t) = (1/2, 1/3, 1/6, 0)I ∆Po2(t) = (1/4, 1/12,−1/12,−1/4)

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 54: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

22

Examples...

Consider a system with 4 homogeneous servers.

I Random: randomly joins oneI Prand(t) = (1/4, 1/4, 1/4, 1/4)I ∆rand(t) = (0, 0, 0, 0)

I JSQ: always join the shortest oneI PJSQ(t) = (1, 0, 0, 0)I ∆JSQ(t) = (3/4,−1/4,−1/4,−1/4)

I Power of 2: randomly picks two and joins the shorter oneI PPo2(t) = (1/2, 1/3, 1/6, 0)I ∆Po2(t) = (1/4, 1/12,−1/12,−1/4)

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 55: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

22

Examples...

Consider a system with 4 homogeneous servers.

I Random: randomly joins oneI Prand(t) = (1/4, 1/4, 1/4, 1/4)I ∆rand(t) = (0, 0, 0, 0)

I JSQ: always join the shortest oneI PJSQ(t) = (1, 0, 0, 0)I ∆JSQ(t) = (3/4,−1/4,−1/4,−1/4)

I Power of 2: randomly picks two and joins the shorter oneI PPo2(t) = (1/2, 1/3, 1/6, 0)I ∆Po2(t) = (1/4, 1/12,−1/12,−1/4)

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 56: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

22

Examples...

Consider a system with 4 homogeneous servers.

I Random: randomly joins oneI Prand(t) = (1/4, 1/4, 1/4, 1/4)I ∆rand(t) = (0, 0, 0, 0)

I JSQ: always join the shortest oneI PJSQ(t) = (1, 0, 0, 0)I ∆JSQ(t) = (3/4,−1/4,−1/4,−1/4)

I Power of 2: randomly picks two and joins the shorter oneI PPo2(t) = (1/2, 1/3, 1/6, 0)I ∆Po2(t) = (1/4, 1/12,−1/12,−1/4)

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 57: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

23

Preference of shorter queues...

∆(t) , P(t)− Prand(t)

DefinitionA P(t) is δ-tilted if, for some 2 ≤ k ≤ N

I ∆n(t) ≥ 0 for all n < k and ∆n(t) ≤ 0 for all n ≥ k

I ∆1(t) ≥ δ, ∆N(t) ≤ −δ

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 58: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

23

Preference of shorter queues...

∆(t) , P(t)− Prand(t)

DefinitionA P(t) is δ-tilted if, for some 2 ≤ k ≤ N

I ∆n(t) ≥ 0 for all n < k and ∆n(t) ≤ 0 for all n ≥ k

I ∆1(t) ≥ δ, ∆N(t) ≤ −δ

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 59: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

23

Preference of shorter queues...

∆(t) , P(t)− Prand(t)

DefinitionA P(t) is δ-tilted if, for some 2 ≤ k ≤ N

I ∆n(t) ≥ 0 for all n < k and ∆n(t) ≤ 0 for all n ≥ k

I ∆1(t) ≥ δ, ∆N(t) ≤ −δ

n

Pn(t)

1 2 3 4

1/4

Rand

n

Pn(t)

1 2 3 4

1/4

JSQ

n

Pn(t)

1 2 3 4

1/4

Po2

n

Pn(t)

1 2 3 4

1/4

Page 60: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

24

But, where is the cone?

Page 61: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

25

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)

Given a load balancing policy, if there exists a cone Kα with α ∈ (0, 1]such that dispatching distribution is δ-tilted for any Q(t) /∈ Kα, then thispolicy is heavy-traffic delay optimal in steady-state.

Page 62: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

25

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)Given a load balancing policy,

if there exists a cone Kα with α ∈ (0, 1]such that dispatching distribution is δ-tilted for any Q(t) /∈ Kα, then thispolicy is heavy-traffic delay optimal in steady-state.

Page 63: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

25

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)Given a load balancing policy, if there exists a cone Kα with α ∈ (0, 1]

such that dispatching distribution is δ-tilted for any Q(t) /∈ Kα, then thispolicy is heavy-traffic delay optimal in steady-state.

Page 64: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

25

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)Given a load balancing policy, if there exists a cone Kα with α ∈ (0, 1]such that dispatching distribution is δ-tilted for any Q(t) /∈ Kα,

then thispolicy is heavy-traffic delay optimal in steady-state.

Page 65: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

25

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)Given a load balancing policy, if there exists a cone Kα with α ∈ (0, 1]such that dispatching distribution is δ-tilted for any Q(t) /∈ Kα, then thispolicy is heavy-traffic delay optimal in steady-state.

Page 66: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

26

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)

Flexibility from two aspects:

1. When Q(t) ∈ Kα, arbitrary dispatching is allowed.

2. Preference of shorter queue is not necessarily decreasing.

Applications:

I Load balancing with constraints of data locality.

I Load balancing with inaccurate queue lengths information.

I Load balancing with cache replacement cost.

I ......

Page 67: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

26

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)

Flexibility from two aspects:

1. When Q(t) ∈ Kα, arbitrary dispatching is allowed.

2. Preference of shorter queue is not necessarily decreasing.

Applications:

I Load balancing with constraints of data locality.

I Load balancing with inaccurate queue lengths information.

I Load balancing with cache replacement cost.

I ......

Page 68: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

26

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)

Flexibility from two aspects:

1. When Q(t) ∈ Kα, arbitrary dispatching is allowed.

2. Preference of shorter queue is not necessarily decreasing.

Applications:

I Load balancing with constraints of data locality.

I Load balancing with inaccurate queue lengths information.

I Load balancing with cache replacement cost.

I ......

Page 69: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

26

Main Result...

Theorem (δ-tilted outside the cone =⇒ Optimality)

Flexibility from two aspects:

1. When Q(t) ∈ Kα, arbitrary dispatching is allowed.

2. Preference of shorter queue is not necessarily decreasing.

Applications:

I Load balancing with constraints of data locality.

I Load balancing with inaccurate queue lengths information.

I Load balancing with cache replacement cost.

I ......

Page 70: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

27

The challenge to prove it...

Recall that:

Theorem (Stability + Collapse to cone =⇒ Optimality)

(a) Stability with bounded moments.

I standard Foster’s theorem is difficult, positive drift in cone.

I can solve it by combining fluid model with drift analysis.

(b) State-space collapses to the cone Kα.

I standard drift-based technique fails in our case.I since a closed-form formula of the projection onto a polyhedral cone

is still an open problem.

I instead, we found that a monotone property of the projection isenough.

Page 71: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

27

The challenge to prove it...

Recall that:

Theorem (Stability + Collapse to cone =⇒ Optimality)

(a) Stability with bounded moments.

I standard Foster’s theorem is difficult, positive drift in cone.

I can solve it by combining fluid model with drift analysis.

(b) State-space collapses to the cone Kα.

I standard drift-based technique fails in our case.I since a closed-form formula of the projection onto a polyhedral cone

is still an open problem.

I instead, we found that a monotone property of the projection isenough.

Page 72: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

27

The challenge to prove it...

Recall that:

Theorem (Stability + Collapse to cone =⇒ Optimality)

(a) Stability with bounded moments.

I standard Foster’s theorem is difficult, positive drift in cone.

I can solve it by combining fluid model with drift analysis.

(b) State-space collapses to the cone Kα.

I standard drift-based technique fails in our case.I since a closed-form formula of the projection onto a polyhedral cone

is still an open problem.

I instead, we found that a monotone property of the projection isenough.

Page 73: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

27

The challenge to prove it...

Recall that:

Theorem (Stability + Collapse to cone =⇒ Optimality)

(a) Stability with bounded moments.

I standard Foster’s theorem is difficult, positive drift in cone.

I can solve it by combining fluid model with drift analysis.

(b) State-space collapses to the cone Kα.

I standard drift-based technique fails in our case.I since a closed-form formula of the projection onto a polyhedral cone

is still an open problem.

I instead, we found that a monotone property of the projection isenough.

Page 74: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

27

The challenge to prove it...

Recall that:

Theorem (Stability + Collapse to cone =⇒ Optimality)

(a) Stability with bounded moments.

I standard Foster’s theorem is difficult, positive drift in cone.

I can solve it by combining fluid model with drift analysis.

(b) State-space collapses to the cone Kα.

I standard drift-based technique fails in our case.I since a closed-form formula of the projection onto a polyhedral cone

is still an open problem.

I instead, we found that a monotone property of the projection isenough.

Page 75: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

27

The challenge to prove it...

Recall that:

Theorem (Stability + Collapse to cone =⇒ Optimality)

(a) Stability with bounded moments.

I standard Foster’s theorem is difficult, positive drift in cone.

I can solve it by combining fluid model with drift analysis.

(b) State-space collapses to the cone Kα.

I standard drift-based technique fails in our case.I since a closed-form formula of the projection onto a polyhedral cone

is still an open problem.

I instead, we found that a monotone property of the projection isenough.

Page 76: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

28

Extensions...

Recall that: two parameters determine the flexibility.

I α determines the cone size, and hence how often prefer shorterqueues. (frequency)

I δ determines how strong shorter queue is preferred. (intensity)

Both of them can scale down to zero with the load to enjoy even greaterflexibility.

PropositionConsider the same policy as before, i.e., δ-tilted outside a cone Kα.Suppose that

α(ε)δ(ε) = Ω(εβ)

for some β ∈ [0, 1), then this policy is heavy-traffic delay optimal.

Page 77: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

28

Extensions...

Recall that: two parameters determine the flexibility.

I α determines the cone size, and hence how often prefer shorterqueues. (frequency)

I δ determines how strong shorter queue is preferred. (intensity)

Both of them can scale down to zero with the load to enjoy even greaterflexibility.

PropositionConsider the same policy as before, i.e., δ-tilted outside a cone Kα.Suppose that

α(ε)δ(ε) = Ω(εβ)

for some β ∈ [0, 1), then this policy is heavy-traffic delay optimal.

Page 78: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

28

Extensions...

Recall that: two parameters determine the flexibility.

I α determines the cone size, and hence how often prefer shorterqueues. (frequency)

I δ determines how strong shorter queue is preferred. (intensity)

Both of them can scale down to zero with the load to enjoy even greaterflexibility.

PropositionConsider the same policy as before, i.e., δ-tilted outside a cone Kα.Suppose that

α(ε)δ(ε) = Ω(εβ)

for some β ∈ [0, 1), then this policy is heavy-traffic delay optimal.

Page 79: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

29

Geometric intuition...

PropositionConsider the same policy as before, i.e., δ-tilted outside a cone Kα.Suppose that

α(ε)δ(ε) = Ω(εβ)

for some β ∈ [0, 1), then this policy is heavy-traffic delay optimal.

Page 80: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

29

Geometric intuition...

PropositionConsider the same policy as before, i.e., δ-tilted outside a cone Kα.Suppose that

α(ε)δ(ε) = Ω(εβ)

for some β ∈ [0, 1), then this policy is heavy-traffic delay optimal.

Q1 = Q2

Q1

Q2

(1/)

m

O(1/())

K↵()

Q1 = Q2

Q1

Q2

(1/)

m

d0

O(1/())

(↵()/)

K↵()

Page 81: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

29

Geometric intuition...

PropositionConsider the same policy as before, i.e., δ-tilted outside a cone Kα.Suppose that

α(ε)δ(ε) = Ω(εβ)

for some β ∈ [0, 1), then this policy is heavy-traffic delay optimal.

Q1 = Q2

Q1

Q2

(1/)

m

O(1/())

K↵()

Q1 = Q2

Q1

Q2

(1/)

m

d0

O(1/())

(↵()/)

K↵()

Page 82: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

29

Geometric intuition...PropositionConsider the same policy as before, i.e., δ-tilted outside a cone Kα.Suppose that

α(ε)δ(ε) = Ω(εβ)

for some β ∈ [0, 1), then this policy is heavy-traffic delay optimal.

Q1 = Q2

Q1

Q2Q1 = Q2

Q1

Q2

(1/)

m

d0

d

O(1/())

(↵()/)

K↵()

limε→0d′

m =∞ far away from boundary

Page 83: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

30

What if... the collapse region cannot be covered by a cone?

Q1

Q2

Our new paper addresses it, to appear in Sigmetrics/Performance 2019.

“Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems:Necessary and Sufficient Conditions”

Page 84: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

30

What if... the collapse region cannot be covered by a cone?

Q1

Q2

Our new paper addresses it, to appear in Sigmetrics/Performance 2019.

“Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems:Necessary and Sufficient Conditions”

Page 85: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

30

What if... the collapse region cannot be covered by a cone?

Q1

Q2

Our new paper addresses it, to appear in Sigmetrics/Performance 2019.

“Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems:Necessary and Sufficient Conditions”

Page 86: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

31

Conclusion...

Theorem (Stability + Collapse to cone =⇒ Optimality)

I We show a multi-dimensional state-space can still guarantee delayoptimality.

I The key is no sever is idle while others with high loads.

Theorem (δ-tilted outside the cone =⇒ Optimality)

I Flexibility comes from two aspects: frequency (α) and intensity (δ).

I The methods to prove the result have the potential in general case.

Page 87: Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput … · 2020. 4. 28. · Given a throughput optimal load balancing policy, if there exists an 2(0;1]

32

Thank you!