Extended Formulations and Information Complexity
Ankur Moitra, Massachusetts Institute of Technology
Dagstuhl, March 2014
The Permutahedron
Let t = [1, 2, 3, …, n] and P = conv{π(t) | π is a permutation}
How many facets does P have? Exponentially many!
e.g. for each S ⊆ [n]: Σ_{i in S} x_i ≥ 1 + 2 + … + |S| = |S|(|S|+1)/2
Let Q = {A | A is doubly stochastic}
Then P is the projection of Q: P = {A t | A in Q}
Yet Q has only O(n^2) facets
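The projection above can be checked numerically. A small Python sketch (the particular random mixture is illustrative): build a doubly stochastic A as a convex combination of permutation matrices, which by Birkhoff-von Neumann are the vertices of Q, and verify that x = A t satisfies the permutahedron's facet inequalities.

```python
import itertools
import random

random.seed(0)
n = 5
t = list(range(1, n + 1))

# A doubly stochastic A as a convex combination of permutation matrices
# (by Birkhoff-von Neumann, these are exactly the vertices of Q).
perms = [random.sample(range(n), n) for _ in range(4)]
weights = [random.random() for _ in perms]
total = sum(weights)
weights = [w / total for w in weights]

A = [[sum(w for w, p in zip(weights, perms) if p[i] == j) for j in range(n)]
     for i in range(n)]

# Project down: x = A t should lie in the permutahedron P.
x = [sum(A[i][j] * t[j] for j in range(n)) for i in range(n)]

# Facet inequalities of P: for every nonempty S, sum_{i in S} x_i >= 1+2+...+|S|,
# and the total sum is fixed at n(n+1)/2.
assert abs(sum(x) - n * (n + 1) / 2) < 1e-9
for r in range(1, n):
    for S in itertools.combinations(range(n), r):
        assert sum(x[i] for i in S) >= r * (r + 1) / 2 - 1e-9
```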
Extended Formulations
The extension complexity xc(P) of a polytope P is the minimum number of facets of a polytope Q such that P = proj(Q)
e.g. xc(P) = Θ(n log n) for the permutahedron
xc(P) = Θ(log n) for a regular n-gon, but Ω(√n) for its perturbation
In general, P = {x | ∃y, (x,y) ∈ Q}
…analogy with quantifiers in Boolean formulae
Applications of EFs
In general, P = {x | ∃y, (x,y) ∈ Q}
Through EFs, we can reduce the # of facets exponentially!
Hence, we can run standard LP solvers instead of the ellipsoid algorithm
EFs often give, or are based on, new combinatorial insights
e.g. the Birkhoff-von Neumann Theorem and the permutahedron
e.g. prove there is a low-cost object, through its polytope
Explicit, Hard Polytopes?
Definition: TSP polytope:
P = conv{1_F | F is the set of edges on a tour of K_n}
(If we could optimize over this polytope, then P = NP)
Can we prove unconditionally that there is no small EF?
Caveat: this is unrelated to proving complexity lower bounds
[Yannakakis '90]: Yes, through the nonnegative rank
An Abridged History
Theorem [Yannakakis '90]: Any symmetric EF for TSP or matching has size 2^Ω(n)
…but asymmetric EFs can be more powerful
…
Theorem [Fiorini et al '12]: Any EF for TSP has size 2^Ω(√n) (based on a 2^Ω(n) lower bound for clique)
Approach: connections to non-deterministic communication complexity
An Abridged History II
Theorem [Braun et al '12]: Any EF that approximates clique within n^(1/2−ε) has size exp(n^ε)
Approach: Razborov's rectangle corruption lemma
Theorem [Braverman, Moitra '13]: Any EF that approximates clique within n^(1−ε) has size exp(n^ε)
Approach: information complexity
see also [Braun, Pokutta '13]: reformulation using common information, applications to the average case
An Abridged History III
Theorem [Chan et al '12]: Any EF that approximates MAXCUT within 2−ε has size n^Ω(log n / log log n)
Approach: reduction to Sherali-Adams
Theorem [Rothvoss '13]: Any EF for perfect matching has size 2^Ω(n) (same for TSP)
Approach: hyperplane separation lower bound
Outline
Part I: Tools for Extended Formulations
• Yannakakis's Factorization Theorem
• The Rectangle Bound
• A Sampling Argument
Part II: Applications
• Correlation Polytope
• Approximating the Correlation Polytope
• Matching Polytope
The Factorization Theorem
How can we prove lower bounds on EFs?
[Yannakakis '90]: Geometric Parameter ⟷ Algebraic Parameter
Definition of the slack matrix…
The Slack Matrix
[Figure: a polytope P with a vertex v_j and a facet ⟨a_i, x⟩ ≤ b_i, next to its slack matrix S(P), rows indexed by faces and columns by vertices]
The entry in row i, column j is how slack the jth vertex is on the ith constraint:
S(P)_{i,j} = b_i − ⟨a_i, v_j⟩
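A concrete instance helps: here is a minimal numpy sketch (the unit square is our own illustrative example) computing the slack matrix from a facet description and a vertex list.

```python
import numpy as np

# Slack matrix of the unit square [0,1]^2: facets <a_i, x> <= b_i as rows,
# vertices v_j as columns, entry S[i, j] = b_i - <a_i, v_j>.
A = np.array([[1, 0], [-1, 0], [0, 1], [0, -1]], dtype=float)  # facet normals
b = np.array([1, 0, 1, 0], dtype=float)                        # right-hand sides
V = np.array([[0, 0], [1, 0], [1, 1], [0, 1]], dtype=float)    # vertices

S = b[:, None] - A @ V.T                          # 4 facets x 4 vertices
assert (S >= 0).all()                             # slack is never negative
assert (np.isclose(S, 0).sum(axis=1) == 2).all()  # each facet holds 2 vertices
```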
Definition of the nonnegative rank…
Nonnegative Rank
S = M_1 + M_2 + … + M_r, where each M_i is rank one and nonnegative
Definition: rank+(S) is the smallest r s.t. S can be written as the sum of r rank one, nonnegative matrices
Note: rank+(S) ≥ rank(S), but it can be much larger too!
The Factorization Theorem
How can we prove lower bounds on EFs?
[Yannakakis '90]: Geometric Parameter ⟷ Algebraic Parameter
xc(P) = rank+(S(P))
Intuition: the factorization gives a change of variables that preserves the slack matrix!
Next we will give a method to lower bound rank+ via information complexity…
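The gap between rank and rank+ already shows up for the square: its slack matrix has rank 3, while the factorization theorem forces rank+ = xc(square) = 4 (the support of this matrix needs 4 rectangles to cover). A small numpy check of the easy directions:

```python
import numpy as np

# Slack matrix of the unit square (4 facets x 4 vertices).
S = np.array([[1, 0, 0, 1],
              [0, 1, 1, 0],
              [1, 1, 0, 0],
              [0, 0, 1, 1]], dtype=float)
assert np.linalg.matrix_rank(S) == 3

# A nonnegative factorization with r = 4 rank-one terms (one per row),
# certifying rank+(S) <= 4; by the factorization theorem rank+(S) = 4,
# strictly above rank(S) = 3.
terms = [np.outer(np.eye(4)[i], S[i]) for i in range(4)]
assert np.allclose(sum(terms), S)
assert all(np.linalg.matrix_rank(M) == 1 for M in terms)
```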
Outline
Part I: Tools for Extended Formulations
• Yannakakis's Factorization Theorem
• The Rectangle Bound
• A Sampling Argument
Part II: Applications
• Correlation Polytope
• Approximating the Correlation Polytope
• Matching Polytope
The Rectangle Bound
S = M_1 + M_2 + … + M_r, where each M_i is rank one and nonnegative
The support of each M_i is a combinatorial rectangle
Hence rank+(S) is at least the # of rectangles needed to cover the support of S
This is exactly the rectangle bound from non-deterministic communication complexity
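The rectangle claim is a one-line fact about outer products; a tiny sketch (with illustrative vectors of our choosing):

```python
import numpy as np

# The support of a rank-one nonnegative matrix M = u v^T is
# supp(u) x supp(v): a combinatorial rectangle.
u = np.array([0.0, 2.0, 0.0, 1.0])
v = np.array([3.0, 0.0, 0.5])
M = np.outer(u, v)

rows = {i for i, ui in enumerate(u) if ui > 0}
cols = {j for j, vj in enumerate(v) if vj > 0}
support = {(i, j) for i in range(4) for j in range(3) if M[i, j] > 0}
assert support == {(i, j) for i in rows for j in cols}
# Hence r rank-one nonnegative terms give r rectangles whose union covers
# supp(S): rank+(S) >= rectangle covering number of supp(S).
```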
Outline
Part I: Tools for Extended Formulations
• Yannakakis's Factorization Theorem
• The Rectangle Bound
• A Sampling Argument
Part II: Applications
• Correlation Polytope
• Approximating the Correlation Polytope
• Matching Polytope
A Sampling Argument
S = M_1 + M_2 + … + M_r
Let T be a set of entries in S that all share the same value
• Choose M_i proportional to its total value on T
• Choose (a,b) in T proportional to its relative value in M_i
This outputs a uniformly random sample from T: each entry is chosen with probability Σ_i M_i(a,b) / (total value on T) = S(a,b) / (total value on T), the same for every entry of T
If r is too small, this procedure uses too little entropy!
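The two-stage procedure is easy to simulate. A sketch (using the square's slack matrix and its row-wise factorization as an illustrative S, with T the entries equal to 1) that checks the output is empirically uniform:

```python
import numpy as np

rng = np.random.default_rng(1)

S = np.array([[1, 0, 0, 1],
              [0, 1, 1, 0],
              [1, 1, 0, 0],
              [0, 0, 1, 1]], dtype=float)
Ms = [np.outer(np.eye(4)[i], S[i]) for i in range(4)]     # S = sum of Ms
T = [(i, j) for i in range(4) for j in range(4) if S[i, j] == 1.0]

# Stage-1 weights: total mass of each M_i on T; stage-2: within M_i.
Ri = np.array([sum(M[e] for e in T) for M in Ms])
p_i = Ri / Ri.sum()
p_e = [np.array([M[e] for e in T]) / R for M, R in zip(Ms, Ri)]

def sample():
    i = rng.choice(len(Ms), p=p_i)           # choose M_i prop. to mass on T
    return T[rng.choice(len(T), p=p_e[i])]   # choose an entry within M_i

# Overall, each entry e has probability sum_i M_i(e)/R = S(e)/R = 1/|T|.
N = 40000
counts = {e: 0 for e in T}
for _ in range(N):
    counts[sample()] += 1
freqs = [c / N for c in counts.values()]
assert max(freqs) - min(freqs) < 0.02        # close to uniform on |T| = 8 entries
```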
Outline
Part I: Tools for Extended Formulations
• Yannakakis's Factorization Theorem
• The Rectangle Bound
• A Sampling Argument
Part II: Applications
• Correlation Polytope
• Approximating the Correlation Polytope
• Matching Polytope
The Construction of [Fiorini et al]
Correlation polytope: P_corr = conv{aa^T | a in {0,1}^n}
Slack matrix S: constraints indexed by b in {0,1}^n, vertices indexed by a in {0,1}^n, entries (1 − a^T b)^2
UNIQUE DISJ: output 'YES' if a and b as sets are disjoint, and 'NO' if a and b have one index in common
Why is that (a sub-matrix of) the slack matrix?
(1 − a^T b)^2 = 1 − 2 a^T b + (a^T b)^2 = 1 − 2⟨diag(b), aa^T⟩ + ⟨bb^T, aa^T⟩
and ⟨2 diag(b) − bb^T, aa^T⟩ ≤ 1 is a valid inequality for P_corr
What is the slack? Exactly (1 − a^T b)^2
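The identity behind the slack claim can be verified entry by entry; a brute-force numpy check over all 0/1 pairs for a small n:

```python
import numpy as np
from itertools import product

# For 0/1 vectors a, b:  (1 - a^T b)^2 = 1 - <2 diag(b) - b b^T, a a^T>,
# so <2 diag(b) - b b^T, x> <= 1 is valid for P_corr with slack (1 - a^T b)^2.
n = 4
for a_bits in product([0, 1], repeat=n):
    for b_bits in product([0, 1], repeat=n):
        a = np.array(a_bits, dtype=float)
        b = np.array(b_bits, dtype=float)
        inner = np.sum((2 * np.diag(b) - np.outer(b, b)) * np.outer(a, a))
        assert np.isclose(1 - inner, (1 - a @ b) ** 2)
        assert (1 - a @ b) ** 2 >= 0   # slack nonnegative at every vertex
```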
A Hard Distribution
Let T = {(a,b) | a^T b = 0}; then |T| = 3^n
Recall: S_{a,b} = (1 − a^T b)^2, so S_{a,b} = 1 for all pairs in T
How does the sampling procedure specialize to this case? (Recall it generates (a,b) uniformly from T)
Sampling Procedure:
• Let R_i be the sum of M_i(a,b) over (a,b) in T, and let R be the sum of the R_i
• Choose i with probability R_i/R
• Choose (a,b) with probability M_i(a,b)/R_i
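The count |T| = 3^n follows coordinate-wise: (a_j, b_j) can be (0,0), (0,1) or (1,0), but never (1,1). A quick enumeration confirms it for a small n:

```python
from itertools import product

n = 5
T = [(a, b)
     for a in product([0, 1], repeat=n)
     for b in product([0, 1], repeat=n)
     if all(x * y == 0 for x, y in zip(a, b))]
# 3 allowed values per coordinate, independently across coordinates.
assert len(T) == 3 ** n
```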
Entropy Accounting 101
Sampling Procedure:
• Let R_i be the sum of M_i(a,b) over (a,b) in T, and let R be the sum of the R_i
• Choose i with probability R_i/R
• Choose (a,b) with probability M_i(a,b)/R_i
Total entropy: at most log2 r for choosing i, plus, to be shown (?), at most (1−δ) n log2 3 for choosing (a,b) conditioned on i
Since the output is uniform on T, the total entropy is at least n log2 3, so log2 r ≥ δ n log2 3
Suppose that a_{-j} and b_{-j} are fixed
Restrict M_i to the four pairs (a,b) extending (a_{-j}, b_{-j}), arranged as a 2x2 block indexed by a_j (rows) and b_j (columns):

              (…b_j=0…)   (…b_j=1…)
(…a_j=0…)     M_i(a,b)    M_i(a,b)
(…a_j=1…)     M_i(a,b)    M_i(a,b)

If a_j=1 and b_j=1 then a^T b = 1, hence M_i(a,b) = 0
But rank(M_i) = 1, hence there must be another zero in either the same row or column
So at most two of the four entries are nonzero, and therefore
H(a_j, b_j | i, a_{-j}, b_{-j}) ≤ 1 < log2 3
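The rank-one step is just: M[1,1] = u_1 v_1 = 0 forces u_1 = 0 or v_1 = 0, wiping out a whole row or column. A small randomized sanity check (the sampled blocks are illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)

# A rank-one nonnegative 2x2 block M = outer(u, v) with M[1,1] = 0 must
# have u[1] = 0 or v[1] = 0, so a whole row or column vanishes.
for _ in range(1000):
    u, v = rng.random(2), rng.random(2)
    if rng.random() < 0.5:
        u[1] = 0.0        # one of the two ways to make u[1]*v[1] = 0
    else:
        v[1] = 0.0
    M = np.outer(u, v)
    assert M[1, 1] == 0
    assert (M[1, :] == 0).all() or (M[:, 1] == 0).all()
    # At most 2 of the 4 entries are nonzero, so the conditional
    # distribution of (a_j, b_j) has support size <= 2: entropy <= 1 bit.
    assert (M > 0).sum() <= 2
```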
Entropy Accounting 101
Generate a uniformly random (a,b) in T:
• Let R_i be the sum of M_i(a,b) over (a,b) in T, and let R be the sum of the R_i
• Choose i with probability R_i/R
• Choose (a,b) with probability M_i(a,b)/R_i
Total entropy: at most log2 r for choosing i, plus at most n for choosing (a,b) conditioned on i (one bit per coordinate)
Since the output is uniform on T: log2 r + n ≥ n log2 3, so log2 r ≥ n(log2 3 − 1) = Ω(n)
Outline
Part I: Tools for Extended Formulations
• Yannakakis's Factorization Theorem
• The Rectangle Bound
• A Sampling Argument
Part II: Applications
• Correlation Polytope
• Approximating the Correlation Polytope
• Matching Polytope
Approximate EFs [Braun et al]
Is there a K (with small xc) s.t. P_corr ⊆ K ⊆ (C+1) P_corr?
The matrix S now has constraints indexed by b in {0,1}^n, vertices indexed by a in {0,1}^n, and entries (1 − a^T b)^2 + C
New Goal: Output the answer to UDISJ with probability at least 1/2 + 1/(2(C+1))
Is the correlation polytope hard to approximate for large values of C?
Analogy: Is UDISJ hard to compute with probability 1/2 + 1/(2(C+1)) for large values of C?
There is a natural barrier at C = √n for proving lower bounds:
Claim: If UDISJ can be computed with probability 1/2 + 1/(2(C+1)) using o(n/C^2) bits, then UDISJ can be computed with probability 3/4 using o(n) bits
Proof: Run the protocol O(C^2) times and take the majority vote
Corollary [from K-S]: Computing UDISJ with probability 1/2 + 1/(2(C+1)) requires Ω(n/C^2) bits
Theorem [B-M]: Computing UDISJ with probability 1/2 + 1/(2(C+1)) requires Ω(n/C) bits
Theorem [B-M]: Any EF that approximates clique within n^(1−ε) has size exp(n^ε)
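The majority-vote amplification behind the √n barrier is a standard binomial calculation; a short sketch (the constants 16 and C = 3 are illustrative):

```python
import math

def majority_success(p_single, k):
    """P[majority of k independent runs is correct], for odd k."""
    return sum(math.comb(k, i) * p_single ** i * (1 - p_single) ** (k - i)
               for i in range(k // 2 + 1, k + 1))

C = 3
eps = 1 / (2 * (C + 1))        # advantage of a single run of the protocol
k = 16 * (C + 1) ** 2 + 1      # O(C^2) repetitions, kept odd
# Repeating ~1/eps^2 times boosts success from 1/2 + eps to a constant.
assert majority_success(0.5 + eps, k) >= 0.75
```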
Outline
Part I: Tools for Extended Formulations
• Yannakakis's Factorization Theorem
• The Rectangle Bound
• A Sampling Argument
Part II: Applications
• Correlation Polytope
• Approximating the Correlation Polytope
• Matching Polytope
The Matching Polytope [Edmonds]
P_PM = conv{1_M | M is a perfect matching in K_n}
Slack matrix S: constraints indexed by U ⊆ [n] with |U| odd, vertices indexed by 1_M, entries |δ(U) ∩ M| − 1
Is there a small rectangle covering? Yes! Just guess two edges in M, crossing the cut
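The nonnegativity of these slack entries rests on a parity fact: every perfect matching crosses an odd cut an odd number of times, so |δ(U) ∩ M| ≥ 1. A brute-force check on K_6:

```python
from itertools import combinations

def perfect_matchings(verts):
    """All perfect matchings of the complete graph on `verts`, as edge sets."""
    if not verts:
        yield frozenset()
        return
    v, rest = verts[0], verts[1:]
    for w in rest:
        remaining = [x for x in rest if x != w]
        for m in perfect_matchings(remaining):
            yield m | {frozenset({v, w})}

n = 6
V = list(range(n))
for M in perfect_matchings(V):
    for size in (1, 3, 5):                       # odd sets U
        for U in combinations(V, size):
            crossing = sum(1 for e in M if len(set(e) & set(U)) == 1)
            # Matched pairs inside U cover an even number of U's vertices,
            # so an odd number must be matched across the cut.
            assert crossing % 2 == 1
            assert crossing - 1 >= 0             # slack |delta(U) ∩ M| - 1 >= 0
```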
Hyperplane Separation Lemma [Rothvoss], attributed to [Fiorini]:
Lemma: For the slack matrix S and any matrix W:
rank+(S) ≥ ⟨S, W⟩ / (||S||_∞ · α)
where α = max ⟨W, R⟩ s.t. R is rank one with entries in [0,1]
Proof: write S = Σ R_i with each R_i rank one and nonnegative; then
⟨W, S⟩ = Σ ||R_i||_∞ ⟨W, R_i/||R_i||_∞⟩ ≤ r ||S||_∞ α
(using ||R_i||_∞ ≤ ||S||_∞, since 0 ≤ R_i ≤ S entrywise)
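The accounting in the proof can be traced on a toy instance; a sketch (S is the square's slack matrix, W is an arbitrary sign matrix of our choosing, and α is replaced by the easy upper bound Σ of W's positive entries, valid since R has entries in [0,1]):

```python
import numpy as np

S = np.array([[1, 0, 0, 1],
              [0, 1, 1, 0],
              [1, 1, 0, 0],
              [0, 0, 1, 1]], dtype=float)
Rs = [np.outer(np.eye(4)[i], S[i]) for i in range(4)]   # S = sum of r = 4 terms
W = np.array([[ 1, -1, -1,  1],
              [-1,  1,  1, -1],
              [ 1,  1, -1, -1],
              [-1, -1,  1,  1]], dtype=float)

alpha_ub = W[W > 0].sum()      # <W, R> <= sum of positive entries of W
for R in Rs:
    # Per-term step of the proof: <W, R_i> <= ||R_i||_inf * alpha.
    assert (W * R).sum() <= R.max() * alpha_ub + 1e-9
    assert R.max() <= np.abs(S).max()              # ||R_i||_inf <= ||S||_inf
# Summing the r terms gives <W, S> <= r * ||S||_inf * alpha.
assert (W * S).sum() <= len(Rs) * np.abs(S).max() * alpha_ub
```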
Theorem [Rothvoss '13]: Any EF for perfect matching has size 2^Ω(n) (same for TSP)
How do we choose W?
W_{U,M} =
  −∞     if |δ(U) ∩ M| = 1
  1/Q_3   if |δ(U) ∩ M| = 3
  −1/Q_k  if |δ(U) ∩ M| = k
  0       else
Proof is a substantial modification of Razborov's rectangle corruption lemma
Summary:
• Extended formulations and Yannakakis' factorization theorem
• Lower bound techniques: rectangle bound, information complexity, hyperplane separation
• Applications: connections between the correlation polytope and disjointness
• Open question: Can we prove lower bounds against general SDPs?
Any Questions?
Thanks!