Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf ·...

25
Local Search for String Problems: Brute Force is Essentially Optimal Jiong Guo, Danny Hermelin, Christian Komusiewicz Universit¨ at des Saarlandes, Ben-Gurion University, TU Berlin & Universit´ e de Nantes CPM 2013

Transcript of Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf ·...

Page 1: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Local Search for String Problems: Brute Force isEssentially Optimal

Jiong Guo, Danny Hermelin, Christian Komusiewicz

Universitat des Saarlandes, Ben-Gurion University,TU Berlin & Universite de Nantes

CPM 2013

Page 2: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

A local search problem Π consists of

a set I of instances,

for each instance I ∈ I a set F (I ) of feasible solutions,

an objective function f : F (I )→ Z, and

for each solution s ∈ F (I ) a neighborhood N(s, I ) ⊆ F (I ).

Goal: Find locally optimal solution s ∈ F (I ) such that forall s ′ ∈ N(s, I ), f (s) ≤ f (s ′).

Generic local search algorithm:

1 s := “some element of F (I )”2 while s is not locally optimal :3 find s ′ ∈ N(s) with f (s) < f (s ′)4 s := s ′

5 return s

Examples: k-Means, Simplex

C. Komusiewicz (TU Berlin & Univ. Nantes) Motivation (Local Search) 2 / 14

Page 3: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Advantages of local search:

fast in practice

local optima often close to global optimum

generic algorithm scheme easy to implement

Tuning local search algorithms:

faster convergence to local optimum

changing type/size of neighborhood N

Theoretical analysis:

number of iterations PLS-completeness [Johnson, Papadimitriou & Yannakakis, JCSS 1988],Smoothed Analysis [Spielman & Teng, JACM 2004]

efficiently searching the neighborhood N Parameterized Local Search

C. Komusiewicz (TU Berlin & Univ. Nantes) Motivation (Local Search) 3 / 14

Page 4: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Example:

LS-TSPInput: A set V of n cities, a distance matrix d : V × V → Z, a tour T ,and an integer k.

Question: Can we obtain a shorter tour T ′ by replacing ≤ k arcs of T?

Obvious: Solvable in nO(k) time

Aim: Improve running time to f (k) · poly(n)

But: LS-TSP is W[1]-hard parameterized by k [Marx, ORL 2008]

probably no f (k) · poly(n)-time algorithm

C. Komusiewicz (TU Berlin & Univ. Nantes) Motivation (Parameterized Local Search) 4 / 14

Page 5: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Can we avoid brute-force nO(k) algorithms for searching theneighborhood?

General: Parameterized local search problems usually W[1]-hardwith respect to the k-change neighborhood[Fellows et al., JCSS, 2012]

Positive results for...

... restricted inputs:

LS-Feedback Arc Set in tournaments [Fomin et al., AAAI 2010]

LS-TSP on planar graphs [Guo et al., Algorithmica, to appear],

... other neighborhood types:

LS-TSP with ≤ k swaps in an m-bounded range[Guo et al., Algorithmica, to appear]

LS-TSP with maximum shift k [Balas, AOR 1999]

How easy are string problems with respect to parameterized localsearch?g

C. Komusiewicz (TU Berlin & Univ. Nantes) Motivation (Parameterized Local Search) 5 / 14

Page 6: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Can we avoid brute-force nO(k) algorithms for searching theneighborhood?

General: Parameterized local search problems usually W[1]-hardwith respect to the k-change neighborhood[Fellows et al., JCSS, 2012]

Positive results for...

... restricted inputs:

LS-Feedback Arc Set in tournaments [Fomin et al., AAAI 2010]

LS-TSP on planar graphs [Guo et al., Algorithmica, to appear],

... other neighborhood types:

LS-TSP with ≤ k swaps in an m-bounded range[Guo et al., Algorithmica, to appear]

LS-TSP with maximum shift k [Balas, AOR 1999]

How easy are string problems with respect to parameterized localsearch?g

C. Komusiewicz (TU Berlin & Univ. Nantes) Motivation (Parameterized Local Search) 5 / 14

Page 7: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Can we avoid brute-force nO(k) algorithms for searching theneighborhood?

General: Parameterized local search problems usually W[1]-hardwith respect to the k-change neighborhood[Fellows et al., JCSS, 2012]

Positive results for...

... restricted inputs:

LS-Feedback Arc Set in tournaments [Fomin et al., AAAI 2010]

LS-TSP on planar graphs [Guo et al., Algorithmica, to appear],

... other neighborhood types:

LS-TSP with ≤ k swaps in an m-bounded range[Guo et al., Algorithmica, to appear]

LS-TSP with maximum shift k [Balas, AOR 1999]

How easy are string problems with respect to parameterized localsearch for the Hamming distance neighborhood?g

C. Komusiewicz (TU Berlin & Univ. Nantes) Motivation (Parameterized Local Search) 5 / 14

Page 8: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Closest StringInput: Strings S1, . . . ,Sm of length n over alphabet Σ, an integer d .

Question: Is there a string S of length n with Hamming distance ≤ d to

each Si ∈ S?

Known:

NP-hard [Frances & Litmann, TOCS 1999]

O(dd+1 ·m + nm)-time algorithm[Gramm, Niedermeier & Rossmanith, Algorithmica 2003]

f (m) · poly(n)-time algorithm[Gramm, Niedermeier & Rossmanith, Algorithmica 2003]

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Problem Definition) 6 / 14

Page 9: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Closest StringInput: Strings S1, . . . ,Sm of length n over alphabet Σ, a string S withHamming distance ≤ d to each Si ∈ S, and k ≥ 0.Question: Is there a string S ′ of length n with Hamming distance

≤ k to S , and

< d to each Si ∈ S?

S1 :=

:=

:=S2

S3

BAAAA

CCACB

ABBAC

S := BBBBB

dH(S ,S1) = 4

dH(S ,S2) = 4

dH(S ,S3) = 3

k = 2

S ′ := BBAAB

dH(S ′,S1) = 2

dH(S ′,S2) = 3

dH(S ′,S3) = 3

dH(S ′,S) = 2

Proposition: LS-Closest String can be solved in dk · poly(n,m)time.

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Problem Definition) 7 / 14

Page 10: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Closest StringInput: Strings S1, . . . ,Sm of length n over alphabet Σ, a string S withHamming distance ≤ d to each Si ∈ S, and k ≥ 0.Question: Is there a string S ′ of length n with Hamming distance

≤ k to S , and

< d to each Si ∈ S?

S1 :=

:=

:=S2

S3

BAAAA

CCACB

ABBAC

S := BBBBB

dH(S ,S1) = 4

dH(S ,S2) = 4

dH(S ,S3) = 3

k = 2

S ′ := BBAAB

dH(S ′,S1) = 2

dH(S ′,S2) = 3

dH(S ′,S3) = 3

dH(S ′,S) = 2

Proposition: LS-Closest String can be solved in dk · poly(n,m)time.

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Problem Definition) 7 / 14

Page 11: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Closest StringInput: Strings S1, . . . ,Sm of length n over alphabet Σ, a string S withHamming distance ≤ d to each Si ∈ S, and k ≥ 0.Question: Is there a string S ′ of length n with Hamming distance

≤ k to S , and

< d to each Si ∈ S?

S1 :=

:=

:=S2

S3

BAAAA

CCACB

ABBAC

S := BBBBB

dH(S ,S1) = 4

dH(S ,S2) = 4

dH(S ,S3) = 3

k = 2

S ′ := BBAAB

dH(S ′, S1) = 2

dH(S ′, S2) = 3

dH(S ′, S3) = 3

dH(S ′, S) = 2

Proposition: LS-Closest String can be solved in dk · poly(n,m)time.

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Problem Definition) 7 / 14

Page 12: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Closest StringInput: Strings S1, . . . ,Sm of length n over alphabet Σ, a string S withHamming distance ≤ d to each Si ∈ S, and k ≥ 0.Question: Is there a string S ′ of length n with Hamming distance

≤ k to S , and

< d to each Si ∈ S?

S1 :=

:=

:=S2

S3

BAAAA

CCACB

ABBAC

S := BBBBB

dH(S ,S1) = 4

dH(S ,S2) = 4

dH(S ,S3) = 3

k = 2

S ′ := BBAAB

dH(S ′, S1) = 2

dH(S ′, S2) = 3

dH(S ′, S3) = 3

dH(S ′, S) = 2

Proposition: LS-Closest String can be solved in dk · poly(n,m)time.

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Problem Definition) 7 / 14

Page 13: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Theorem: LS-Closest String is NP-hard and W[2]-hard for theradius k of the Hamming neighborhood.

Proof by parameterized reduction from

Multicolored Hitting SetInput: A hypergraph G = (V , E) andvertex-coloring c : V → {1, . . . , k}.Question: Is there a size-k set V ′ ⊆ V such that

V ′ ∩ E 6= ∅ for all E ∈ E , and

V ′ is colorful?

Known: Multicolored Hitting Set parameterized by k isW[2]-hard

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Unbounded Alphabet) 8 / 14

Page 14: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

Parameterized Reduction:

(I , k)A

(I ′, k ′)

(I , k) is a yes-instance ⇔ (I ′, k ′) is a yes-instance

A runs in f (k) · poly(|I |) time

k ′ ≤ g(k)

A parameterized reduction from a W[t]-hard problem L to aproblem L′ shows W [t]-hardness of L′

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Unbounded Alphabet) 9 / 14

Page 15: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

V = {1, 2, 3, 4, 5, 6}E = {E1,E2,E3,E4}E1 = {1, 2, 3} E2 = {4, 6} E3 = {1, 6} E4 = {3, 5}

1 2 3 4 5 6

SE1 R B R X X X

SE2 X X X O X B

SE3 R X X X X B

SE4 X X R X O X

SR R R R R R R

SB B B B B B B

SO O O O O O O

S * * * * * *

|V ||V ||V ||V ||V ||V ||V |

S ′ R * * * O B

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Unbounded Alphabet) 10 / 14

Page 16: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

V = {1, 2, 3, 4, 5, 6}E = {E1,E2,E3,E4}E1 = {1, 2, 3} E2 = {4, 6} E3 = {1, 6} E4 = {3, 5}

1 2 3 4 5 6

SE1 R B R X X X

SE2 X X X O X B

SE3 R X X X X B

SE4 X X R X O X

SR R R R R R R

SB B B B B B B

SO O O O O O O

S * * * * * *

|V ||V ||V ||V ||V ||V ||V |

S ′ R * * * O B

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Unbounded Alphabet) 10 / 14

Page 17: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

V = {1, 2, 3, 4, 5, 6}E = {E1,E2,E3,E4}E1 = {1, 2, 3} E2 = {4, 6} E3 = {1, 6} E4 = {3, 5}

1 2 3 4 5 6

SE1 R B R X X X

SE2 X X X O X B

SE3 R X X X X B

SE4 X X R X O X

SR R R R R R R

SB B B B B B B

SO O O O O O O

S * * * * * *

|V ||V ||V ||V ||V ||V ||V |

S ′ R * * * O B

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Unbounded Alphabet) 10 / 14

Page 18: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

V = {1, 2, 3, 4, 5, 6}E = {E1,E2,E3,E4}E1 = {1, 2, 3} E2 = {4, 6} E3 = {1, 6} E4 = {3, 5}

1 2 3 4 5 6

SE1 R B R X X X

SE2 X X X O X B

SE3 R X X X X B

SE4 X X R X O X

SR R R R R R R

SB B B B B B B

SO O O O O O O

S * * * * * *

|V ||V ||V ||V ||V ||V ||V |

S ′ R * * * O B

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Unbounded Alphabet) 10 / 14

Page 19: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

For binary alphabet:

SE1 1 1 1 0 0 0

SE2 0 0 1 0 1 0

SE3 1 0 0 1 0 0

SE4 0 1 0 0 0 1

S{R,B,O} 1 1 1 1 1 1

S{B,O} 0 0 1 1 1 1

S{R,O} 1 1 0 0 1 1

S{R,B} 1 1 1 1 0 0

S{R} 1 1 0 0 0 0

S{B} 0 0 1 1 0 0

S{O} 0 0 0 0 1 1

S 0 0 0 0 0 0

1 |V | − k

|V | − k

|V | − k

|V | − k

|V ||V | − 1

|V | − 1

|V | − 1

|V | − 2

|V | − 2

|V | − 21

00 . . . . . .

S ′ 1 0 1 0 0 1 0 . . . . . . 0

C. Komusiewicz (TU Berlin & Univ. Nantes) Closest String (Binary Alphabet) 11 / 14

Page 20: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Closest String:

NP-hard

W[2]-hard for radius k of the Hamming neighborhood even incase |Σ| = 2

no no(k)-time algorithm(assuming the Exponential Time Hypothesis)

solvable in dk · poly(n,m) time

Further results:

W[1]-hardness for radius k of the Hamming neighborhood for localsearch variants of

Longest Common Subsequence

Shortest Common Supersequence

Shortest Common Superstring

C. Komusiewicz (TU Berlin & Univ. Nantes) Further Results (Overview) 12 / 14

Page 21: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Longest Common SubsequenceInput: Strings T1, . . . ,Tm over alphabet Σ, a string S that is asubsequence of each Ti , and integer k .Question: Is there a letter σ ∈ Σ and a string S of length |S | suchthat Sσ is a subsequence of each string in τ and dH(S , S) ≤ k?

S1 :=

:=

:=S2

S3

BCBAAACBA

BCCACBAC

CABBAC

S := ABA, k = 1

Sσ := BBAC

Theorem: LS-Longest Common Subsequence with |Σ| = O(1)is W[1]-hard for the radius k of the Hamming neighborhood.

C. Komusiewicz (TU Berlin & Univ. Nantes) Further Results (LS-LCS) 13 / 14

Page 22: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Longest Common SubsequenceInput: Strings T1, . . . ,Tm over alphabet Σ, a string S that is asubsequence of each Ti , and integer k .Question: Is there a letter σ ∈ Σ and a string S of length |S | suchthat Sσ is a subsequence of each string in τ and dH(S , S) ≤ k?

S1 :=

:=

:=S2

S3

BCBAAACBA

BCCACBAC

CABBAC

S := ABA, k = 1

Sσ := BBAC

Theorem: LS-Longest Common Subsequence with |Σ| = O(1)is W[1]-hard for the radius k of the Hamming neighborhood.

C. Komusiewicz (TU Berlin & Univ. Nantes) Further Results (LS-LCS) 13 / 14

Page 23: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Longest Common SubsequenceInput: Strings T1, . . . ,Tm over alphabet Σ, a string S that is asubsequence of each Ti , and integer k .Question: Is there a letter σ ∈ Σ and a string S of length |S | suchthat Sσ is a subsequence of each string in τ and dH(S , S) ≤ k?

S1 :=

:=

:=S2

S3

BCBAAACBA

BCCACBAC

CABBAC

S := ABA, k = 1

Sσ := BBAC

Theorem: LS-Longest Common Subsequence with |Σ| = O(1)is W[1]-hard for the radius k of the Hamming neighborhood.

C. Komusiewicz (TU Berlin & Univ. Nantes) Further Results (LS-LCS) 13 / 14

Page 24: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

LS-Longest Common SubsequenceInput: Strings T1, . . . ,Tm over alphabet Σ, a string S that is asubsequence of each Ti , and integer k .Question: Is there a letter σ ∈ Σ and a string S of length |S | suchthat Sσ is a subsequence of each string in τ and dH(S , S) ≤ k?

S1 :=

:=

:=S2

S3

BCBAAACBA

BCCACBAC

CABBAC

S := ABA, k = 1

Sσ := BBAC

Theorem: LS-Longest Common Subsequence with |Σ| = O(1)is W[1]-hard for the radius k of the Hamming neighborhood.

C. Komusiewicz (TU Berlin & Univ. Nantes) Further Results (LS-LCS) 13 / 14

Page 25: Local Search for String Problems: Brute Force is Essentially Optimalstelo/cpm/cpm13/11_guo.pdf · 2013-07-26 · Local Search for String Problems: Brute Force is Essentially Optimal

So far: only negative results

Possible directions:

other problems: Multiple Sequence Alignment, . . .

other neighborhood types: “shift”-based distances,insertions/deletions at arbitrary solution positions

C. Komusiewicz (TU Berlin & Univ. Nantes) Outlook () 14 / 14