1 Property Testing in Sparse and General Graphs Michael Krivelevich Tel Aviv University.
-
Upload
gloria-perkins -
Category
Documents
-
view
214 -
download
0
Transcript of 1 Property Testing in Sparse and General Graphs Michael Krivelevich Tel Aviv University.
1
Property Testing in Sparse and General Graphs
Michael Krivelevich
Tel Aviv University
2
Graph Property Testing
Very general setting:
P = graph property to test(k-colorability, planarity, non-existence of a copy of H, etc.)
Input: graph G on n vertices, n→∞
Promise: GP (positive)or: G is ε-far from P (negative)
(ε-percentage of description of P should be changed to get HP)
Algorithm A (typically randomized): Queries description of PGP Pr[ A accepts G] ≥ 2/3G is ε-far from P Pr[ A rejects G] ≥ 2/3
GP, Pr[ A accepts G] =1 – one-sided error algorithm
Should be specified!
3
Property Testing in Dense Graphs- Formally defined in GGR’98(appeared implicitly in combinatorial papers in 70’s, 80’s)
Input graph description: adjacency matrixG=(V,E), V=[n]
Algorithm: queries the adjacency matrix of GQuery: whether (i,j) E(G)?(vertex pair query)Distance: G is is ε-far from P if ≥εn2 entries in A(G) need to
be changed to get HP
1, ( , ) ( )
0,ij
i j E Ga
otherwise
n nA
4
Property Testing in Dense Graphs – Brief Summary
“… It’s all about REGULARITY.” (AFNS’06)
• Very strong (and fruitful) connection between property testing in dense graphs and the Szemerédi Regularity Lemma and its versions
(started in AFKS’99 and culminated in AFNS’06)
• Have reached very good understanding of this setting
(though of course quite a few challenging problems remain)
5
Dense Graph Model - limitations
• Suitable/tailored for dense graphs only
• Degenerate for many graph properties Ex. : P = “ G is connected”
- Always answer “YES” ( dist(G,P)≤ n-1 << εn2 )
• A typical algorithm: - sample S [n], |S|=O(1) - look inside to check whether G[S]P
- returns a.s. empty set S for |E(G)|=o(n2) useless/irrelevant
6
Property Testing in Bounded Degree Graphs
Introduced by GR’97
• Assumption: Δ(input graph G) ≤ d=const; ε<< 1/d• Graph representation: by incidence lists
L(vi)=(vi,1,…,vi,d) – list of neighbors of vi
• Query: who is the j-th neighbor of vi?(neighbor query)
• Distance: G is ε-far from P if need ≥ εdn modifications in incidence lists to get HP
7
Bounded Degree Graphs – an Example
Th. (GR’97): Connectivity in bounded degree model can be tested in O(1/ε2) queries
Proof: Assume: G is ε-far from being connected
G has ≥ εn connected components
G has ≥ εn/2 con. components of size ≤ 2/ε (= small components)
≥ ε/2 percentage of all vertices in small components
8
Property Testing in Bounded Degree Graphs (cont.)
Algorithm: Repeat O(1/ε) times:
1. Sample a random vertex vRV
2. Explore the connected component C(v) of v till accumulate 2/ε vertices
3. If |C(v)| ≤ 2/ε – reject
If never reject – accept
One-sided error algorithm with complexity O(1/ε2)
More careful analysis (1/ε) queries O~
9
Testing bounded degree graphs – basic tools
• Random sampling
• Local search
(exploring the neighborhood/ball of a vertex)
• Random walks
(a random neighbor of a random neighbor of a random neighbor…)
10
Bounded degree – first results
Results from GR’97:
Can test:-connectivity:
connectivity in (1/ε) queries2-edge connectivity: (1/ε2) 3-edge connectivity: (1/ε3) k-vertex connectivity, k=2,3: (1/εk)
- one-sided error algorithms
- cycle-freeness in O(1/ε3) queries- two-sided error algorithmProof idea: G is ε-far from a forest many small components with a cycle, or large components Ci with large surplus e(Ci)-v(Ci)
O~
O~O
~
O~
Uses structural connectivity results (block, cactus, etc.)
11
Testing bipartiteness in bounded degree graphsP = “G is bipartite”
Lower bound (GR’97): Ω(√n) queries- in very sharp contrast to the dense case
Proof idea: Negative distribution DN= Hamilton cycle + random perfect matching
(O(1)-far from being bipartite a.s.)Positive distribution DP=Hamilton cycle + random perfect matching between
vertices of different parity
= DN = DP
Any tester: can’t distinguish between DP, DN before having seen a cycle
Takes Ω(√n) queries by birthday paradox
12
Testing bipartiteness in bounded degree graphs (cont.)
Th. (GR’99): There is a one-sided error algorithm for testing bipartiteness in the bounded degree model in (√n) queries.
Algorithm: Repeat T= O(1/ε) times:
1. Choose a random vertex sRV2. Perform K:= (√n) random walks of length L:=polylog(n) starting from s3. If get to the same endvertex by an odd and an even path – reject
If no rejection - accept
O~
O~
13
Testing bipartiteness in bounded degree graphs (cont.)
Analysis: very elaborate
- relatively easy for rapidly mixing case
[ s Pr[a random walk of length L starting from s] = Θ(1/n) )]
- for general case:
no rapid mixing small cut (M’89)
use them to decompose the graph and the problem
14
Testing k-colorability
P = “G is k-colorable”; k≥3 – fixed
Obviously can be done in O(n) queries(just get all O(dn) edges of G)
Th. (BOT’02): For every fixed k ≥3, testing k-colorability in the bounded degree model requires Ω(n) queries
No room for sophisticated testing algorithms
15
Testing k-colorability (cont.)
Proof Idea: For one-sided error: Can use classical result of Erdős’62:
Th.: There exists G=(V,E), |V|=n, Δ(G)=O(1), G is ε-far from 3-colorable, but: every δn edges form a 3-colorable graph
tester has to obtain ≥ δn edges to catch G0 G with χ(G0)>3
For two-sided error algorithm:- Two distributions (positive, negative) over instances of systems of
linear equations; Any algorithm can’t distinguish between them in o(n) time- Then: gap preserving reductions from linear equations to 3-
colorability
16
Testing in non-expanding bounded degree graphs
Czumaj, Shapira, Sohler’07
Notion of hereditary non-expanding graphs:Def: G is λ-expanding if for every V0 V(G), |V0| ≤n/2,
|N(V0)|≥ λ |V_0|
Def: Graph family F is non-expanding if there exists n0=n0(F) s.t. for all GF , |V(G)|≥ n0, G is not (1/log2n)-expanding
Ex.: F =planar graphs – non-expanding(exists separator of size O(√|V(G)|)
Use: G non-expanding family F , bounded degree can repeatedly cut G to decompose it into constant sized pieces H1,H2,
…, number of edges between pieces ≤ ε n/2
17
Testing in non-expanding graphs (cont.)
Th. (CSS): P= hereditary property(closed under taking induced subgraphs, say, 3-colorability)
Assume: Input G non-expanding family F of bounded degree subgraphs
P can be tested over F in constant time f(ε)
Proof idea: Decompose G=(H1,H2,…) as aboveG=negative instance
many of Hi’s are witnesses can be found by random sampling + local search
18
Testing planarity
Th. (BSS’08) P = “G is planar”
P can be tested in time Oε(1) in bounded degree graphs by a 2-sided error algorithm(proved more: every minor-closed property P is testable in constant time)
Proof idea: Local statistics in planar graphs differ substantially from those in graphs ε-far from planar
(related to hyper-finite graphs, converging sequences of sparse graphs, etc.)
19
Testing planarity (cont.)
Remarks: 1. Get two-sided error algorithm, query complexity exp(exp(exp(1/ε))). Better query complexity?
2. Two-sided vs one-sidedEx: G= bounded degree expander of high girth (Θ(log n))
(say, LPS graph)- Θ(1)-far from planar- every c logn edges form a forest planar subgraph LB=Ω(log n)
can strengthen to Ω(√ n) of GR’97
Conj: P= “G is H-minor free”P can be tested with a one-sided error algorithm in O(√n) queries
20
Bounded degree graphs –open questions
• Characterization of testable properties? (testable := testable in Oε(1) queries)
or at least: wide classes of testable properties
• One-sided vs two-sided? Comparative study for various properties
• Testing in restricted graph classes?(á la CSS)
• Tolerant testing? Estimating distance to a given property?
21
Bounded degree model - limitations
Opposite/similar to the dense model
• Suitable/tailored only for bounded degree graphs
• Distance notion is “hardwired” – measured always w.r.t. to dn
• Degenerates for certain properties
(e.g. √ n-colorability – always answer “YES”)
22
Testing in graphs of general density- Introduced in KKR’03
Main principles:1. Distance in measured w.r.t. to the actual size of the input graph
(latter can be approximated first if necessary)G=(V,E) is ε-far from P if ≥ ε|E| edges need to be changed to get HP(appeared already in PR’02)
2. Queries allowed:a) vertex pair queries: whether (i,j) E(G)?(like in the dense model)b) neighbor queries: j-th neighbor of i V(G)?(like in the sparse model)c) degree queries: what is dG(i)?
No inherent limitation on input graph density
23
Testing bipartiteness in general graphs
Th. (KKR’03):
1. Testing bipartiteness can be done in (min(√n, n/d)) queries,where d=2|E|/|V| is the average degree of G;
2. Lower bound of Ω(min(√n, n/d))
- continuous interpolation between the sparse and the dense cases
queries
n√
dn√ n
O~
24
Testing bipartiteness for general graphs - proofs
Upper bound:Case d≤√n – same as in the bounded degree model
K:= Oε(√n), L:=polylogε(n)
Repeat T= O(1/ε) times:1. Choose a random vertex sRV2. Perform K random walks of length L starting from s3. A0 = endpoints of walks corresponding to paths of even length A1 = endpoints of walks corresponding to paths of odd length4. If A0∩ A1 ≠Ø – reject, found an odd cycle
Never rejected - accept
25
Testing bipartiteness for general graphs – proofs (cont.)
Upper bound:Case d≥√n
Now: K:= Oε(√(n/d)), L:=polylogε(n)
A0 , A1 – as before
Check whether A0 or A1 spans an edge(here use vertex pair queries)
If happens – reject
Never happens - accept
26
Testing bipartiteness for general graphs – proofs (cont.)
Lower bound:
Negative distribution DN= Gn,d – random d-regular graphPositive distribution DP=Gn/2,n/2,d – random bipartite d-regular graph
- choose an equipartition V=(V1,V2) u.a.r.- construct a random d-regular bipartite graph between V1, V2
Proof idea: ALG = arbitrary algorithm• o(n/d) vertex pair queries a.s. do not produce an edge• have seen o(√n) vertices a.s. no neighbor query closes a cycle
(birthday paradox)
o(min(n/d, √n)) queries – both items apply, can’t distinguish between DP, DN
27
Testing triangle-freeness in general graphs
Result of AKKR’06
Property P to test = “G is K3-free”
Most interesting part – Lower Bound
d:=average degree of the input graph• d≤ n1-δ(n), δ(n)→ 0 Ω(n1/3) queries are needed• d=Θ(n) Oε(1) queries are enough (AFKS’99) Threshold-like behavior for query complexity, abrupt change around
d=Θ(n)
Proof Idea: Cayley graphs, set of generators – random subset of a dense 3AP-free set (c.f. A’02 for the dense case)
28
Comparative study of strength of different query types
- BKKR’08Test case: k-colorability, k≥3 fixedModels to compare: vertex pair queries neighbor queries combined model (pair+neighbor queries) new query type – group query
Group query: vV - vertex, S – vertex subset
? Whether there is an edge between v and S in G ?
YES/NO(and then can find a random edge between v and S in O(log n) queries if needed)
- motivated by Group Testing
29
Comparative study of strength of different query types -results
On the qualitative level:• vertex pair, neighbor < combined model < group query
• vertex pair queries are better for dense graphs, neighbor queries are better for sparse graphs
• for group queries: UB=O(n/d)
LB= Ω(n/d)
(d := average degree of the input graph)
Say, in testing bipartiteness
30
Testing general graphs – open problems
Results for (other) concrete problems?(testing H-freeness, k-colorability, etc.)
Develop technology for proving lower bounds
One-sided vs two-sided error algorithms?
What if given ability to sample a random edge?(to eliminate hiding small dense hard instances)
Further query types, their comparison? Query types driven by practical applications?