Inference in Sparse Graphs with Pairwise Measurements and Side Information

Dylan Foster, Daniel Reichman, and Karthik Sridharan

djfoster@cs.cornell.edu, daniel.reichman@gmail.com, sridharan@cs.cornell.edu

Some Concrete Results

• If G is connected, Error = Õ(p|V|).
• For grids of size c × n, the optimal error is Θ(p²n), unless c ≤ 2, in which case O(pn) is optimal.
• √n × √n grid: we recover O(p²n) by applying the tree-decomposition theorem below.

Tree decompositions

For a graph G = (V, E), let W = {W₁, …, W_N} be a collection of subsets of V, and let T = (W, F) be a tree graph over W. T is a tree decomposition if:

1. ⋃_{W ∈ W} W = V.
2. For each uv ∈ E, some W ∈ W has u, v ∈ W.
3. For W₁, W₂, W₃ ∈ W, if W₂ is on the path from W₁ to W₃, then W₁ ∩ W₃ ⊆ W₂.

Main theorem: Recovery from tree decomposition

Suppose we have:

• G′ = (V, E′) with E′ ⊆ E.
• A tree decomposition T = (W, F) for G′ with constant width and overlap.
• Δ(G′(W)) ≤ Δ for each W ∈ W, i.e. each bag's induced subgraph has maximum degree at most Δ.

Then there is an efficient Ŷ such that

$$\mathrm{Error} \leq \tilde{O}\big(p^{\lceil \Delta/2 \rceil}\, n\big).$$
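To make the three conditions concrete, here is a minimal Python sketch (our illustration, not code from the paper) that verifies them for a small grid, using illustrative bags built from adjacent column pairs:

```python
# Minimal sketch (not from the paper): verify the three tree-decomposition
# properties for a small graph. The graph and bags below are illustrative.
def is_tree_decomposition(V, E, bags, tree_edges):
    # 1. Every vertex is covered by some bag.
    if set().union(*bags) != set(V):
        return False
    # 2. Every edge is contained in some bag.
    if not all(any({u, v} <= bag for bag in bags) for u, v in E):
        return False
    # 3. Running-intersection form of the path condition: the bags
    #    containing any fixed vertex form a connected subtree of T.
    for v in V:
        containing = {i for i, bag in enumerate(bags) if v in bag}
        start = next(iter(containing))
        seen, stack = {start}, [start]
        while stack:
            i = stack.pop()
            for a, b in tree_edges:
                for j in ((b,) if a == i else (a,) if b == i else ()):
                    if j in containing and j not in seen:
                        seen.add(j)
                        stack.append(j)
        if seen != containing:
            return False
    return True

# Example: 3 x 3 grid, bags = pairs of adjacent columns, T = a path of bags.
V = [(r, c) for r in range(3) for c in range(3)]
E = [((r, c), (r, c + 1)) for r in range(3) for c in range(2)] + \
    [((r, c), (r + 1, c)) for r in range(2) for c in range(3)]
bags = [frozenset((r, c) for r in range(3) for c in (j, j + 1)) for j in range(2)]
tree_edges = [(0, 1)]
print(is_tree_decomposition(V, E, bags, tree_edges))  # True
```

Condition 3 is checked in its equivalent "running intersection" form: the set of bags containing any fixed vertex must be connected in T.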

Tree algorithm: proof sketch

Statistical learning reduction: take Ŷ to be the empirical risk minimizer over the hypothesis class F(X) (constructed in the second part of the sketch, below):

$$\hat{Y} = \arg\min_{Y' \in F(X)} \sum_{v \in V} \mathbf{1}\{Y'_v \neq Z_v\}.$$

Rate for ERM:

$$\sum_{v \in V} \mathbf{1}\{\hat{Y}_v \neq Y_v\} \leq O\big(\log|F(X)| / \epsilon^2\big) \quad \text{w.h.p. over } Z.$$

We have |F(X)| ≈ O((1/p)^{pn}), so log|F(X)| = O(pn log(1/p)) and hence E(Ŷ) ≤ Õ(pn).
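To fill in the counting step, here is a loose sketch (our reconstruction, with sloppy constants) of why |F(X)| is at most (1/p)^{O(pn)} when G is a tree: a labeling Y′ is determined by its sign at the root together with the set of edges on which it disagrees with X, and membership in F(X) caps that set's size at roughly 2pn.

$$|F(X)| \;\leq\; 2 \sum_{k=0}^{2pn + O(1)} \binom{n-1}{k} \;\leq\; 2\left(\frac{e(n-1)}{2pn}\right)^{2pn + O(1)} \;=\; (1/p)^{O(pn)},$$

$$\text{so} \quad \log|F(X)| = O\big(pn \log(1/p)\big) \;\Longrightarrow\; E(\hat{Y}) \leq O\big(pn \log(1/p) / \epsilon^2\big) = \tilde{O}(pn).$$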

First result

Theorem: Optimal recovery for trees. When G is a tree:

• There is an efficient algorithm Ŷ with Hamming error Error ≤ Õ(pn) w.h.p.
• There is a matching lower bound of Ω(pn).
• Since every connected G contains a spanning tree, it follows that Error = Õ(pn) for all connected G!

Key ideas:
• The edge MLE is not enough! Squeeze more information out of the Zs; a toy simulation of the failure mode appears below.
• Adopt a statistical learning viewpoint.
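To see why edge information alone falls short, here is a small simulation (our illustration, not from the paper): on a path, the natural edge-by-edge propagation lets a single flipped measurement corrupt every downstream vertex, so its Hamming error blows up well past the O(pn) benchmark.

```python
# Small simulation (our illustration): on a path graph, propagating labels
# edge by edge lets one flipped measurement corrupt every downstream vertex.
import random

def propagation_error(n=10_000, p=0.01, seed=0):
    rng = random.Random(seed)
    Y = [rng.choice([-1, 1]) for _ in range(n)]
    # Noisy edge labels along the path: X[i] observes Y[i] * Y[i + 1].
    X = [Y[i] * Y[i + 1] * (-1 if rng.random() < p else 1) for i in range(n - 1)]
    # Edge-only estimate: start from the true label of vertex 0 (for free)
    # and multiply the observed edge labels along the path.
    Yhat = [Y[0]]
    for x in X:
        Yhat.append(Yhat[-1] * x)
    # Hamming error, minimized over a global sign flip to be generous.
    err = sum(a != b for a, b in zip(Yhat, Y))
    return min(err, n - err)

print(propagation_error(), "errors vs. the p*n =", int(10_000 * 0.01), "benchmark")
```

Even with the root's true label given away, the propagation estimate typically misclassifies thousands of vertices here, versus pn = 100.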

Contributions

• Characterize the optimal recovery rates for trees.
• Lift the result to general graphs via tree decomposition.
• Non-trivial recovery rates for all connected graphs, including sparse graphs where recovery without side information is impossible.
• All rates are finite-sample and hold with high probability.
• All are achieved efficiently.

Key challenge

There is a huge body of work on solving / approximating MAP, MLE, etc., but how do we establish tight bounds on statistical performance?

Goal: Error = O(h(p)·n), with h(p) → 0 as p → 0.


Model

Introduced in [Globerson-Roughgarden-Sontag-Yildirim '15].

• Fixed graph G = (V, E), with |V| = n and |E| = m.
• Ground-truth labels Y ∈ {±1}^V.
• Observe noisy edge labels X ∈ {±1}^E:

$$X_{uv} = \begin{cases} Y_u Y_v & \text{with prob. } 1 - p, \\ -Y_u Y_v & \text{with prob. } p. \end{cases}$$

• Observe noisy vertex labels Z ∈ {±1}^V:

$$Z_u = \begin{cases} Y_u & \text{with prob. } 1 - q, \\ -Y_u & \text{with prob. } q. \end{cases}$$
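A minimal simulation of this observation model (our sketch; the graph and noise levels are illustrative):

```python
# Minimal sketch of the observation model: sample Y uniformly at random,
# then flip each edge measurement w.p. p and each vertex measurement w.p. q.
import random

def sample_instance(edges, n, p, q, rng):
    Y = [rng.choice([-1, 1]) for _ in range(n)]            # ground-truth labels
    X = {(u, v): Y[u] * Y[v] * (-1 if rng.random() < p else 1)
         for (u, v) in edges}                              # noisy edge labels
    Z = [y * (-1 if rng.random() < q else 1) for y in Y]   # noisy side information
    return Y, X, Z

def hamming_error(Yhat, Y):
    # E(Yhat): number of vertices where the estimate disagrees with the truth.
    return sum(a != b for a, b in zip(Yhat, Y))

rng = random.Random(0)
n = 6
edges = [(i, i + 1) for i in range(n - 1)]                 # a small path (a tree)
Y, X, Z = sample_instance(edges, n, p=0.1, q=0.3, rng=rng)
print(Y, Z, hamming_error(Z, Y))                           # Z alone is a poor estimate
```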



Goal: Obtain small Hamming error:

$$E(\hat{Y}) \triangleq \sum_{v \in V} \mathbf{1}\{\hat{Y}_v(X, Z) \neq Y_v\},$$

aka partial recovery.

Interested in:

• Finite-sample behavior of E(Ŷ) as a function of p:
  • Want α(G) such that E(Ŷ) = O(p^{α(G)} n).
  • Treat q = 1/2 − ϵ as constant; the Zs are very noisy.
  • Interested in the p = ω(1/n) regime.
• α(G) for deterministic classes of graphs.
• Sparse regime: Δ(G) = O(1).

Motivation

Graph inference

Basic problem: Recover latent node variables using noisy measurements on the edges of a graph G = (V, E).

• Community detection

• Inference for structured prediction (e.g. image segmentation)

• Alignment / registration / synchronization, correlation clustering, genome assembly, ... and many more!

Censored block model: [Abbe et al. '14], [Saade et al. '15].

See also [Globerson-Yildirim-Roughgarden-Sontag '15], [Chen et al. '15], [Joachims-Hopcroft '05].

Side Information

Our question: How do recovery prospects change with the addition of side information?

Tree algorithm: proof sketch

How to take advantage of Chernoff? W.h.p.,

$$\sum_{uv \in E} \mathbf{1}\{X_{uv} \neq Y_u Y_v\} \leq 2pn + O(1).$$

Define the hypothesis class:

$$F(X) \triangleq \Big\{ Y' \in \{\pm 1\}^V : \sum_{uv \in E} \mathbf{1}\{X_{uv} \neq Y'_u Y'_v\} \leq 2pn + O(1) \Big\}.$$

Then we have Y ∈ F(X) w.h.p., and |F(X)| ≈ O((1/p)^{pn}).

Strategy: Learn Y from F(X) using Z! (A brute-force rendering of this strategy appears below.)
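Here is that strategy spelled out as code (our sketch; it enumerates all 2^n labelings, so it only runs on toy instances, whereas the poster's algorithm achieves the same guarantee efficiently):

```python
# Brute-force sketch of "learn Y from F(X) using Z": restrict to labelings
# that disagree with the edge labels at most ~2pn times (the class F(X)),
# then pick the one that best matches the noisy vertex labels Z.
import random
from itertools import product

def erm_over_FX(n, edges, X, Z, p, slack=2):
    budget = 2 * p * n + slack                     # Chernoff-style budget
    best, best_risk = None, float("inf")
    for Yp in product([-1, 1], repeat=n):
        # Membership test for F(X).
        disagreements = sum(X[(u, v)] != Yp[u] * Yp[v] for (u, v) in edges)
        if disagreements > budget:
            continue
        # Empirical risk against the side information Z.
        risk = sum(Yp[v] != Z[v] for v in range(n))
        if risk < best_risk:
            best, best_risk = Yp, risk
    # F(X) contains Y w.h.p.; fall back to Z if the budget was too tight.
    return best if best is not None else tuple(Z)

# Toy usage on a small path graph.
rng = random.Random(1)
n, p, q = 8, 0.05, 0.2
edges = [(i, i + 1) for i in range(n - 1)]
Y = [rng.choice([-1, 1]) for _ in range(n)]
X = {e: Y[e[0]] * Y[e[1]] * (-1 if rng.random() < p else 1) for e in edges}
Z = [y * (-1 if rng.random() < q else 1) for y in Y]
Yhat = erm_over_FX(n, edges, X, Z, p)
print(sum(a != b for a, b in zip(Yhat, Y)), "vertex errors")
```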



Further examples: Hypergrids, lattices, Newman-Watts — see paper for more.
