Barron Fast Reliable Communication Presentation

29
Toward Fast Reliable Communication, at Rates Near Capacity with Gaussian Noise Andrew Barron, Antony Joseph Department of Statistics Yale University IEEE ISIT, Austin, TX, June 18, 2010 Barron, Joseph Toward Fast Reliable Communication 1/29

Transcript of Barron Fast Reliable Communication Presentation

Page 1: Barron Fast Reliable Communication Presentation

Toward Fast Reliable Communication,at Rates Near Capacity with Gaussian Noise

Andrew Barron, Antony JosephDepartment of Statistics

Yale University

IEEE ISIT, Austin, TX, June 18, 2010

Barron, Joseph Toward Fast Reliable Communication 1/29

Page 2: Barron Fast Reliable Communication Presentation

Shannon Formulation

Input bits: u = (u1, u2, . . . . . . , uK )

↓Encoded: x = (x1, x2, . . . , xn)

↓Channel: p(y |x)

↓Received: y = (y1, y2, . . . , yn)

↓Decoded: u = (u1, u2, . . . . . . , uK )

Rate: R = Kn Capacity C = max I(X ; Y )

Reliability: Want small Prob{u 6= u}and small Prob{Fraction mistakes ≥ α}

Barron, Joseph Toward Fast Reliable Communication 2/29

Page 3: Barron Fast Reliable Communication Presentation

Gaussian Noise Channel

Input bits: u = (u1, u2, . . . . . . , uK )

↓Encoded: x = (x1, x2, . . . , xn) ave 1

n∑n

i=1 x2i ≤ P

↓Channel: p(y |x) y = x + ε ε ∼ N(0, σ2I)

↓Received: y = (y1, y2, . . . , yn)

↓Decoded: u = (u1, u2, . . . . . . , uK )

Rate: R = Kn Capacity C = 1

2 log(1 + P/σ2)

Reliability: Want small Prob{u 6= u}and small Prob{Fraction mistakes ≥ α}

Barron, Joseph Toward Fast Reliable Communication 3/29

Page 4: Barron Fast Reliable Communication Presentation

Shannon Theory meets Coding Practice

The Gaussian noise channel is the basic model forwireless communicationradio, cell phones, television, satellite, spacewired communicationinternet, telephone, cable

Forney and Ungerboeck 1998 reviewmodulation, coding, and shaping for the Gaussian channel

Richardson and Urbanke 2008 cover much of the state ofthe art in the analysis of coding

There are fast encoding and decoding algorithms, withempirically good performance for LDPC and turbo codesSome tools for their theoretical analysis, but obstaclesremain for mathematical proof of these schemes achievingrates up to capacity for the Gaussian channel

Arikan 2009, Arikan and Teletar 2009polar codes not yet adapted to Gaussian channels

Our method is rather different and direct. Prior knowledgeof the above is not necessary to follow what we present.

Barron, Joseph Toward Fast Reliable Communication 4/29

Page 5: Barron Fast Reliable Communication Presentation

Sparse Superposition Code

Input bits: u = (u1 . . . . . . . . . . . . uK )

Coefficients: β = (00 ∗ 0000000000 ∗ 00 . . . 0 ∗ 000000)T

Sparsity: L entries non-zero out of NMatrix: X , n by N, all entries indep Normal(0, 1)

Codeword: Xβ, superposition of a subset of columnsReceive: y = Xβ + ε, a statistical linear modelDecode: β and u from X ,y

Barron, Joseph Toward Fast Reliable Communication 5/29

Page 6: Barron Fast Reliable Communication Presentation

Sparse Superposition Code

Input bits: u = (u1 . . . . . . . . . . . . uK )

Coefficients: β = (00 ∗ 0000000000 ∗ 00 . . . 0 ∗ 000000)T

Sparsity: L entries non-zero out of NMatrix: X , n by N, all entries indep Normal(0, 1)

Codeword: Xβ

Receive: y = Xβ + ε

Decode: β and u from X ,yRate: R = K

n from K = log(N

L

), near L log

(NL e

)

Barron, Joseph Toward Fast Reliable Communication 6/29

Page 7: Barron Fast Reliable Communication Presentation

Sparse Superposition Code

Input bits: u = (u1 . . . . . . . . . . . . uK )

Coefficients: β = (00 ∗ 0000000000 ∗ 00 . . . 0 ∗ 000000)T

Sparsity: L entries non-zero out of NMatrix: X , n by N, all entries indep Normal(0, 1)

Codeword: Xβ

Receive: y = Xβ + ε

Decode: β and u from X ,yRate: R = K

n from K = log(N

L

)Reliability: small Prob{Fraction β mistakes ≥ α}, small α

Barron, Joseph Toward Fast Reliable Communication 7/29

Page 8: Barron Fast Reliable Communication Presentation

Sparse Superposition Code

Input bits: u = (u1 . . . . . . . . . . . . uK )

Coefficients: β = (00 ∗ 0000000000 ∗ 00 . . . 0 ∗ 000000)T

Sparsity: L entries non-zero out of NMatrix: X , n by N, all entries indep Normal(0, 1)

Codeword: Xβ

Receive: y = Xβ + ε

Decode: β and u from X ,yRate: R = K

n from K = log(N

L

)Reliability: small Prob{Fraction β mistakes ≥ α}, small α

Outer RS code: rate 1−2α, corrects remaining mistakesOverall rate: Rtot = (1−2α)R

Barron, Joseph Toward Fast Reliable Communication 8/29

Page 9: Barron Fast Reliable Communication Presentation

Sparse Superposition Code

Input bits: u = (u1 . . . . . . . . . . . . uK )

Coefficients: β = (00 ∗ 0000000000 ∗ 00 . . . 0 ∗ 000000)T

Sparsity: L entries non-zero out of NMatrix: X , n by N, all entries indep Normal(0, 1)

Codeword: Xβ

Receive: y = Xβ + ε

Decode: β and u from X ,yRate: R = K

n from K = log(N

L

)Reliability: small Prob{Fraction β mistakes ≥ α}, small α

Outer RS code: rate 1−2α, corrects remaining mistakesOverall rate: Rtot = (1−2α)R.

Is it reliable with rate up to capacity?

Barron, Joseph Toward Fast Reliable Communication 9/29

Page 10: Barron Fast Reliable Communication Presentation

Partitioned Superposition CodeInput bits: u = (u1 . . . , . . . , . . . , . . . uK )

Coefficients: β=(00 ∗ 00000, 00000 ∗ 00, . . . , 0 ∗ 000000)

Sparsity: L sections, each of size B =N/L, a power of 2.1 non-zero entry in each section

Indices of nonzeros: (j1, j2, . . . , jL) directly specified by uMatrix: X , n by N, splits into L sectionsCodeword: Xβ

Receive: y = Xβ + ε

Decode: β and uRate: R = K

n from K = L log NL = L log B

may set B = n and L = nR/ log nReliability: small Prob{Fraction β mistakes ≥ α}Outer RS code: Corrects remaining mistakesOverall rate: up to capacity?

Barron, Joseph Toward Fast Reliable Communication 10/29

Page 11: Barron Fast Reliable Communication Presentation

Power Allocation

Coefficients: β=(00∗00000, 00000∗00, . . . , 0∗000000)

Indices of nonzeros: sent = (j1, j2, . . . , jL)

Coeff. values: βj` =√

P` for ` = 1, 2, . . . , L

Power control:∑L

`=1 P` = P

Codewords: Xβ, have average power P

Power Allocations

Constant power: P` = P/L

Variable power: P` proportional to u` = e−2C `/L

Variable with leveling: P` proportional to max{u`, cut}

Barron, Joseph Toward Fast Reliable Communication 11/29

Page 12: Barron Fast Reliable Communication Presentation

Power Allocation

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

0 10 20 30 40 50

0.00

00.

002

0.00

40.

006

0.00

80.

010

section index

pow

er a

lloca

tion

Power = 7L = 50

Barron, Joseph Toward Fast Reliable Communication 12/29

Page 13: Barron Fast Reliable Communication Presentation

Contrast Two Decoders

Decoders using received y = Xβ + ε

Optimal: Least Squares Decoder

β = argmin‖Y − Xβ‖2

minimizes probability of error with uniform input distributionreliable for all R < C, with best form of error exponent

Practical: Adaptive Successive Decoder

fast decoderreliable using variable power allocation for all R < C

Barron, Joseph Toward Fast Reliable Communication 13/29

Page 14: Barron Fast Reliable Communication Presentation

Adaptive Successive Decoder

Decoding Steps

Start: [Step 1]Compute the inner product of Y with each column of XSee which are above a thresholdForm initial fit as weighted sum of columns above threshold

Iterate: [Step k ≥ 2]Compute the inner product of residuals Y − Fitk−1 witheach remaining column of XSee which are above thresholdAdd these columns to the fit

Stop:At Step k = log B, orif there are no inner products above threshold

Barron, Joseph Toward Fast Reliable Communication 14/29

Page 15: Barron Fast Reliable Communication Presentation

Quantities in the Iterative Decoder

Intialization: res1 = Y and J1 = {1, 2, . . . , N}, with N = LB

Loop:Residual: resk = Y − Fitk−1

Test Stat: Zk ,j = X Tj resk/‖resk‖

Threshold: τ =√

2 log B + a

Detections: 1Hk,j = 1{Zk,j≥τ}

Fit Update: Fitk = Fitk−1 +∑

j∈Jk

√Pj Xj1Hk,j

Remaining: Jk+1 = { j ∈ Jk : Zk ,j < τ}

Barron, Joseph Toward Fast Reliable Communication 15/29

Page 16: Barron Fast Reliable Communication Presentation

Tracking Progress

Messagesent = (j1, j2, . . . , jL)

False Alarms

Increment: fk =∑

j∈Jk∩(not sent) πj 1Hk,j

Total: f1,k = f1 + f2 + . . . + fk

Correct DetectionsIncrement: qk =

∑j∈Jk∩sent πj 1Hk,j

Total: q1,k = q1 + q2 + . . . + qk

Weightsπj = Pj/Pwhere Pj is the power allocated to the section containing j

Barron, Joseph Toward Fast Reliable Communication 16/29

Page 17: Barron Fast Reliable Communication Presentation

Decoding Progression

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

x

●● ●●

gL((x))x

B = 216, L == Bsnr == 15C = 2 bitsR = 1.04 bits (0.52C )No. of steps = 18

Figure: Plot of likely progression of weighted fraction of correctdetections q1,k , for snr = 15.

Barron, Joseph Toward Fast Reliable Communication 17/29

Page 18: Barron Fast Reliable Communication Presentation

Decoding Progression

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

x

● ●●

gL((x))x

B = 216, L == Bsnr == 1C = 0.5 bitsR = 0.31 bits (0.62C )No. of steps = 7

Figure: Plot of of likely progression of weighted fraction of correctdetections q1,k , for snr = 1.

Barron, Joseph Toward Fast Reliable Communication 18/29

Page 19: Barron Fast Reliable Communication Presentation

Rate and Reliability

Optimal: Least squares decoder of sparse superposition codeProb error exponentially small in n for small ∆=C−R >0

Prob{Error} ≤ e−n(C−R)2/2V

In agreement with the Shannon-Gallager optimal exponent,though with possibly suboptimal V depending on the snr

Practical: Adaptive Successive DecoderAchieves rates up to RB approaching capacity

RB =C

1 + c1/ log B

Probability exponentially small in L for R ≤ RB

Prob{

fraction mistakes ≥ const/ log B}≤ e−L(C−R)c2

It nearly matches the optimal error probability at R = RB

Barron, Joseph Toward Fast Reliable Communication 19/29

Page 20: Barron Fast Reliable Communication Presentation

Rate and Reliability, Clarification

Optimal: Least squares decoder of sparse superposition codeProb error exponentially small in n for gap ∆ = C − R > 0

Prob{Error} ≤ e−n ∆2/2V

In agreement with the optimal form of exponent

Practical: Adaptive Successive Decoder and outer RS codeAt R = RB = C/(1 + c1/ log B) with gap ∆ near c1C/ log B

Probability of error exponentially small in L = nR/ log B

Prob{Error} ≤ e−L ∆c2 ∼ e−c1c2C n/(log B)2= e−c2 n ∆2/(c1C)

It nearly matches the optimal error probability

Barron, Joseph Toward Fast Reliable Communication 20/29

Page 21: Barron Fast Reliable Communication Presentation

Rate and Reliability, Clarification

Optimal: Least squares decoder of sparse superposition codeProb error exponentially small in n for gap ∆ = C − R > 0

Prob{Error} ≤ e−n ∆2/2V

In agreement with the optimal form of exponent

Practical: Adaptive Successive Decoder and outer RS codeAt R = RB = C/(1 + c1/ log B) with gap ∆ near c1C/ log B

Probability of error exponentially small in L = nR/ log B

Prob{Error} ≤ e−L ∆c2 ∼ e−c1c2C n/(log B)2= e−c2 n ∆2/(c1C)

It nearly matches the optimal error probability, to within aloglog factor in the exponentOur c1 is near 1.5 log log B + 4C

Barron, Joseph Toward Fast Reliable Communication 21/29

Page 22: Barron Fast Reliable Communication Presentation

Upper Bounding False Alarms

False Alarms

Increment: fk =∑

j∈Jk∩(not sent) πj 1Hk,j

Upper bound:∑

j not sent πj 1Hk,j

Expectation: f ∗

Target level: f ∗ less than const/(log B)2

Total: f1,k = f1 + f2 + . . . + fk

UB expectation: kf ∗

Reliability: f1,k less than kf with high prob for f > f ∗

Total bound: const/ log B with #steps k of order log B

Barron, Joseph Toward Fast Reliable Communication 22/29

Page 23: Barron Fast Reliable Communication Presentation

Lower Bounding Correct Detections

Correct DetectionsIncrement: qk =

∑j∈Jk∩sent πj 1Hk,j

Total: q1,k = q1 + q2 + . . . + qk

Equivalent:∑

j∈sent πj 1H1,j∪H2,j∪...∪Hk,j

Lower Bound:∑

j∈sent πj 1Hk,j

LB Expectation: q∗1,k

Reliability: q1,k > q1,k with high prob for q1,k < q∗1,k

Barron, Joseph Toward Fast Reliable Communication 23/29

Page 24: Barron Fast Reliable Communication Presentation

Lower Bounding Correct Detections

Correct DetectionsIncrement: qk =

∑j∈Jk∩sent πj 1Hk,j

Total: q1,k = q1 + q2 + . . . + qk

Equivalent:∑

j∈sent πj 1H1,j∪H2,j∪...∪Hk,j

Lower Bound:∑

j∈sent πj 1Hk,j

LB Expectation: q∗1,k

Reliability: q1,k > q1,k with high prob for q1,k < q∗1,k

Recursive: q∗1,k = g(q1,k−1 − f1,k−1

)f1,k = kf bound on likely false alarms, from preceding slideg(x) shown to exceed x by at least const/ log B for R ≤ RBg(x) evaluated at xk−1 = q1,k−1 − f1,k−1 yields xklikely lower bound on correct detectionsreaches xk ≥ 1− const/ log B in order log B steps

Barron, Joseph Toward Fast Reliable Communication 24/29

Page 25: Barron Fast Reliable Communication Presentation

Decoding progression, example bounds

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

x

●● ●●

gL((x))x

B = 216, L == Bsnr == 15C = 2 bitsR = 1.04 bits (0.52C )No. of steps = 18

Figure: Plot of g(x) and the sequence xk for snr = 15, with variablepower allocation. The threshold uses a = 0.86. The final false alarmand failed detection rates are less than 0.026 and 0.013 respectively,with probability of at least that fraction of mistakes less than 0.002.

Barron, Joseph Toward Fast Reliable Communication 25/29

Page 26: Barron Fast Reliable Communication Presentation

Decoding Progression

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

x

● ●●

gL((x))x

B = 216, L == Bsnr == 1C = 0.5 bitsR = 0.31 bits (0.62C )No. of steps = 7

Figure: Plot of g(x) and the sequence xk for snr = 1, with constantpower allocation. The threshold uses a = 0.56. The final false alarmand failed detection rates are 0.026 and 0.053 respectively, withprobability bound 0.0007.

Barron, Joseph Toward Fast Reliable Communication 26/29

Page 27: Barron Fast Reliable Communication Presentation

Summary

Sparse superposition coding is fast and reliable at rates upto channel capacity

Formulation and analysis blends modern statisticalregression and information theory

Barron, Joseph Toward Fast Reliable Communication 27/29

Page 28: Barron Fast Reliable Communication Presentation

Rate versus Section Size

0.0

0.5

1.0

1.5

2.0

B

R (

bits

/ch.

use

)

● ●●

26 28 210 212 214 216

detailed envelopecurve when L == Bcurve using simulation runscapacitysnr == 15

Figure: Rate as a function of B for snr =15 and error probability 10−3.

Barron, Joseph Toward Fast Reliable Communication 28/29

Page 29: Barron Fast Reliable Communication Presentation

Rate versus Section Size

0.0

0.1

0.2

0.3

0.4

0.5

B

R (

bits

/ch.

use

)

26 28 210 212 214 216

detailed envelopecurve when L == Bcurve using simulation runscapacitysnr == 1

Figure: Rate as a function of B for snr = 1 and error probability 10−3.

Barron, Joseph Toward Fast Reliable Communication 29/29