Estimation Theory – Fredrik Rusek

Chapter 12 – Linear Bayesian Estimation

Summary of chapters 10 and 11

• Bayesian estimators inject prior information into the estimation

• Concepts from classical estimation break down:

– MVU

– Efficient estimator

– Unbiasedness

• The performance measure changes: variance -> Bayesian MSE (Bmse)

• The optimal estimator for the Bmse is E(θ|x): the MMSE estimator

• The MMSE estimator is difficult to obtain since:

– The posterior p(θ|x) is hard to find

– Even if p(θ|x) is available, E(θ|x) is still difficult due to the integral

• Conjugate priors simplify finding p(θ|x): the posterior has the same distribution as the prior (with other parameters). Useful when the posterior acts as prior in a sequential estimation process.

• Risk functions other than the Bmse exist:

– MAP estimation is the solution to the hit-or-miss risk

– The conditional median is the solution to the absolute-error (linear) risk

• Invariance does not hold for MAP

• Bayesian estimators can be used for deterministic parameters, but work well only for parameter values that are close to the prior mean

Chapter 12 – Linear Bayesian Estimation

When an optimal Bayesian estimator is hard to find, we can resort to a linear estimator.

This is the same approach as the BLUE, but we now make the following changes:

Remove the unbiasedness constraint

Change the cost function from variance to Bmse

An optimal estimator within this class is termed the linear minimum mean square error (LMMSE) estimator.

Chapter 12 – Linear Bayesian Estimation

Finding the LMMSE estimator

Cost function: the Bmse of an estimator formed as a linear combination of x[0], …, x[N-1] plus a constant term a_N.

Take differentials with respect to a_N and set them to zero.

Plug a_N back into the cost function.

Assemble into vector notation; expanding the square generates 4 terms.

Observe: a is not random, so it can be moved outside the expectation operator.

Collect the results using vector notation.

Computing the Bmse cost: the minimum Bmse is the Schur complement of C_xx in the joint covariance matrix of (θ, x).
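The expressions themselves are not reproduced in the transcribed slides; as a sketch, assuming Kay's covariance notation (C_xx the covariance matrix of x, C_θx the cross-covariance of θ and x), the result the derivation arrives at is

$$\hat{\theta} = E(\theta) + C_{\theta x} C_{xx}^{-1}\bigl(x - E(x)\bigr), \qquad \mathrm{Bmse}\bigl(\hat{\theta}\bigr) = C_{\theta\theta} - C_{\theta x} C_{xx}^{-1} C_{x\theta},$$

the second expression being the Schur complement just mentioned.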

Chapter 12 – Linear Bayesian Estimation

Connections

We have seen the expression for the Bmse before.

x and θ jointly Gaussian: the LMMSE estimator coincides with the MMSE estimator (LMMSE = MMSE), so the two have the same Bmse.

x and θ not jointly Gaussian: the LMMSE estimator and its Bmse are unchanged (they depend only on the first two moments), while the MMSE estimator attains a Bmse that is better than (at most equal to) that of the LMMSE.

The LMMSE thus yields the same Bmse as if the variables were jointly Gaussian.

Chapter 12 – Linear Bayesian Estimation

Example 12.1

Bayesian options: 1. MMSE. Not possible in closed form. 2. MAP. Possible: the truncated sample mean. 3. LMMSE. Doable.

All means are zero.

Observations:
1. With no prior, the sample mean is the MVU estimator.
2. The LMMSE is a tradeoff between the prior mean and the sample mean (the MVU).
3. We did not use the fact that A is uniform; only its mean and variance come in.
4. We do not need A and w to be independent, only uncorrelated.
5. No integration is needed.
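Kay's Example 12.1 is the DC level in noise, x[n] = A + w[n], with A uniform on [−A0, A0] and w[n] white noise of variance σ²; the transcript leaves the model and the final expression out, so the sketch below restates them under that assumption.

```python
import numpy as np

# Sketch of Example 12.1 (assumed model): x[n] = A + w[n],
# A ~ U[-A0, A0] (so E[A] = 0, var(A) = A0**2 / 3), w[n] white Gaussian with variance sigma2.
rng = np.random.default_rng(0)

A0, sigma2, N = 1.0, 0.5, 20
varA = A0**2 / 3.0

A = rng.uniform(-A0, A0)                            # the random DC level
x = A + rng.normal(0.0, np.sqrt(sigma2), N)         # observed data

xbar = x.mean()                                     # sample mean = MVU estimate (no prior)
A_lmmse = varA / (varA + sigma2 / N) * xbar         # LMMSE: shrinks the sample mean toward the prior mean 0

bmse = varA * (sigma2 / N) / (varA + sigma2 / N)    # minimum Bmse of the LMMSE
print(A, xbar, A_lmmse, bmse)
```

When σ_A² = A0²/3 is large relative to σ²/N, the estimate approaches the sample mean; when the prior is tight, it shrinks toward the prior mean 0, which is the tradeoff referred to in observation 2.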

Chapter 12 – Linear Bayesian Estimation

Extension to vector parameter

We can work with each parameter individually. From before, we have the scalar LMMSE for each component; collecting these in vector notation gives the vector LMMSE estimator and its Bmse matrix.
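A minimal numerical sketch of the vector form, assuming the standard expressions θ̂ = E(θ) + C_θx C_xx⁻¹ (x − E(x)) and Bmse matrix M = C_θθ − C_θx C_xx⁻¹ C_xθ (function and variable names below are illustrative):

```python
import numpy as np

def lmmse(mu_theta, mu_x, C_tt, C_tx, C_xx, x):
    """Vector LMMSE estimate and its Bmse (error covariance) matrix.

    mu_theta = E(theta), mu_x = E(x), C_tt = cov(theta),
    C_tx = cov(theta, x), C_xx = cov(x); x is the observed data vector.
    """
    K = C_tx @ np.linalg.inv(C_xx)          # gain matrix C_tx C_xx^{-1}
    theta_hat = mu_theta + K @ (x - mu_x)   # LMMSE estimate
    M = C_tt - K @ C_tx.T                   # Bmse matrix; diag(M) = per-parameter Bmse
    return theta_hat, M

# Example with more parameters (2) than observations (1) -- allowed in the Bayesian setting:
th, M = lmmse(np.zeros(2), np.zeros(1), np.eye(2),
              np.array([[0.5], [0.3]]), np.array([[1.5]]), np.array([1.0]))
print(th, np.diag(M))
```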

Chapter 12 – Linear Bayesian Estimation

Three properties
1. Invariance holds for affine transformations.
2. The LMMSE of a sum is the sum of the LMMSEs.
3. The number of observations can be less than the number of parameters to estimate – a significant difference from linear classical estimation.
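Written out (the exact statements are not in the transcript; this is the usual formulation), properties 1 and 2 read

$$\alpha = A\theta + b \;\Rightarrow\; \hat{\alpha} = A\hat{\theta} + b, \qquad \alpha = \theta_1 + \theta_2 \;\Rightarrow\; \hat{\alpha} = \hat{\theta}_1 + \hat{\theta}_2.$$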

Chapter 12 – Linear Bayesian Estimation

With … we have …
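The slide's expressions are missing from the transcript; a hedged guess, assuming this is the Bayesian Gauss-Markov step (linear data model x = Hθ + w, with w zero mean, covariance C_w, and uncorrelated with θ), is

$$\hat{\theta} = \mu_\theta + C_{\theta\theta} H^T \bigl(H C_{\theta\theta} H^T + C_w\bigr)^{-1}\bigl(x - H\mu_\theta\bigr), \qquad M_{\hat\theta} = C_{\theta\theta} - C_{\theta\theta} H^T \bigl(H C_{\theta\theta} H^T + C_w\bigr)^{-1} H C_{\theta\theta}.$$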

Chapter 12 – Linear Bayesian Estimation

Wiener filtering

Data: x[0], x[1], x[2], … is WSS with zero mean. Its covariance matrix is the autocorrelation matrix R_xx (Toeplitz structure).

Parameter to be estimated: s[n], zero mean, with x[n] = s[n] + w[n].

Chapter 12 – Linear Bayesian Estimation

Wiener filtering – four cases

Filtering: find s[n] given x[0], …, x[n]

Smoothing: find s[n] given x[0], …, x[N]

Prediction: find s[n-1+l] given x[0], …, x[n-1]

Interpolation: find x[n] given x[0], …, x[n-1], x[n+1], …

Chapter 12 – Linear Bayesian Estimation

Wiener filtering – the filtering problem

Signal and noise are uncorrelated. Since the means are zero, we know that the data covariance is C_xx = R_ss + R_ww and that the cross-covariance between s[n] and the data is built from the signal autocorrelation. Plugging these into the general LMMSE expression gives the weight vector a.
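As a concrete sketch of this computation (the AR(1) signal autocorrelation below is an illustrative assumption, not taken from the slides):

```python
import numpy as np
from scipy.linalg import toeplitz

# Estimate s[n] from x[0..n], where x[k] = s[k] + w[k], s and w uncorrelated, zero mean.
n = 9                                 # use x[0], ..., x[n]
rho, sig_s2, sig_w2 = 0.9, 1.0, 0.5   # assumed AR(1) signal and white-noise parameters

k = np.arange(n + 1)
r_ss = sig_s2 * rho**k                # signal ACF r_ss[k]
Rss = toeplitz(r_ss)                  # (n+1) x (n+1) signal covariance
Rww = sig_w2 * np.eye(n + 1)          # white-noise covariance

Cxx = Rss + Rww                       # data covariance
c_sx = r_ss[::-1]                     # E{ s[n] * [x[0], ..., x[n]]^T } = [r_ss[n], ..., r_ss[0]]

a = np.linalg.solve(Cxx, c_sx)        # LMMSE weights: s_hat[n] = a^T [x[0], ..., x[n]]
bmse = sig_s2 - c_sx @ a              # minimum Bmse for this n
print(a, bmse)
```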

Chapter 12 – Linear Bayesian Estimation

Wiener filtering – why is it a filtering?

To find s[n] from the observations x[n], x[n-1], x[n-2], … we form a weighted sum, e.g. ŝ[n] = a0·x[n] + a1·x[n-1] + a2·x[n-2], so the weights {ak} can be seen as an FIR filter.

However, at the next time instant the weights {ak} are not the same (edge effect): to estimate s[n], we filter the most recent observations with a filter that depends on n, i.e. the filter is time-variant (a is computed for a given n, which is not shown explicitly in the notation).

Wiener filtering

To estimate s[n], we filter the recent observations with a filter that is dependent on n The filter is time-variant

We get,

Observe

is a but flipped upside-down

Chapter 12 – Linear Bayesian Estimation

Wiener filtering

Recall We get,

Observe

is a but flipped upside-down

Chapter 12 – Linear Bayesian Estimation

Wiener filtering

Recall We get,

Observe

is a but flipped upside-down

Chapter 12 – Linear Bayesian Estimation

Wiener filtering

Recall We get,

Observe

is a but flipped upside-down

Chapter 12 – Linear Bayesian Estimation

Wiener filtering

Chapter 12 – Linear Bayesian Estimation

Wiener filtering

Wiener-Hopf filtering equations: collecting the relations above gives a Toeplitz system of equations for the filter coefficients h(n)[k].

These equations can be solved recursively by the Levinson algorithm. Observe: the matrix is Toeplitz, but it cannot be approximated as circulant as n grows. Therefore, Szegö theory does not apply.

As n grows, the filter converges to a stationary solution.

To find h[n], we can apply spectral factorization (the same method as is used to find a minimum-phase version of a filter).
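A small sketch of solving the Wiener-Hopf filtering equations with a Levinson-based Toeplitz solver (the signal model is again the illustrative AR(1) assumption):

```python
import numpy as np
from scipy.linalg import solve_toeplitz, toeplitz

# Wiener-Hopf filtering equations:  sum_k h[k] r_xx[l-k] = r_ss[l],  l = 0, ..., n.
n = 9
rho, sig_s2, sig_w2 = 0.9, 1.0, 0.5     # assumed AR(1) signal plus white noise

k = np.arange(n + 1)
r_ss = sig_s2 * rho**k                  # signal ACF
r_xx = r_ss.copy()
r_xx[0] += sig_w2                       # white noise only adds to lag 0

h = solve_toeplitz(r_xx, r_ss)          # Levinson-Durbin recursion under the hood
h_direct = np.linalg.solve(toeplitz(r_xx), r_ss)   # sanity check by direct solve
print(np.allclose(h, h_direct))         # True
```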

Chapter 12 – Linear Bayesian Estimation

Wiener smoothing

Now consider asymptotic Wiener smoothing: s[n] is estimated from past, present and future observations. We can still express the estimation of s[n] as a filtering of {x[k]}, but the filter is not causal.

(Figure: filter support over the lag k in the filtering setup versus the smoothing setup. Eq. (12.61): typo in the book, the l.)

No edge effect in the smoothing setup! The problem can be solved by approximating Rxx as a circulant matrix (Szegö theory).
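A sketch of what the circulant (Szegö) approximation gives: in the frequency domain the smoother becomes W(f) = P_ss(f) / (P_ss(f) + P_ww(f)), applied here via the FFT; the AR(1) signal PSD and the noise level are illustrative assumptions.

```python
import numpy as np

# Asymptotic (non-causal) Wiener smoother via the circulant/FFT approximation.
N = 512
rho, sig_s2, sig_w2 = 0.9, 1.0, 0.5

f = np.fft.fftfreq(N)                                   # discrete frequencies
Pss = sig_s2 * (1 - rho**2) / np.abs(1 - rho * np.exp(-2j * np.pi * f))**2
Pww = sig_w2 * np.ones(N)

W = Pss / (Pss + Pww)                                   # Wiener smoothing weight per frequency bin

rng = np.random.default_rng(1)
s = np.zeros(N)
for nn in range(1, N):                                  # AR(1) signal realization
    s[nn] = rho * s[nn - 1] + rng.normal(0, np.sqrt(sig_s2 * (1 - rho**2)))
x = s + rng.normal(0, np.sqrt(sig_w2), N)               # noisy observations

s_hat = np.real(np.fft.ifft(W * np.fft.fft(x)))         # circular smoothing of the data
print(np.mean((s - s_hat)**2), np.mean((s - x)**2))     # smoothed error vs raw-data error
```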

Chapter 12 – Linear Bayesian Estimation

Wiener prediction

Start from the filtering equations: filtering "predicts" s[n] using data up to and including x[n]. In prediction, x[n] is not available, so there is no h[0] coefficient, and we are also predicting l steps into the future. This gives the Wiener-Hopf prediction equations; for l = 1 we obtain the Yule-Walker equations. They are solved by the Levinson recursion or by spectral factorization (not Szegö theory).
Wiener prediction

Filtering equations….Filtering ”predicts” s[n] given x[n] In prediction x[n] is not available, therefore there is no h[0] coefficient. We are also predicting l steps into the future Wiener-Hopf prediction equations. For l=1 we obtain the Yule-Walker equations Solved by Levinson recursion or spectral factorization (not Szegö Theory)