nonlinear signal processing eleg 833

35
Nonlinear Signal Processing ELEG 833 Gonzalo R. Arce Department of Electrical and Computer Engineering University of Delaware [email protected] Fall 2008 Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 1 / 35

Upload: others

Post on 22-Nov-2021

14 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Nonlinear Signal Processing ELEG 833

Nonlinear Signal Processing

ELEG 833

Gonzalo R. Arce

Department of Electrical and Computer EngineeringUniversity of [email protected]

Fall 2008

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 1 / 35

Page 2: Nonlinear Signal Processing ELEG 833

Outline

1 IntroductionNon-Gaussian Random Processes

Generalized Gaussian Distributions and Weighted MediansStable Distributions and Weighted Myriads

Statistical FoundationsThe Filtering Problem

Moment Theory

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 2 / 35

Page 3: Nonlinear Signal Processing ELEG 833

Introduction

Signal processing embodies a large set of methods for:

Representation: Medical imaging

Analysis: Wavelet, spectral analysis

Transmission: Wireless, Ethernet, ADSL

Restoration of information-bearing signals: Digital imaging, digital video.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 3 / 35

Page 4: Nonlinear Signal Processing ELEG 833

Introduction

Linear signal processing enjoys the rich theory of linear systems. Linear filters arealso simple to implement.

(b)

(a)

Figure: Frequency selective filtering: (a) chirp signal, (b) linear FIR filter output.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 4 / 35

Page 5: Nonlinear Signal Processing ELEG 833

Introduction

Why Nonlinear Signal Processing

The underlying processes of interest may tend to produce outliers notpredicted by a Gaussian model.

The signal density functions may have tails that decay at rates lower thanthose of a Gaussian distribution.

Linear methods obeying the superposition principle suffer serious degradationupon the arrival of samples corrupted with high-amplitude noise.

Nonlinear methods exploit the statistical characteristics of the noise toovercome the limitations of traditional signal processing.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 5 / 35

Page 6: Nonlinear Signal Processing ELEG 833

Introduction

Consider again the bandpass filtering example using a chirp signal degraded bynon-Gaussian noise. The linear FIR filter output is severely degraded.

(a)

(b)(b)

Figure: Frequency selective filtering in non-Gaussian noise: (a) linear FIR filter output,(b) nonlinear filter

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 6 / 35

Page 7: Nonlinear Signal Processing ELEG 833

Introduction

800 850 900 950 10000.2

0.3

0.4

0.5

0.6

(a)800 850 900 950 1000

0.23

0.24

0.25

0.26

0.27

0.28

0.29

0.3

(b)

300 350 400 450 5000.2

0.3

0.4

0.5

0.6

(c)500 550 600 650 700

0.1

0.11

0.12

0.13

0.14

0.15

0.16

(d)

Figure: RTT time series measured in seconds between a host at the University of Delawareand hosts in (a) Australia (12:18 AM - 3:53 AM); (b) Sydney, Australia (12:30 AM - 4:03 AM);(c) Japan (2:52 PM - 6:33 PM); (d) London, UK (10:00 AM - 1:35 PM). All plots shown in 1minute interval samples.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 7 / 35

Page 8: Nonlinear Signal Processing ELEG 833

Introduction

0 2000 4000 6000 8000 10000 12000 140000

5000

10000

Time (s)

KB

10 s Data Counts

0

2000

4000

KB

1 s Data Counts

0

500

1000

1500K

B

100ms Data Counts

0

100

200

KB

10 ms Data Counts

Figure: Byte counts measured over 14000 seconds in a web server of the ECEDepartment at the University of Delaware viewed through different aggregation intervals:from top to bottom, 10ms, 100ms 1s, 10s.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 8 / 35

Page 9: Nonlinear Signal Processing ELEG 833

Introduction

Figure: (a-c) Different noise and interference characteristics in ADSL lines. A linearand a nonlinear filter (recursive median filter) are used to overcome the channellimitations, both with the same window size.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 9 / 35

Page 10: Nonlinear Signal Processing ELEG 833

Introduction

Figure: JPEG compressed image

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 10 / 35

Page 11: Nonlinear Signal Processing ELEG 833

Introduction

Figure: Output of unsharp masking using FIR filters

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 11 / 35

Page 12: Nonlinear Signal Processing ELEG 833

Introduction

Figure: Output of median sharpeners.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 12 / 35

Page 13: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

Outline

1 IntroductionNon-Gaussian Random Processes

Generalized Gaussian Distributions and Weighted MediansStable Distributions and Weighted Myriads

Statistical FoundationsThe Filtering Problem

Moment Theory

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 13 / 35

Page 14: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

Non-Gaussian Random Processes

The Gaussian model appears naturally in many applications as a result of theCentral Limit Theorem.

Theorem (Central Limit Theorem)

Let X1,X2, · · · , be a sequence of i.i.d. random variables with zero mean and

variance σ2. Then as N → ∞, the normalized sum

SN =1√N

N∑

i=1

Xi (1)

converges almost surely to a zero-mean Gaussian variable with the same variance

as Xi .

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 14 / 35

Page 15: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

In a wide range of applications, the Gaussian model does not produce a goodfit which, at first, may seem to contradict the principles behind the centrallimit theorem.

In order for the CLT to be valid, the variance of the superimposed randomvariables must be finite.

If the random variables possess infinite variance, it can be shown that theseries in the central limit theorem converges to a non-Gaussian impulsivevariable.

The generalized central limit theorem explains the presence ofnon-Gaussian, infinite variance processes, in practical problems.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 15 / 35

Page 16: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

In this course we will considers two model families that encompass a large class ofrandom processes with different tail characteristics:

generalized Gaussian distribution

stable distributions

The tail of a distribution can be measured by the mass of the tail (order), definedas Pr(X > x) as x → ∞.

−5 −4 −3 −2 −1 0 1 2 3 4 5−0.05

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

Pr(X>x)

x

Figure: Mass of the tail of a Gaussian distribution

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 16 / 35

Page 17: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

Generalized Gaussian Distributions and Weighted

Medians

A first approach starts with a Gaussian distribution and allow the exponential rateof tail decay to be a free parameter. The result is the generalized Gaussian densityfunction.

3 3.5 4 4.5 5 5.5 6−5

0

5

10

15

20

x 10−3

x

fk(x

)

Tails of the generalized Gaussian density functions

k = 2.0

k = 1.5

k = 1.0

k = 0.5

Figure: Tails of the Generalized Gaussian density functions for different k (Rate of taildecay).

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 17 / 35

Page 18: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

Of special interest is the Laplacian distribution.

Weighted median filters are optimal for samples obeying Laplacian statistics.

In general, weighted median filters are more efficient than linear filters inimpulsive environments.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 18 / 35

Page 19: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

Stable Distributions and Weighted Myriads

A wide variety of processes exhibit statistics more impulsive than exponential thatare characterized with algebraic tailed distributions:

Radar clutter is the sum of signal reflections from irregular surfaces.

Transmitters in a multiuser communication system generate smallindependent signals, the sum of which reaches a user’s receiver.

Rotating electric machinery generates impulses caused by contact betweendistinct parts of the machine.

Standard atmospheric noise is the superposition of electrical dischargescaused by lightning activity around the Earth.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 19 / 35

Page 20: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

The justification for using stable distribution models lies in the generalized centrallimit theorem which includes the well known “traditional” CLT as a special case.

A random variable X is stable if it can be the limit of a normalized sumof i.i.d. random variables.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 20 / 35

Page 21: Nonlinear Signal Processing ELEG 833

Introduction Non-Gaussian Random Processes

UNIFORM CAUCHY

20 40 60 80 100 120 140 160 180 200

−0.50

0.5

20 40 60 80 100 120 140 160 180 200

−202

20 40 60 80 100 120 140 160 180 200−4−2

024

20 40 60 80 100 120 140 160 180 200−4−2

024

20 40 60 80 100 120 140 160 180 200

−20246

20 40 60 80 100 120 140 160 180 200−50

0

50

20 40 60 80 100 120 140 160 180 200

0

100

200

20 40 60 80 100 120 140 160 180 200

0

100

200

20 40 60 80 100 120 140 160 180 200−300−200−100

0100

20 40 60 80 100 120 140 160 180 2000

100020003000

Figure: Traditional Vs. generalized CLT. The plots show the normalized sum of 1, 2, 3,10 and 30 uniform(-1,1) or Cauchy(0,1) random variables

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 21 / 35

Page 22: Nonlinear Signal Processing ELEG 833

Introduction Statistical Foundations

Outline

1 IntroductionNon-Gaussian Random Processes

Generalized Gaussian Distributions and Weighted MediansStable Distributions and Weighted Myriads

Statistical FoundationsThe Filtering Problem

Moment Theory

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 22 / 35

Page 23: Nonlinear Signal Processing ELEG 833

Introduction Statistical Foundations

Statistical Foundations

Estimation theory is a branch of statistics.Goal: Derive information of a random processes from a set of observed samples.

Given an observation {X (n)}, the goal is to extract the informationembedded within the observed signal.

In general some parameter β of the signal represents the information ofinterest. This parameter may be the local mean, variance, the local range,etc.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 23 / 35

Page 24: Nonlinear Signal Processing ELEG 833

Introduction Statistical Foundations

Location Estimation

Observed signals are random variables described by a probability density function(pdf), f (x1, x2, · · · , xN).

The pdf is usually parameterized by an unknown parameter β.

β defines a class of pdfs where each member is defined by a particular value.

0 50 100 150 200 250 300−1

0

1

2

3

4

5

6

7

8

Figure: A Gaussian noise sample.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 24 / 35

Page 25: Nonlinear Signal Processing ELEG 833

Introduction Statistical Foundations

Example

If our signal consists of a single point (N = 1) and β is the mean, the pdf of thedata under the Gaussian model is

f (x1;β) =1√

2πσ2exp

[

− 1

2σ2(x1 − β)2

]

(2)

Figure: Estimation of parameter β based on the observation X1.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 25 / 35

Page 26: Nonlinear Signal Processing ELEG 833

Introduction Statistical Foundations

The value of β affects the probability of X1. We should be able to infer thevalue of β from the observed value of X1

Notice that β determines the location of the pdf. As such, β is referred to asthe location parameter.

Rules that infer the value of β from sample realizations of the data areknown as location estimators.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 26 / 35

Page 27: Nonlinear Signal Processing ELEG 833

Introduction Statistical Foundations

Running Smoothers

The running mean is the simplest form of filtering. Given the sequence{· · · ,X (n − 1),X (n),X (n + 1), · · · }, it is defined as

Y (n) = MEAN(X (n − N),X (n − N + 1), · · · ,X (n + N)). (3)

The output is the average of the samples within a window centered at n.

At each point the running mean computes a location estimator: the samplemean

If the underlying signals are not Gaussian, the mean should be replaced by amore appropriate location estimator such as the running median.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 27 / 35

Page 28: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

Outline

1 IntroductionNon-Gaussian Random Processes

Generalized Gaussian Distributions and Weighted MediansStable Distributions and Weighted Myriads

Statistical FoundationsThe Filtering Problem

Moment Theory

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 28 / 35

Page 29: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

The Filtering Problem

A filter can have arbitrary input and output signals and consequently it is found ina wide range of disciplines including economics, engineering, and biology.Denote a random sequence as {X} and let X(n) ⊆ RN be a N-long elementobservation vector

X(n) = [X (n), X (n − 1), · · · ,X (n − N + 1)]T

= [X1(n), X2(n), · · · , XN(n)]T (4)

where Xi (n) = X (n − i + 1). If the vector X(n) is related to a desired signal D(n),the output of the filter is D̂(n).

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 29 / 35

Page 30: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

The optimal filtering problem thus reduces to minimizing the cost functionassociated with the error e(n) under an error criterion.

+

(n) D(n)

e(n)

D(n)

Σ

^

FilterX

Figure: Filtering as a joint process estimation

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 30 / 35

Page 31: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

Under Gaussian statistics, the estimation becomes linear and the filters reduce toFIR linear filters. The linear filter output is defined as

Y (n) = MEAN(W1 · X1(n),W2 · X2(n), · · · ,WN · XN(n)), (5)

where the W ′

i s are real-valued weights assigned to each input sample.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 31 / 35

Page 32: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

Under the Laplacian model, the median becomes the estimate of choice andweighted medians become the filtering structure. The output of a weightedmedian is defined as

Y (n) = MEDIAN(W1 ⋄ X1(n),W2 ⋄ X2(n), · · · ,WN ⋄ XN(n)), (6)

where the operation Wi ⋄Xi (n) replicates the sample Xi (n), Wi times, for example:

MEDIAN(2 ⋄ 5, 1 ⋄ 0, 2 ⋄ −1) =

MEDIAN(5, 5, 0,−1,−1) = 0

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 32 / 35

Page 33: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

For stable processes, the weighted myriad filter emerges as the ideal structure. Inthis case the filter output is defined as

Y (n) = MYRIAD (k; W1 ◦ X1,W2 ◦ X2, · · · ,WN ◦ XN) , (7)

where Wi ◦ Xi (n) represents a nonlinear weighting operation to be described later,and k in (7) is a free tunable parameter that will play an important role inweighted myriad filtering.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 33 / 35

Page 34: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

Moment Theory

Historically, signal processing has relied on second order moments. The first-ordermoment

µX = E{X (n)} (8)

and the second-order moment characterization provided by the autocorrelation ofstationary processes

RX (k) = E {X (n)X (n + k)} , (9)

are deeply etched into traditional signal processing practice, for example, the PSDof a Gaussian random process is found as:

S(f ) = F {RX} ,

where F represents the Fourier transform.

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 34 / 35

Page 35: Nonlinear Signal Processing ELEG 833

Introduction The Filtering Problem

Second-order descriptions do not provide adequate information to processnon-Gaussian signals. The alternatives are:

Higher-order statistics: Moments of order greater than two.

E{|X (n)|p} for p > 2 (10)

They provide information that is unaccessible to second-order moments, butbecome less reliable in impulsive environments.

fractional lower-order statistics (FLOS) consisting of moments of orders lessthan two.

E{|X (n)|p} for p < 2 (11)

Gonzalo R. Arce (Department of Electrical and Computer Engineering University of Delaware [email protected]) Fall 2008 35 / 35