**§ ****5 ****Filtering**** **** **

Extracting the required information from the data ( or signal ) containing errors ( meaning interference, noise ) , this data processing method is called filtering. Only a few types of least squares ( or least variance ) filtering are described below.

1. Least Squares Filtering

[ Least Squares Filtering for Growing Memory ]

_{} Least Squares Criterion for Filtering

Given a discrete observation system

_{}

in the formula

_{}

_{}

_{}

_{}

where represents the transpose, which is the observation value at the moment , and is the *n* -dimensional column vector formed by , called the observation vector; yes , the matrix, where *m* is a constant; is the *n* -dimensional column vector, and its component is the error when; is the state vector ( *m* dimension ) . Suppose the state satisfies the equation_{}_{}_{}_{}_{}_{}_{}_{}_{}_{}_{}

_{}

Among them is the *m* -order square matrix, its elements are all constants, and it is set to be reversible, remember ._{}_{}_{}

If then_{}

_{}

is called the optimal estimate in the least squares sense._{}_{}

Assuming that it is reversible, remember , then it can be deduced_{}_{}

_{}

_{} The recursive formula for filtering

The above-mentioned formula for the optimal estimate is not practical on an electronic computer, and a recursive formula is usually used on the computer. can be launched_{}

_{} _{}

in

_{}

_{}is an *m* -dimensional row vector

_{}

satisfied ._{}

From the above formula, all historical observations are actually used for the purpose , and as *n* increases, the number used increases, so this filtering recursion formula is called growth memory._{}_{}

[ Weighted Least Squares Filtering ] When *n is* sufficiently large, the earlier historical data is often used in an unfavorable role in estimation. Through weighting, the effect of "too old" data can gradually disappear. The following is an example of an "exponential decline" right.

Select , introduce a diagonal matrix sequence_{}

_{}

As before, the observation system and the state equation are set as

_{}

_{}

if time_{}

_{}

It is called the optimal valuation in the sense of least squares with the weight of "exponential decline"._{}_{}

can prove

_{}

in

_{}

And there is a recursive formula for calculation_{}

_{}

in

_{}

_{}is an *m* -dimensional row vector with the same definition as before.

2. Wiener filter

Wiener filtering and Kalman filtering are both least square deviation filtering, but they are based on known conditions, calculation methods and scope of application are different.

[ Wiener Filtering Criterion ] Suppose you want to get the sequence *z* ( *t* ) ( *t* only takes some integer values ) , and actually get the sequence *x* ( *t* ) . So design a linear constant system with impulse response *h* ( *t* ) , so that its input quantity is *x* ( *t* ) , and the output quantity is

_{}

The deviation from *z* ( *t* ) is as small as possible under some criterion.

The so-called Wiener filtering problem is to take *h* ( *t* ) appropriately so that

_{}

When *x* ( *t* ) is a deterministic sequence,

_{}

When *x* ( *t* ) is a random sequence,

_{}

[ Single Least Squares Filtering ] Suppose that the filter factor *h* ( *t* ) is a sequence consisting of *s* + 1 equally spaced coefficients, such as

_{} _{}

*s* + 1 is called the length of the filter factor, then the least square filter factor *h* ( *t* ) satisfies

_{} _{}

or expressed by convolution as

_{} _{}

in

_{}

is called the autocorrelation function of the input *x* ( *t* ) ( Chapter 16 § 3 ) ,

_{}

is called the cross-correlation function of *z* ( *t* ) and *x* ( *t* ) .

How well the filtered output matches the desired output can be measured by the normalized mean squared error :

_{}

to measure, where

_{}

Obviously . At that time , the filtering effect was the best; at that time , the filtering effect was the worst._{}_{}_{}

Similar results can be obtained when the length of the filter factor is infinite. Use , , to represent the frequency spectrum ^{* of }*h* ( *t* ) , respectively , then according to the convolution of the two functions is equivalent to the product of their spectra, the frequency characteristics can be obtained_{}_{}_{}_{}^{}

_{}

It shows that the frequency spectrum of the filter factor can be determined by the frequency spectrum of the cross-correlation function and the auto-correlation function.

[ Multi-channel Least Square Filtering ] Multi-channel least square filtering method utilizes the repetition of multi-channel signals to provide more beneficial information.

Set as n *-* way input, *m* -way desired output, written as a matrix, respectively_{}_{}

_{}

_{}

_{}is the multi-channel filter factor, written as a matrix as

_{}

-------------------------------

_{} The frequency spectrum of a discrete time series is defined as_{}

_{}

in
_{}

When the discrete time series has only finite items, it can be filled with many 0s to become an infinite series and the above definition can be used._{}

_{}is the *m* -channel filtered output, written as a matrix as

_{}

If the length of each input is *k* +1 , and the length of all filter factors is *s* +1 , then the length of the filtered output of each channel is *k* + *s* +1 ._{}_{}_{}

The so-called multi-channel least square filtering is to appropriately select the multi-channel filter factor ( matrix ) *H* ( *t* ) to make the total mean square error

_{}

When it is a definite sequence,_{}

_{} _{}

When it is a random sequence,_{}

_{} _{}

The multi-channel least squares filter factor satisfies the following equation

_{}

_{} ( 1 )

in

_{} _{}

is the multi-channel least squares filter factor, is the matrix,_{}

_{} _{}

is the transpose matrix of the input autocorrelation function square matrix, called the multi-way autocorrelation matrix ,_{}_{}_{}

_{} _{}

A matrix of cross-correlation functions for the desired output and input ._{}_{}_{}_{}

Equation ( 1 ) is *mn* ( *s* + 1) linear simultaneous equations with the filter factor as the unknown . Its solution is the required multi-channel least squares filter factor._{}

The degree of agreement between the filtered output and the desired output, using the normalized mean square error

_{}

to measure, obviously, . At that time , the multi-channel least square filtering output matched the desired output best; at that time , the matching degree was the worst._{}_{}_{}

When the length of the filter factor is infinite, the least squares filter factor satisfies the following equation_{}

_{} _{}

Expressed in frequency spectrum, we have

_{}_{}

=_{}

The solution of the above equation is the desired multi-path least squares filter factor.

3. Kalman filter

[ Kalman Filtering for Linear Discrete Systems ]

_{} The dynamic model assumes that an *n* -dimensional linear dynamic system and a *p* -dimensional linear observation system are respectively described by the following difference equations :

_{}

_{} _{}

Or introduce the corresponding symbol simply written as

_{} ( 1 )

_{} _{} ( 2 )

where ( *k* is an integer ) satisfies_{}

_{}

** x** (

(i ) ( _{} for all *t* , where *I* is the identity matrix )

( ii ) ( _{} for any )_{}

( iii ) _{} _{}

_{} Linear minimum variance estimation If the estimation of the state of the system at time is determined from the dynamic model , the following conditions are satisfied : _{}_{}

( i ) the
estimate is a linear function of the observed value ;_{}_{}

( ii ) = _{}minimum value, where is the estimation error ; then this estimation is called the linear minimum variance estimation._{}

Suppose that through the *p* -dimensional linear observation system ( 2 ) , from the 1st time to the *kth* time, the state of the *n* -dimensional linear dynamic system ( 1 ) has been observed *k* times , and according to the *k* observation data, the state of the *jth* time An estimate is made , and the estimated error is , and the mean squared error of the estimate is denoted as . At the time , it was called forecasting or extrapolation, at the time , it was called interpolation. Especially when *j* = *k* is called filtering, and is abbreviated ._{}_{}_{} _{}_{}_{}_{}_{}_{}_{}_{}

_{} The Kalman filter formula is set in the above dynamic model, and the dynamic noise and observation noise are uncorrelated zero-mean white noise sequences; that is, for all *k, j* _{}_{}

mean , mean square error_{}
_{}

_{} ,
_{}

_{}

Let the statistical characteristics of the initial state be_{}

_{}

and are not related to_{}_{}

_{}

Then the optimal linear filter can be calculated recursively by the following formula_{}_{}

_{}

its initial value ;_{}

_{}

_{} _{}

_{}

_{}Called the weighting matrix or gain matrix, it is the covariance matrix of the optimal estimation error , the *I* in parentheses represents the identity matrix, and the last equation is called the covariance update equation._{}_{}

At this time, the optimal linear prediction ( extrapolation ) is estimated as

_{} ( *j* > *k* )

[ Kalman Filtering for Continuous Time Systems ]

_{} The dynamic model assumes that the state equation is

_{} ( 1 )

The observation equation is

_{}

where is a random process of *n* -dimensional vector type, and is a random process of *p* -dimensional vector type. , respectively, are *m* -dimensional ( ) and *p* -dimensional vector-type, uncorrelated white noise processes with zero mean, namely_{}_{}_{}_{}_{}_{}

_{}

_{}

_{}

_{}

In the formula, *Q* ( *t* ) and *R* ( *t* ) are both continuously differentiable, symmetric and non-negative definite matrices for time *t* ; they are Dirac functions. And *F* ( *t* ) , *G* ( *t* ) and *H* ( *t* ) are respectively matrices whose elements are non-random functions or constants of *t .*_{}_{}

_{} The linear minimum variance estimation assumes the known ( observed ) value ( ) , and is calculated by the formula _{}_{}

_{}

represents a linear estimate such that_{}_{}

_{}= minimum value

Such an estimate is called a linear minimum variance estimate, where the filter factor matrix, each element of which is continuously differentiable for both independent variables._{}

_{} The Kalman filter equation assumes that the above dynamic model satisfies the following conditions :

( i ) the matrix *R* ( *t* ) is positive definite for all *t* ;

( ii ) Under the action of ** u** (

_{}

Deterministic random function;

( iii ) At a certain time , the variance of the measured and the time is known;_{}_{}_{}_{}

Then the optimal filtering equation of the dynamic model is

_{}

in the formula

_{} ( weighted matrix equation )

_{} ( 2 )

( Ricati equation ( see Chapter XIII § 1 ))

The initial conditions are

_{}

_{}

In the above formula, it is called the weighting matrix, which is the covariance matrix of the optimal estimation error ._{}_{}_{}

In particular, when it is a vector-type stationary random process, it can be solved in ( 2 ) ._{}_{}_{}

Contribute a better translation