Каково наилучшее приближение первого порядка IIR (AR-фильтр) к фильтру скользящего среднего (FIR-фильтр)?

24

Предположим, что первый БИХ фильтр первого порядка:

y [n] = α x [n] + (1 - α) y [n - 1]

$y[n] = \alpha x[n] + (1 - \alpha) y[n - 1]$

Как я могу выбрать параметр IIR как можно лучше приближает FIR, который является средним арифметическим последних выборок: $\alpha$ $k$

z [n] = \frac{1}{k} x [n] + \frac{1}{k} x [n - 1] + \dots + \frac{1}{k} x [n - k + 1]

$z[n] = \frac{1}{k}x[n] + \frac{1}{k}x[n-1] + \ldots + \frac{1}{k}x[n-k+1]$

Где , что означает, что вход для IIR может быть длиннее, чем и все же я хотел бы получить наилучшее приближение среднего значения последних входов. $n \in [k, \infty)$ $k$ $k$

Я знаю, что БИХ имеет бесконечную импульсную характеристику, поэтому я ищу лучшее приближение. Я был бы рад за аналитическое решение, будь то для функции стоимости ${L}_{2}$ или ${L}_{1}$ .

Как можно решить эту проблему оптимизации, учитывая только 1-го порядка БИХ.

Спасибо.

— Royi
источник

Имеет ли она следовать

y [n] = α x [n] + (1 - α) y [n - 1]

$y[n] = \alpha x[n] + (1 - \alpha) y[n - 1]$ точно]?

— Фонон

1

Это должно стать очень плохим приближением. Разве вы не можете позволить себе что-то большее, чем БИХ первого порядка?

— оставил около

Возможно, вы захотите отредактировать свой вопрос так, чтобы вы не использовали

y [n]

$y[n]$ для обозначения двух разных вещей, например, второе отображаемое уравнение может иметь вид

z [n] = \frac{1}{k} x [n] + \dots + \frac{1}{k} x [n - k + 1]

$z[n] = \frac{1}{k}x[n] + \cdots + \frac{1}{k}x[n-k+1]$ , и вы можете сказать, что именно является вашим критерием «как можно лучше», например, хотите ли вы

| y [n] - z [n] |

$\vert y[n] - z[n]\vert$ быть как можно меньше для всех

n

$n$ или

| y [n] - z [n] |^{2}

$\vert y[n] - z[n]\vert^2$ быть как можно меньше для всех

n

$n$ .

— Дилип Сарват

@ Фонон, да, это должно быть БИХ первого порядка. Критерий прост, результат

должен быть как можно ближе к среднему значению последних

входов в систему, где

. Я был бы рад увидеть результат в обоих случаях. Хотя я предполагаю, что аналитическое решение жизнеспособно только для

.

y [n]

$y[n]$

k

$k$

n \in [k, inf]

$n \in [k, \inf]$

{| y [n] - z [n] |}^{2}

${|y[n]−z[n]|}^{2}$

— Рой

10

There is no analytic solution for $\alpha$ being a scalar (I think). Here is a script that gives you $\alpha$ for a given $K$ . If you need it online you can build a LUT. The script finds the solution that minimizes

\int_{0}^{π} d w {| H_{1} (j w) - H_{2} (j w) |}^{2}

$\int_{0}^{\pi} dw \quad \left|H_1(jw) - H_2(jw)\right|^2$

where $H_1$ is the FIR frequency response and $H_2$ is the IIR frequency response.

You did not specify any range for K. But I just want to make it clear that the following system is equivalent to your mean filter and has the same computational complexity and your first order IIR!

$H(z) = \frac{1}{K} \frac{1 - z^{-K}}{1-z^{-1}}$

function a = find_a(K)

w = 0.0001:0.001:pi;
as = [-1:0.001:-0.001  0.001:0.001:1];

E = zeros(size(as));
for idx=1:length(as)
    fJ = J(w,as(idx),K);
    E(idx) = sum(fJ);
end

[Emin, indx] = min(E)
a = as(indx)

function f = J(w,a,K)
    num = 2*(2-a)*(1-cos(w*K)) + 2*(cos(w*(K-1)) - cos(w)) - 2*(1-a)*(cos(w)-cos(w*(K+1)));
    den = (2-a)^2 + 1 + (1-a)^2 + 2*(1-a)*cos(2*w) - 2*(2-a)^2*cos(w);
    f = -(a/K)*num./den;
    f = f+(1/K^2)*(1-cos(w*K))./(1-cos(w))+a^2./(1+(1-a)^2-2*(1-a)*cos(w));
end

end

— niaren
источник

@Drazick It is relative straight forward. The two expressions for the IIR and FIR is plugged into the integral. The key to finding the alternative expression for the FIR filter is to recognize the geometric progression/series. You find all the details here: en.wikipedia.org/wiki/Geometric_progression#Geometric_series. In the script, the J function calculates the expression under the integral sign.

— niaren

@niaren I know this is an old post so if you can remember: how is your function 'f' derived? I've coded a similar thing but using the complex transfer functions for FIR (H1) and IIR (H2) and then doing sum(abs(H1 - H2)**2). I've compared this with your sum(fj), but get different resulting outputs. Thought I would ask before ploughing through the math.

— Dom

@Dom That IS a long time ago and I really can't remember. I guess I just went through the process of working out

[H_{1} (j ω) - H_{2} (j ω)] [H_{1} (- j ω) - H_{2} (- j ω)]

$[H_1(j\omega)-H_2(j\omega)][H_1(-j\omega)-H_2(-j\omega)]$ . I can't remember how I verified the expression. I don't mind going through the math again...

— niaren

@niaren Hi, I tried to derive your expression but got stuck when adding up the complex fractions. I made a mistake in my code...your function seems to give results that are proportional to sum(abs(H1 - H2)**2), indicating it is correct.

— Dom

16

There's a nice discussion of this problem in Embedded Signal Processing with the Micro Signal Architecture, roughly between pages 63 and 69. On page 63, it includes a derivation of the exact recursive moving average filter (which niaren gave in his answer),

H (z) = \frac{1}{N} \frac{1 - z^{- N}}{1 - z^{- 1}} .

$H(z) = { 1 \over{N} } { 1 - z^{-N} \over { 1 - z^{-1} } }.$

For convenience with respect to the following discussion, it corresponds to the following difference equation:

y_{n} = y_{n - 1} + \frac{1}{N} (x_{n} - x_{n - N}) .

$y_n = y_{n-1} + {1\over{N} }(x_n - x_{n-N}).$

The approximation which puts the filter into the form you specified requires assuming that $x_{n-N} \approx y_{n-1}$ , because (and I quote from pg. 68) " $y_{n-1}$ is the average of $x_n$ samples". That approximation allows us to simplify the preceding difference equation as follows:

\begin{matrix} y_{n} = y_{n - 1} + \frac{1}{N} (x_{n} - y_{n - 1}) \\ y_{n} = y_{n - 1} - \frac{1}{N} y_{n - 1} + \frac{1}{N} x_{n} \\ y_{n} = (1 - \frac{1}{N}) y_{n - 1} + \frac{1}{N} x_{n} . \end{matrix}

$\begin{array}\\ y_{n} = y_{n-1} + {1\over{N}}(x_n - y_{n-1}) \\ y_{n} = y_{n-1} - {1\over{N}} y_{n-1} + {1\over{N}}x_n \\ y_{n} = (1-{1\over{N}})y_{n-1} + {1\over{N}}x_n. \end{array}$

Setting $\alpha = {1\over{N}}$ , we arrive at your original form, $y_{n} = \alpha x_n + (1-\alpha)y_{n-1}$ , which shows that the coefficient you want (with respect to this approximation) is exactly $1\over{N}$ (where $N$ is the number of samples).

Is this approximation the "best" in some respect? It's certainly elegant. Here's how the magnitude response compares [at 44.1kHz] for N = 3, and as N increases to 10 (approximation in blue):

N=3 N=[3,10]

As Peter's answer suggests, approximating an FIR filter with a recursive filter can be problematic under a least squares norm. An extensive discussion of how to solve this problem in general can be found in JOS's thesis, Techniques for Digital Filter Design and System Identification with Application to the Violin. He advocates the use of the Hankel Norm, but in cases where the phase response doesn't matter, he also covers Kopec's Method, which might work well in this case (and uses an $L^2$ norm). A broad overview of the techniques in the thesis can be found here. They may yield other interesting approximations.

— datageist
источник

This is an "Elegant" way to say something about the memory of the first order IIR Filter. Its memory is equivalent to

\frac{1}{α}

$\frac{1}{\alpha}$ . I'll look into the other methods you mentioned. Thanks.

— Royi

Could you explain intuitively why under the LS norm (

L_{2}

${L}_{2}$ ) there's no solution?

— Royi

I'm not sure if there is a LS solution in this case or not yet, I just know that it tends have issues with convergence for the general "IIR-based FIR approximation" problem. I'll update w/ more info when I get a chance.

— datageist

Well, If the cost function Peter suggested (The first one) is right there is a solution. At least according to my calculations.

— Royi

Great. I'm curious to see how the "heuristic"

1 / N

$1/N$ approach compares to something more canonical.

— datageist

16

OK, let's try to derive the best:

\begin{array}{lcl} y [n] & = & α x [n] + (1 - α) y [n - 1] \\ = & α x [n] + (1 - α) α x [n - 1] + (1 - α)^{2} y [n - 2] \\ = & α x [n] + (1 - α) α x [n - 1] + (1 - α)^{2} α x [n - 2] + (1 - α)^{3} y [n - 3] \end{array}

$\begin{array}{lcl} y[n] &=& \alpha x[n] + (1 - \alpha) y[n - 1] \\ &=& \alpha x[n] + (1 - \alpha) \alpha x[n-1] + (1 - \alpha)^2 y[n - 2]\\ &=& \alpha x[n] + (1 - \alpha) \alpha x[n-1] + (1 - \alpha)^2 \alpha x[n-2] + (1 - \alpha)^3 y[n - 3]\\ \end{array}$ so that the coefficient of

x [n - m]

$x[n-m]$ is

α (1 - α)^{m}

$\alpha(1-\alpha)^m$ .

The best mean-square approximation will minimize:

\begin{array}{lcl} J (α) & = & \sum_{m = 0}^{k - 1} (α (1 - α)^{m} - \frac{1}{k})^{2} + \sum_{m = k}^{\infty} α^{2} (1 - α)^{2 m} \\ = & \sum_{m = 0}^{k - 1} (α^{2} (1 - α)^{2 m} - \frac{2}{k} α (1 - α)^{m} + \frac{1}{k^{2}}) + α^{2} (1 - α)^{2 k} \sum_{m = 0}^{\infty} (1 - α)^{2 m} \\ = & α^{2} \frac{1 - (1 - α)^{2 k}}{1 - (1 - α)^{2}} + \frac{2 α}{k} \frac{1 - (1 - α)^{k}}{1 - (1 - α)} + \frac{α^{2} (1 - α)^{2 k}}{1 - (1 - α)^{2}} + \frac{1}{k} \\ = & \frac{α^{2}}{1 - (1 - α)^{2}} + \frac{2}{k} (1 - (1 - α)^{k}) + \frac{1}{k} \\ = & \frac{α^{2}}{2 α - α^{2}} + \frac{2}{k} (1 - (1 - α)^{k}) + \frac{1}{k} \\ = & \frac{α}{2 - α} + \frac{2}{k} (1 - (1 - α)^{k}) + \frac{1}{k} \end{array}

$\begin{array}{lcl} J(\alpha) &=& \sum_{m=0}^{k-1} (\alpha(1-\alpha)^m - \frac{1}{k})^2 + \sum_{m=k}^\infty \alpha^2(1-\alpha)^{2m}\\ &=& \sum_{m=0}^{k-1} \left(\alpha^2(1-\alpha)^{2m} - \frac{2}{k}\alpha(1-\alpha)^m + \frac{1}{k^2}\right) + \alpha^2 (1-\alpha)^{2k} \sum_{m=0}^\infty (1-\alpha)^{2m} \\ &=& \alpha^2\frac{1- (1-\alpha)^{2k}}{1 - (1-\alpha)^2} + \frac{2\alpha}{k} \frac{1 - (1 - \alpha)^k}{1 - (1 - \alpha)} + \frac{\alpha^2(1-\alpha)^{2k}}{1 - (1 - \alpha)^2}+ \frac{1}{k}\\ &=& \frac{\alpha^2}{1 - (1 - \alpha)^2} + \frac{2}{k} (1-(1-\alpha)^k) + \frac{1}{k}\\ &=& \frac{\alpha^2}{2\alpha - \alpha^2 }+ \frac{2}{k} (1-(1-\alpha)^k)+ \frac{1}{k}\\ &=& \frac{\alpha}{2 - \alpha }+ \frac{2}{k} (1-(1-\alpha)^k)+ \frac{1}{k}\\ \end{array}$ because the FIR coefficients are zero for

m > k - 1

$m > k - 1$ .

Next step is to take derivatives and equate to zero.

Looking at a plot of the derived $J$ for $K = 1000$ and $\alpha$ from 0 to 1, it looks like the problem (as I've set it up) is ill-posed, because the best answer is $\alpha = 0$ .

enter image description here

I think there's a mistake here. The way it should be according to my calculations is:

\begin{array}{lcl} J (α) & = & \sum_{m = 0}^{k - 1} (α (1 - α)^{m} - \frac{1}{k})^{2} + \sum_{m = k}^{\infty} α^{2} (1 - α)^{2 m} \\ = & \sum_{m = 0}^{k - 1} (α^{2} (1 - α)^{2 m} - \frac{2}{k} α (1 - α)^{m} + \frac{1}{k^{2}}) + α^{2} (1 - α)^{2 k} \sum_{m = 0}^{\infty} (1 - α)^{2 m} \\ = & α^{2} \frac{1 - (1 - α)^{2 k}}{1 - (1 - α)^{2}} - \frac{2 α}{k} \frac{1 - (1 - α)^{k}}{1 - (1 - α)} + \frac{1}{k} + \frac{α^{2} (1 - α)^{2 k}}{1 - (1 - α)^{2}} \end{array}

$\begin{array}{lcl} J(\alpha) &=& \sum_{m=0}^{k-1} (\alpha(1-\alpha)^m - \frac{1}{k})^2 + \sum_{m=k}^\infty \alpha^2(1-\alpha)^{2m} \\ &=& \sum_{m=0}^{k-1} \left(\alpha^2(1-\alpha)^{2m} - \frac{2}{k}\alpha(1-\alpha)^m + \frac{1}{k^2}\right) + \alpha^2 (1-\alpha)^{2k} \sum_{m=0}^\infty (1-\alpha)^{2m} \\ &=& \alpha^2\frac{1- (1-\alpha)^{2k}}{1 - (1-\alpha)^2} - \frac{2\alpha}{k} \frac{1 - (1 - \alpha)^k}{1 - (1 - \alpha)} + \frac{1}{k} + \frac{\alpha^2(1-\alpha)^{2k}}{1 - (1 - \alpha)^2} \end{array}$

Simplifying it according to Mathematica yields:

J (α) = \frac{α}{2 - α} + \frac{2 {(1 - α)}^{k} - 1}{k}

$J(\alpha) = \frac{\alpha}{2 - \alpha} + \frac{2 {(1 - \alpha)}^{k} -1}{k}$

Using the following code on MATLAB yields something equivalent though different:

syms a k;

expr1 = (a ^ 2) * ((1 - ((1 - a) ^ (2 * k))) / (1 - ((1 - a) ^ 2)));
expr2 = ((2 * a) / k) * ((1 - ((1 - a) ^ (k))) / (1 - (1 - a)));
expr3 = (1 / k);
expr4 = ((a ^ 2) * ((1 - a) ^ (2 * k))) / (1 - ((1 - a) ^ (2)));

simpExpr = simplify(expr1 - expr2 + expr3 + expr4);

J (α) = \frac{- 2}{α - 2} - \frac{k - 2 {(1 - α)}^{k} + 1}{k}

$J(\alpha) = \frac{-2}{\alpha - 2} - \frac{k - 2{(1 - \alpha)}^{k} + 1}{k}$

Anyhow, those functions do have minimum.

So let's assume that we really only care about the approximation over the support (length) of the FIR filter. In that case, the optimization problem is just:

J_{2} (α) = \sum_{m = 0}^{k - 1} (α (1 - α)^{m} - \frac{1}{k})^{2}

$J_2(\alpha) = \sum_{m=0}^{k-1} (\alpha(1-\alpha)^m - \frac{1}{k})^2$

Plotting $J_2(\alpha)$ for various values of $K$ versus $\alpha$ results in the date in the plots and table below.

For $K$ = 8. $\alpha_{\tt min}$ = 0.1533333
For $K$ = 16. $\alpha_{\tt min}$ = 0.08
For $K$ = 24. $\alpha_{\tt min}$ = 0.0533333
For $K$ = 32. $\alpha_{\tt min}$ = 0.04
For $K$ = 40. $\alpha_{\tt min}$ = 0.0333333
For $K$ = 48. $\alpha_{\tt min}$ = 0.0266667
For $K$ = 56. $\alpha_{\tt min}$ = 0.0233333
For $K$ = 64. $\alpha_{\tt min}$ = 0.02
For $K$ = 72. $\alpha_{\tt min}$ = 0.0166667

enter image description here

The red dashed lines are $1/K$ and the green lines are $\alpha_{\tt min}$ , the value of $\alpha$ that minimizes $J_2(\alpha)$ (chosen from $\tt alpha = [0:.01:1]/3;$ ).

— Peter K.
источник

1

Was just going to post the exact same thing = )

— Phonon

@Phonon: Feel free to continue! I've marked it as community wiki for that purpose.

— Peter K.

The derivative w.r.t

α

$\alpha$ is a series with an infinite number of terms (i.e. not a polynomial) that you have to set equal to

0

$0$ and then solve for

α

$\alpha$ , and so some care (or possibly approximation) is going to be necessary.

— Dilip Sarwate

Can someone please check and/or correct my working? :-)

— Peter K.

@DilipSarwate, What would be the best approximation? Thanks.

— Royi

3

Based in experimental tests with k in range (2 to 100) the best fit (sum squared error) gives a relation of alfa = 1/k^0.865 being k number of samples for MovAvg filter

— chiliwili69
источник

I used an interactive excel sheet to extract the relation: drive.google.com/open?id=0B48gEYiKwYegY3NqQThMTTZ1OG8

— chiliwili69

3

I stumbled upon this old question and I would like to share my solution. As mentioned in other answers, there is no analytical solution, but the function to be minimized behaves nicely and the optimal value of $\alpha$ can be found easily with a few Newton iterations. There is also a formula to check the optimality of the result.

The impulse response of the length $N$ FIR moving average filter is given by

\begin{matrix} (1) & h_{F I R} [n] = \frac{1}{N} (u [n] - u [n - N]) \end{matrix}

$h_{FIR}[n]=\frac{1}{N}(u[n]-u[n-N])\tag{1}$

where $u[n]$ is the unit step function. The first order IIR filter

\begin{matrix} (2) & y [n] = α x [n] + (1 - α) y [n - 1] \end{matrix}

$y[n]=\alpha x[n]+(1-\alpha)y[n-1]\tag{2}$

has the impulse response

\begin{matrix} (3) & h_{I I R} [n] = α (1 - α)^{n} u [n] \end{matrix}

$h_{IIR}[n]=\alpha(1-\alpha)^nu[n]\tag{3}$

The goal is now to minimize the squared error

\begin{matrix} (4) & ϵ = \sum_{n = 0}^{\infty} {(h_{F I R} [n] - h_{I I R} [n])}^{2} \end{matrix}

$\epsilon=\sum_{n=0}^{\infty}\left(h_{FIR}[n]-h_{IIR}[n]\right)^2\tag{4}$

Using $(1)$ and $(3)$ , the error can be written as

\begin{aligned} ϵ (α) & = \sum_{n = 0}^{N - 1} {(α (1 - α)^{n} - \frac{1}{N})}^{2} + \sum_{n = N}^{\infty} α^{2} (1 - α)^{2 n} \\ = α^{2} \sum_{n = 0}^{\infty} (1 - α)^{2 n} - \frac{2 α}{N} \sum_{n = 0}^{N - 1} (1 - α)^{n} + \sum_{n = 0}^{N - 1} \frac{1}{N^{2}} \\ = \frac{α^{2}}{1 - (1 - α)^{2}} - \frac{2 α}{N} \frac{1 - (1 - α)^{N}}{1 - (1 - α)} + \frac{1}{N} \\ (5) & = \frac{α}{2 - α} - \frac{2}{N} (1 - (1 - α)^{N}) + \frac{1}{N}, 0 < α < 2 \end{aligned}

$\begin{align}\epsilon(\alpha)&=\sum_{n=0}^{N-1}\left(\alpha(1-\alpha)^n-\frac{1}{N}\right)^2+\sum_{n=N}^{\infty}\alpha^2(1-\alpha)^{2n}\\&=\alpha^2\sum_{n=0}^{\infty}(1-\alpha)^{2n}-\frac{2\alpha}{N}\sum_{n=0}^{N-1}(1-\alpha)^n+\sum_{n=0}^{N-1}\frac{1}{N^2}\\&=\frac{\alpha^2}{1-(1-\alpha)^2}-\frac{2\alpha}{N}\frac{1-(1-\alpha)^N}{1-(1-\alpha)}+\frac{1}{N}\\&=\frac{\alpha}{2-\alpha}-\frac{2}{N}\left(1-(1-\alpha)^N\right)+\frac{1}{N},\qquad 0<\alpha<2\tag{5}\end{align}$

This expression is very similar to the one given in this answer, but it's not identical. The restriction on $\alpha$ in $(5)$ makes sure that the infinite sum converges, and it is identical to the stability condition for the IIR filter given by $(2)$ .

Setting the derivative of $(5)$ to zero results in

\begin{matrix} (6) & (1 - α)^{N - 1} (2 - α)^{2} = 1 \end{matrix}

$(1-\alpha)^{N-1}(2-\alpha)^2=1\tag{6}$

Note that the optimal $\alpha$ must be in the interval $(0,1]$ because larger values of $\alpha$ result in an alternating impulse response $(3)$ , which cannot approximate the constant impulse repsonse of the FIR moving average filter.

Taking the square root of $(6)$ and introducing $\beta=1-\alpha$ , we obtain

\begin{matrix} (7) & β^{(N + 1) / 2} + β^{(N - 1) / 2} - 1 = 0 \end{matrix}

$\beta^{(N+1)/2}+\beta^{(N-1)/2}-1=0\tag{7}$

This equation cannot be solved analytically for $\beta$ , but it can be solved for $N$ :

\begin{matrix} (8) & N = - 2 \frac{\log (1 + β)}{\log (β)}, β \neq 0 \end{matrix}

$N=-2\frac{\log(1+\beta)}{\log(\beta)},\qquad \beta\neq 0\tag{8}$

Equation $(8)$ can be used to double-check a numerical solution of $(7)$ ; it must return the specified value of $N$ .

Equation $(7)$ can be solved with a few lines of (Matlab/Octave) code:

N = 50;     % desired filter length of FIR moving average filter

if ( N == 1 )    % no iteration for trivial case
    b = 0;
else
    % Newton iteration
    b = 1;       % starting value
    Nit = 7;
    n = (N+1)/2;
    for k = 1:Nit,
        f = b^n + b^(n-1) -1;
        fp = n*b^(n-1) + (n-1)*b^(n-2);
        b = b - f/fp;
    end

    % check result
    N0 = -2*log(1+b)/log(b) + 1     % must equal N
end

a = 1 - b;

Below is a table with the optimal values of $\alpha$ for a range of filter lengths $N$ :

   N     alpha

   1   1.0000e+00
   2   5.3443e-01
   3   3.8197e-01
   4   2.9839e-01
   5   2.4512e-01
   6   2.0809e-01
   7   1.8083e-01
   8   1.5990e-01
   9   1.4333e-01
  10   1.2987e-01
  20   6.7023e-02
  30   4.5175e-02
  40   3.4071e-02
  50   2.7349e-02
  60   2.2842e-02
  70   1.9611e-02
  80   1.7180e-02
  90   1.5286e-02
 100   1.3768e-02
 200   6.9076e-03 
 300   4.6103e-03
 400   3.4597e-03
 500   2.7688e-03
 600   2.3078e-03
 700   1.9785e-03
 800   1.7314e-03
 900   1.5391e-03
1000   1.3853e-03

— Matt L.
источник