Variance of a function of one random variable

Say we have a random variable $X$ with known variance and mean. The question is: what is the variance of $f(X)$ for some given function $f$? The only general method I am aware of is the delta method, but it gives only an approximation. Right now I am interested in $f(x)=\sqrt{x}$, but it would also be nice to know some general methods.

Edit 29.12.2010
I have done some calculations using Taylor series, but I am not sure whether they are correct, so I would be glad if someone could confirm them.

First we need an approximation of $E[f(X)]$:
$E[f(X)] \approx E[f(\mu)+f'(\mu)(X-\mu)+\frac{1}{2}\cdot f''(\mu)(X-\mu)^2]=f(\mu)+\frac{1}{2}\cdot f''(\mu)\cdot Var[X]$
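As a sanity check of this mean approximation, here is a minimal Python sketch for $f(x)=\sqrt{x}$. The Gamma distribution with shape $k=10$ and scale 1 is an assumption, chosen only because $E[\sqrt{X}]=\Gamma(k+\tfrac{1}{2})/\Gamma(k)$ is then known in closed form; the function names are illustrative.

```python
import math

def mean_sqrt_gamma_exact(k, theta=1.0):
    """Exact E[sqrt(X)] for X ~ Gamma(shape=k, scale=theta)."""
    return math.sqrt(theta) * math.exp(math.lgamma(k + 0.5) - math.lgamma(k))

def mean_sqrt_second_order(mu, var):
    """f(mu) + 0.5 * f''(mu) * Var[X] for f = sqrt."""
    f2 = -0.25 * mu ** -1.5          # second derivative of sqrt at mu
    return math.sqrt(mu) + 0.5 * f2 * var

k = 10.0                             # assumed shape; mean = variance = k when scale = 1
exact = mean_sqrt_gamma_exact(k)
first = math.sqrt(k)                 # plain plug-in value f(mu)
second = mean_sqrt_second_order(mu=k, var=k)
print(exact, first, second)
```

For this particular distribution the second-order value lands closer to the exact mean than the plug-in value $f(\mu)$; that is not guaranteed for every distribution.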

Now we can approximate $D^2 [f(X)]$
$E[(f(X)-E[f(X)])^2] \approx E[(f(\mu)+f'(\mu)(X-\mu)+\frac{1}{2}\cdot f''(\mu)(X-\mu)^2 -E[f(X)])^2]$

Using the approximation of $E[f(X)]$ we know that $f(\mu)-E[f(X)] \approx -\frac{1}{2}\cdot f''(\mu)\cdot Var[X]$

Using this we get:
$D^2[f(X)] \approx \frac{1}{4}\cdot f''(\mu)^2\cdot Var[X]^2-\frac{1}{2}\cdot f''(\mu)^2\cdot Var[X]^2 + f'(\mu)^2\cdot Var[X]+\frac{1}{4}f''(\mu)^2\cdot E[(X-\mu)^4] +f'(\mu)f''(\mu)E[(X-\mu)^3]$
$D^2 [f(X)] \approx \frac{1}{4}\cdot f''(\mu)^2 \cdot [D^4 X-(D^2 X)^2]+f'(\mu)^2\cdot D^2 X +f'(\mu)f''(\mu)D^3 X$
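One way to check algebra like this is to compare a closed form in central moments against a brute-force computation on a small discrete distribution. The sketch below does that for the variance of the quadratic Taylor surrogate $f'(\mu)(X-\mu)+\frac{1}{2}f''(\mu)(X-\mu)^2$. The three-point distribution and all function names are made up for illustration.

```python
import math

def central_moments(values, probs):
    """Return mean and 2nd, 3rd, 4th central moments of a discrete distribution."""
    mu = sum(p * x for x, p in zip(values, probs))
    def m(k):
        return sum(p * (x - mu) ** k for x, p in zip(values, probs))
    return mu, m(2), m(3), m(4)

def var_closed_form(f1, f2, var, m3, m4):
    """Var of f1*Y + 0.5*f2*Y^2 with Y = X - mu, written via central moments."""
    return f1 ** 2 * var + f1 * f2 * m3 + 0.25 * f2 ** 2 * (m4 - var ** 2)

def var_brute_force(f1, f2, values, probs):
    """Same variance, computed directly from the definition."""
    mu = sum(p * x for x, p in zip(values, probs))
    q = [f1 * (x - mu) + 0.5 * f2 * (x - mu) ** 2 for x in values]
    mean_q = sum(p * v for v, p in zip(q, probs))
    return sum(p * (v - mean_q) ** 2 for v, p in zip(q, probs))

values, probs = [1.0, 2.0, 6.0], [0.5, 0.3, 0.2]   # assumed skewed example
mu, var, m3, m4 = central_moments(values, probs)
f1 = 0.5 / math.sqrt(mu)                            # f'(mu) for f = sqrt
f2 = -0.25 * mu ** -1.5                             # f''(mu) for f = sqrt
closed = var_closed_form(f1, f2, var, m3, m4)
brute = var_brute_force(f1, f2, values, probs)
print(closed, brute)
```

The two numbers agree up to rounding, which pins down the coefficient of each moment. It says nothing about how close the surrogate's variance is to the true $D^2[f(X)]$.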

variance random-variable delta-method

— Tomek Tarczynski

The delta method is used for asymptotic distributions. You cannot use it when you have only one random variable.

— mpiktas

@mpiktas: Actually I don't know much about the delta method, I've just read something on Wikipedia. This is a quotation from the wiki: "The delta method uses second-order Taylor expansions to approximate the variance of a function of one or more random variables".

— Tomek Tarczynski

It seems Wikipedia has exactly what you want: en.wikipedia.org/wiki/…. I will re-edit my answer; it seems that I underestimated the Taylor expansion.

— mpiktas

Tomek, if you disagree with the edits that were made (not by me), you can always change them again, or roll them back, or just point out the differences and ask for clarification.

— Glen_b -Reinstate Monica

@Glen_b: I agree with them: $E(X-\mu)=0$ doesn't imply that $E[(X-\mu)^3]=0$.

— Tomek Tarczynski

Update

I underestimated Taylor expansions. They actually work. I assumed that the integral of the remainder term could be unbounded, but with a little work it can be shown that this is not the case.

The Taylor expansion works for functions on a bounded closed interval. For random variables with finite variance, Chebyshev's inequality gives

$P(|X-EX|>c)\le \frac{Var(X)}{c^2}$

So for any $\varepsilon>0$ we can find large enough $c$ so that

$P(X\in [EX-c,EX+c])=P(|X-EX|\le c)\ge 1-\varepsilon$
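To make the choice of $c$ concrete: with Chebyshev's inequality in its standard form, $P(|X-EX|>c)\le Var(X)/c^2$, it is enough to take $c=\sqrt{Var(X)/\varepsilon}$. The sketch below checks this for an Exponential(1) variable, an assumed example picked because its tail probability is known exactly; the helper names are illustrative.

```python
import math

def chebyshev_half_width(var, eps):
    """Smallest c with Var/c^2 <= eps, i.e. c = sqrt(Var/eps)."""
    return math.sqrt(var / eps)

def exp1_tail(c):
    """Exact P(|X - 1| > c) for X ~ Exponential(rate=1), whose mean is 1."""
    upper = math.exp(-(1.0 + c))                              # P(X > 1 + c)
    lower = 1.0 - math.exp(-(1.0 - c)) if c < 1.0 else 0.0    # P(X < 1 - c)
    return upper + lower

eps = 0.05
c = chebyshev_half_width(var=1.0, eps=eps)   # Exponential(1) has variance 1
coverage = 1.0 - exp1_tail(c)                # P(|X - EX| <= c)
print(c, coverage)
```

The bound is usually far from tight, so the actual coverage of $[EX-c,EX+c]$ can be much larger than $1-\varepsilon$.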

First let us estimate $Ef(X)$ . We have

$\begin{aligned} Ef(X)=\int_{|x-EX|\le c}f(x)dF(x)+\int_{|x-EX|>c}f(x)dF(x) \end{aligned}$

where $F(x)$ is the distribution function of $X$.

Since the domain of the first integral is the interval $[EX-c,EX+c]$, which is a bounded closed interval, we can apply the Taylor expansion:

$\begin{aligned} f(x)=f(EX)+f'(EX)(x-EX)+\frac{f''(EX)}{2}(x-EX)^2+\frac{f'''(\alpha)}{6}(x-EX)^3 \end{aligned}$

where $\alpha\in [EX-c,EX+c]$, and the equality holds for all $x\in[EX-c,EX+c]$. I took only 4 terms in the Taylor expansion, but in general we can take as many as we like, as long as the function $f$ is smooth enough.

Substituting this formula into the previous one we get

$\begin{aligned} Ef(X)&=\int_{|x-EX|\le c}\left[f(EX)+f'(EX)(x-EX)+\frac{f''(EX)}{2}(x-EX)^2\right]dF(x)\\ &+\int_{|x-EX|\le c}\frac{f'''(\alpha)}{6}(x-EX)^3dF(x) +\int_{|x-EX|>c}f(x)dF(x) \end{aligned}$

Now we can increase the domain of the integration to get the following formula

$\begin{aligned} Ef(X)&=f(EX)+\frac{f''(EX)}{2}E(X-EX)^2+R_3 \end{aligned}$

where

$\begin{aligned} R_3&=\frac{f'''(\alpha)}{6}E(X-EX)^3\\ &+\int_{|x-EX|>c}\left(f(x)-f(EX)-f'(EX)(x-EX)-\frac{f''(EX)}{2}(x-EX)^2-\frac{f'''(\alpha)}{6}(x-EX)^3\right)dF(x) \end{aligned}$

Now under some moment conditions we can show that the second term of this remainder term is as large as

$P(|X-EX|>c)$, which is small. Unfortunately the first term remains, and so the quality of the approximation depends on $E(X-EX)^3$ and the behaviour of the third derivative of $f$ in bounded intervals. Such an approximation should work best for random variables with $E(X-EX)^3=0$.

Now for the variance we can use the Taylor approximation for $f(x)$, subtract the formula for $Ef(X)$ and square the difference. Then

$E(f(X)-Ef(X))^2=(f'(EX))^2Var(X)+T_3$

where $T_3$ involves moments $E(X-EX)^k$ for $k=3,4,5,6$. We can also arrive at this formula by using only the first-order Taylor expansion, i.e. using only the first and second derivatives. The error term would be similar.

Another way is to expand $f^2(x)$:

$\begin{aligned} f^2(x)&=f^2(EX)+2f(EX)f'(EX)(x-EX)\\ &+[(f'(EX))^2+f(EX)f''(EX)](x-EX)^2+\frac{(f^2(\beta))'''}{6}(x-EX)^3 \end{aligned}$

Similarly we then get

$\begin{aligned} Ef^2(X)=f^2(EX)+[(f'(EX))^2+f(EX)f''(EX)]Var(X)+\tilde{R}_3 \end{aligned}$

where $\tilde{R}_3$ is similar to $R_3$.

The formula for variance then becomes

$\begin{aligned} Var(f(X))=[f'(EX)]^2Var(X)-\frac{[f''(EX)]^2}{4}Var^2(X)+\tilde{T}_3 \end{aligned}$

where $\tilde{T}_3$ involves only third moments and above.
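To get a feel for the size of the first two terms, here is a small sketch for $f(x)=\sqrt{x}$, dropping $\tilde{T}_3$ entirely. The Gamma distribution (shape $k=10$, scale 1) is an assumption, used because $Var(\sqrt{X})=k-\left(\Gamma(k+\tfrac{1}{2})/\Gamma(k)\right)^2$ is then exact; function names are illustrative.

```python
import math

def var_sqrt_gamma_exact(k):
    """Exact Var[sqrt(X)] for X ~ Gamma(shape=k, scale=1): E[X] - (E[sqrt(X)])^2."""
    mean_sqrt = math.exp(math.lgamma(k + 0.5) - math.lgamma(k))
    return k - mean_sqrt ** 2

def var_sqrt_two_terms(mu, var):
    """[f'(mu)]^2 Var - ([f''(mu)]^2 / 4) Var^2 for f = sqrt, remainder dropped."""
    f1 = 0.5 / math.sqrt(mu)
    f2 = -0.25 * mu ** -1.5
    return f1 ** 2 * var - 0.25 * f2 ** 2 * var ** 2

k = 10.0                                   # assumed shape; mean = variance = k
exact = var_sqrt_gamma_exact(k)
first_order = k / (4.0 * k)                # [f'(mu)]^2 Var with mu = var = k
two_terms = var_sqrt_two_terms(mu=k, var=k)
print(exact, first_order, two_terms)
```

In this example the two-term value is closer to the exact variance than the first-order term alone, but the dropped remainder is of comparable size because the Gamma distribution is skewed.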

— mpiktas

I don't need to know the exact value of the variance; an approximation should work for me.

— Tomek Tarczynski

Indeed, the approximate formula for $\mathbb{E}[f(X)]$ in the OP is often used in risk analysis in economics, finance and insurance.

— Raskolnikov

@Raskolnikov, yes, but it contradicts my admittedly stale knowledge of Taylor expansions. Clearly the remainder term must be taken into account. If the random variable is bounded, then there is no problem, since polynomials approximate continuous functions uniformly on a bounded interval. But we are dealing with unbounded random variables. Of course, for a normal random variable we can say that it is effectively bounded, but in the general case some nasty surprises can arise, or not. I will fix my answer when I have a clear answer.

— mpiktas

@Tomek Tarczynski, the third derivative of $\sqrt{x}$ goes to zero quite quickly for large $x$, but is unbounded near zero. So if you picked a uniform distribution with support close to zero, the remainder term can get large.
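A rough illustration of this point, using only the leading term $[f'(EX)]^2Var(X)$ and two assumed uniform distributions of the same width, one starting at zero and one far from it. For a uniform variable on $[a,b]$, $E\sqrt{X}=\frac{2}{3}\frac{b^{3/2}-a^{3/2}}{b-a}$, so the exact variance is available; the function names are illustrative.

```python
def sqrt_var_uniform_exact(a, b):
    """Exact Var[sqrt(X)] for X ~ Uniform(a, b), 0 <= a < b."""
    mean_sqrt = (2.0 / 3.0) * (b ** 1.5 - a ** 1.5) / (b - a)
    return 0.5 * (a + b) - mean_sqrt ** 2

def sqrt_var_first_order(a, b):
    """First-order approximation f'(mu)^2 Var[X] = Var[X] / (4 mu)."""
    mu, var = 0.5 * (a + b), (b - a) ** 2 / 12.0
    return var / (4.0 * mu)

def relative_error(a, b):
    exact = sqrt_var_uniform_exact(a, b)
    return abs(sqrt_var_first_order(a, b) - exact) / exact

near_zero = relative_error(0.0, 1.0)     # support touches zero
far_away = relative_error(10.0, 11.0)    # same width, far from zero
print(near_zero, far_away)
```

The relative error is large for the support touching zero and small for the shifted one, in line with the behaviour of the higher derivatives of $\sqrt{x}$.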

— mpiktas

Note that in your link the equality is approximate. In this answer all the equations are exact. Furthermore, for the variance note that the first derivative is evaluated at $EX$, not $x$. Also, I never stated that this will not work for $\sqrt{x}$, only that for $\sqrt{x}$ the approximate formula might have a huge error if the domain of $X$ is close to zero.

— mpiktas

Knowing the first two moments of X (mean and variance) is not enough if the function f(x) is arbitrary (nonlinear). This holds not only for computing the variance of the transformed variable Y, but also for its mean. To see this, and perhaps to attack your problem, you can assume that your transformation function has a Taylor expansion around the mean of X and work from there.
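A small sketch of why two moments are not enough: two distributions with the same mean and variance can give different means and variances for $\sqrt{X}$. The Gamma and uniform distributions below (both with mean 10 and variance 10) are assumptions chosen because both have closed-form expressions for $E\sqrt{X}$; the function names are illustrative.

```python
import math

def gamma_sqrt_moments(k):
    """Exact mean and variance of sqrt(X) for X ~ Gamma(shape=k, scale=1)."""
    m = math.exp(math.lgamma(k + 0.5) - math.lgamma(k))
    return m, k - m * m

def uniform_sqrt_moments(mu, var):
    """Exact mean and variance of sqrt(X) for X uniform with given mean and variance."""
    half = math.sqrt(3.0 * var)          # half-width that yields variance var
    a, b = mu - half, mu + half          # requires a >= 0
    m = (2.0 / 3.0) * (b ** 1.5 - a ** 1.5) / (b - a)
    return m, mu - m * m

g_mean, g_var = gamma_sqrt_moments(10.0)          # mean 10, variance 10
u_mean, u_var = uniform_sqrt_moments(10.0, 10.0)  # same mean and variance
print(g_mean, g_var)
print(u_mean, u_var)
```

Any formula that uses only the mean and variance of X would return the same number for both cases, so it cannot be exact for both.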

— leonbloy