Uniform random variable as a sum of two random variables

Taken from Grimmett and Stirzaker:

Show that it cannot be the case that $U=X+Y$ where $U$ is uniformly distributed on $[0,1]$ and $X$ and $Y$ are independent and identically distributed. You should not assume that $X$ and $Y$ are continuous variables.

A simple proof by contradiction suffices for the case where $X$, $Y$ are assumed to be discrete, by arguing that it is always possible to find $u$ and $u'$ such that $P(U\leq u+u') \geq P(U\leq u)$ while $P(X+Y \leq u) = P(X+Y \leq u+u')$.

However, this proof does not extend to $X,Y$ being absolutely continuous or singular continuous. Hints / comments / critique?

— rightskewed

Hint: characteristic functions are your friends.

— cardinal

$X$ and $Y$ are identically distributed, so their characteristic functions must be identical. You need to use the characteristic function rather than the moment generating function: the mgf is not guaranteed to exist for $X$, so showing that the mgf has an impossible property does not mean that no such $X$ exists. Every random variable has a characteristic function, so if you show that the characteristic function has an impossible property, then no such $X$ exists.

— Silverfish

If the distributions of $X$ and $Y$ have atoms, say $P\{X=a\}=P\{Y=a\} = b > 0$, then $P\{X+Y=2a\} \geq b^2 > 0$ and hence $X+Y$ cannot be uniformly distributed on $[0,1]$. Thus there is no need to consider the case of distributions of $X$ and $Y$ having atoms.

— Dilip Sarwate
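The atom argument in the comment above can be illustrated on a small discrete example. This is only a sketch: the three-point distribution, the helper `convolve`, and the values of `a` and `b` are hypothetical choices made for illustration, not part of the original problem.

```python
from fractions import Fraction
from itertools import product

def convolve(pmf_x, pmf_y):
    """Distribution of X + Y for independent X, Y given as {value: probability}."""
    out = {}
    for (x, px), (y, py) in product(pmf_x.items(), pmf_y.items()):
        out[x + y] = out.get(x + y, 0) + px * py
    return out

# Hypothetical distribution on [0, 1/2] with an atom of mass b at a.
a, b = Fraction(1, 8), Fraction(1, 3)
pmf = {Fraction(0): Fraction(1, 3), a: b, Fraction(1, 4): Fraction(1, 3)}

pmf_sum = convolve(pmf, pmf)
atom_at_2a = pmf_sum[2 * a]  # mass that X + Y places on the single point 2a

# A uniform distribution gives every single point probability zero,
# so any positive mass at 2a rules out uniformity of X + Y.
print(atom_at_2a, ">=", b ** 2)
```

Here the mass at $2a$ exceeds $b^2$ because other pairs of values also sum to $2a$; the bound $b^2$ only counts the pair $(a, a)$.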

Answers:

The result can be proven with a picture: the visible gray areas show that a uniform distribution cannot be decomposed as a sum of two independent identically distributed variables.

Notation

Let $X$ and $Y$ be iid such that $X+Y$ has a uniform distribution on $[0,1]$. This means that for all $0\le a \le b \le 1$,

$\Pr(a < X+Y \le b) = b-a.$

The essential support of the common distribution of $X$ and $Y$ is therefore $[0,1/2]$ (otherwise there would be positive probability that $X+Y$ lies outside $[0,1]$).

The picture

Let $0 \lt \epsilon \lt 1/4$. Consider this diagram showing how sums of random variables are computed:

The underlying probability distribution is the joint one for $(X,Y)$. The probability of any event $a \lt X+Y \le b$ is given by the total probability covered by the diagonal band stretching between the lines $x+y=a$ and $x+y=b$. Three such bands are shown: from $0$ to $\epsilon$, appearing as a small blue triangle at the lower left; from $1/2-\epsilon$ to $1/2+\epsilon$, shown as a gray rectangle capped with two (yellow and green) triangles; and from $1-\epsilon$ to $1$, appearing as a small red triangle at the upper right.

What the picture shows

By comparing the lower left triangle in the figure to the lower left square containing it, and using the iid assumption for $X$ and $Y$, it is evident that

$\epsilon = \Pr(X+Y \le \epsilon) \lt \Pr(X \le \epsilon)\Pr(Y \le \epsilon) = \Pr(X \le \epsilon)^2.$

Note that the inequality is strict: equality is not possible because there is some positive probability that both $X$ and $Y$ are less than $\epsilon$ but nevertheless $X+Y \gt \epsilon$.

Similarly, by comparing the red triangle to the square in the upper right corner,

$\epsilon = \Pr(X+Y \gt 1-\epsilon) \lt \Pr(X \gt 1/2-\epsilon)^2.$

Finally, comparing the two opposite triangles in the upper left and lower right to the diagonal band containing them gives another strict inequality,

$2\epsilon \lt 2 \Pr(X\le \epsilon)\Pr(X \gt 1/2-\epsilon) \lt \Pr(1/2-\epsilon \lt X+Y \le 1/2+\epsilon) = 2\epsilon.$

The first inequality arises from the previous two (take their square roots and multiply them), the second describes the (strict) inclusion of the triangles within the band, and the last equality expresses the uniformity of $X+Y$. The conclusion that $2\epsilon \lt 2\epsilon$ is the contradiction proving that such $X$ and $Y$ cannot exist, QED.
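The two middle steps of the chain can be spelled out as follows. This is a sketch under the assumptions already in force ($X$, $Y$ iid with values in $[0,1/2]$ almost surely, $0 \lt \epsilon \lt 1/4$); the shorthand $p = \Pr(X \le \epsilon)$ and $q = \Pr(X \gt 1/2-\epsilon)$ is introduced here for convenience and does not appear in the answer.

$p^2 \gt \epsilon,\quad q^2 \gt \epsilon \quad\Longrightarrow\quad pq = \sqrt{p^2}\,\sqrt{q^2} \gt \epsilon \quad\Longrightarrow\quad 2pq \gt 2\epsilon.$

$\{X \le \epsilon,\ Y \gt 1/2-\epsilon\} \,\cup\, \{X \gt 1/2-\epsilon,\ Y \le \epsilon\} \;\subseteq\; \{1/2-\epsilon \lt X+Y \le 1/2+\epsilon\}.$

The inclusion holds because on the first event $X \ge 0$ gives $X+Y \gt 1/2-\epsilon$ and $Y \le 1/2$ gives $X+Y \le 1/2+\epsilon$, and symmetrically on the second. The two events are disjoint since $\epsilon \lt 1/4$ implies $\epsilon \lt 1/2-\epsilon$, and by independence each has probability $pq$. So the band has probability at least $2pq$, and even this non-strict version already yields $2\epsilon \lt 2pq \le 2\epsilon$, which is impossible.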

— whuber

(+1) I like this approach. Retrieving my back of envelope from the wastepaper bin, I can see that I drew the same diagram, except that I did not mark the yellow and green triangles inside the band. I did obtain the inequalities for the blue and red triangles. I played around with them and a few other probabilities, but never thought to investigate the probability of the band, which turns out to be the critical step. I wonder what thought process might have motivated this insight?

— Silverfish

In fact, where @whuber has yellow and green triangles, I had drawn squares (I had actually decomposed $[0, 0.5]^2$ into a grid). Looking at the step which "describes the (strict) inclusion of the triangles within the band", $2 \Pr(X\le \epsilon)\Pr(X \gt 1/2-\epsilon) \lt \Pr(1/2-\epsilon \lt X+Y \le 1/2+\epsilon)$, I wonder whether this would actually be geometrically more natural with squares capping the band rather than triangles?

— Silverfish

@Silver I was reminded of an analysis of sums of uniform distributions I posted a couple of years ago. That suggested visualizing the sum $X+Y$ geometrically. It was immediately evident that a lot of probability had to be concentrated near the corners $(0,0)$ and $(1/2,1/2)$ in order for the sum to be uniform and for relatively little probability to be near the center diagonal $X+Y=1/2$. That led to the diagram, which I redrew in Mathematica. At that point the answer wrote itself. Yes, using squares in the center band might be neater.

— whuber

Thanks! "Note that the inequality is strict: equality is not possible because there is some positive probability that either of $X$ or $Y$ is less than $\epsilon$ but nevertheless $X+Y \gt \epsilon$." I'm not sure I follow this. It seems to me the aim here is to show $\Pr(X+Y \le \epsilon) \lt \Pr(X \le \epsilon \cap Y \le \epsilon)$; doesn't this require a positive probability for some event $A$ in which both of $X$ and $Y$ are less than or equal to $\epsilon$ and yet $X + Y > \epsilon$? It is the "either of" vs "both of" I'm vacillating over.

— Silverfish

@Silverfish Thank you; I did not express that as I had intended. You are correct: the language is intended essentially to describe the portion of a little square not inside the triangle.

— whuber

I tried finding a proof without considering characteristic functions. Excess kurtosis does the trick. Here's the two-line answer: $\text{Kurt}(U) = \text{Kurt}(X + Y) = \text{Kurt}(X) / 2$ since $X$ and $Y$ are iid. Then $\text{Kurt}(U) = -1.2$ implies $\text{Kurt}(X) = -2.4$ which is a contradiction as $\text{Kurt}(X) \geq -2$ for any random variable.

Rather more interesting is the line of reasoning that got me to that point. $X$ (and $Y$ ) must be bounded between 0 and 0.5 - that much is obvious, but helpfully means that its moments and central moments exist. Let's start by considering the mean and variance: $\mathbb{E}(U)=0.5$ and $\text{Var}(U)=\frac{1}{12}$ . If $X$ and $Y$ are identically distributed then we have:

$\mathbb{E}(X + Y) = \mathbb{E}(X) + \mathbb{E}(Y) = 2 \mathbb{E}(X)= 0.5$

So $\mathbb{E}(X) = 0.25$ . For the variance we additionally need to use independence to apply:

$\text{Var}(X+Y) = \text{Var}(X) + \text{Var}(Y) = 2 \text{Var}(X) = \frac{1}{12}$

Hence $\text{Var}(X) = \frac{1}{24}$ and $\sigma_X = \frac{1}{2\sqrt{6}} \approx 0.204$ . Wow! That is a lot of variation for a random variable whose support ranges from 0 to 0.5. But we should have expected that, since the standard deviation isn't going to scale in the same way that the mean did.

Now, what's the largest standard deviation that a random variable can have if the smallest value it can take is 0, the largest value it can take is 0.5, and the mean is 0.25? Collecting all the probability at two point masses on the extremes, 0.25 away from the mean, would clearly give a standard deviation of 0.25. So our $\sigma_X$ is large but not impossible. (I hoped to show that this implied too much probability lay in the tails for $X + Y$ to be uniform, but I couldn't get anywhere with that on the back of an envelope.)
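The two-point bound in the paragraph above can be checked with a short calculation. This is a sketch assuming only that $0 \le X \le 1/2$ almost surely and $\mathbb{E}(X) = 1/4$:

$X\left(\tfrac{1}{2} - X\right) \ge 0 \quad\Longrightarrow\quad \mathbb{E}(X^2) \le \tfrac{1}{2}\,\mathbb{E}(X) = \tfrac{1}{8}$

$\text{Var}(X) = \mathbb{E}(X^2) - \mathbb{E}(X)^2 \le \tfrac{1}{8} - \tfrac{1}{16} = \tfrac{1}{16}, \qquad \sigma_X \le \tfrac{1}{4}$

Equality requires $X(1/2 - X) = 0$ almost surely, that is, all the mass on the two endpoints. The required value $\text{Var}(X) = \frac{1}{24}$ is below $\frac{1}{16}$, so the second moment alone gives no contradiction.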

Second moment considerations almost put an impossible constraint on $X$ so let's consider higher moments. What about Pearson's moment coefficient of skewness, $\gamma_1 = \frac{\mathbb{E}(X - \mu_X)^3}{\sigma_X^3} = \frac{\kappa_3}{\kappa_2^{3/2}}$ ? This exists since the central moments exist and $\sigma_X \neq 0$ . It is helpful to know some properties of the cumulants, in particular applying independence and then identical distribution gives:

$\kappa_i(U) = \kappa_i(X + Y) = \kappa_i(X) + \kappa_i(Y) = 2\kappa_i(X)$

This additivity property is precisely the generalisation of how we dealt with the mean and variance above - indeed, the first and second cumulants are just $\kappa_1 = \mu$ and $\kappa_2 = \sigma^2$ .

Then $\kappa_3(U) = 2\kappa_3(X)$ and $\big(\kappa_2(U)\big)^{3/2} = \big(2\kappa_2(X)\big)^{3/2} = 2^{3/2} \big(\kappa_2(X)\big)^{3/2}$ . The fraction for $\gamma_1$ cancels to yield $\text{Skew}(U) = \text{Skew}(X + Y) = \text{Skew}(X) / \sqrt{2}$ . Since the uniform distribution has zero skewness, so does $X$ , but I can't see how a contradiction arises from this restriction.

So instead, let's try the excess kurtosis, $\gamma_2 = \frac{\kappa_4}{\kappa_2^2} = \frac{\mathbb{E}(X - \mu_X)^4}{\sigma_X^4} - 3$ . By a similar argument (this question is self-study, so try it!), we can show this exists and obeys:

$\text{Kurt}(U) = \text{Kurt}(X + Y) = \text{Kurt}(X) / 2$
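The scaling rules for skewness and excess kurtosis can be sanity-checked exactly on a small example. This is a sketch only: the three-point distribution `pmf_x` and the helper names are arbitrary illustrative choices, and a numerical check on one distribution is not a proof of the general identity.

```python
from fractions import Fraction
from itertools import product

def convolve(pmf_x, pmf_y):
    """Distribution of X + Y for independent X, Y given as {value: probability}."""
    out = {}
    for (x, px), (y, py) in product(pmf_x.items(), pmf_y.items()):
        out[x + y] = out.get(x + y, 0) + px * py
    return out

def central_moment(pmf, k):
    """k-th central moment E[(X - mu)^k] of a discrete distribution."""
    mu = sum(v * p for v, p in pmf.items())
    return sum((v - mu) ** k * p for v, p in pmf.items())

def excess_kurtosis(pmf):
    return central_moment(pmf, 4) / central_moment(pmf, 2) ** 2 - 3

def skewness_squared(pmf):
    # Squared skewness avoids irrational square roots, keeping arithmetic exact.
    return central_moment(pmf, 3) ** 2 / central_moment(pmf, 2) ** 3

# Arbitrary asymmetric example distribution (hypothetical).
pmf_x = {Fraction(0): Fraction(1, 2), Fraction(1): Fraction(1, 3), Fraction(3): Fraction(1, 6)}
pmf_sum = convolve(pmf_x, pmf_x)  # distribution of X + Y with Y an independent copy of X

kurt_x, kurt_sum = excess_kurtosis(pmf_x), excess_kurtosis(pmf_sum)
skew2_x, skew2_sum = skewness_squared(pmf_x), skewness_squared(pmf_sum)
print(kurt_sum == kurt_x / 2, skew2_sum == skew2_x / 2)
```

Squared skewness halves, which matches $\text{Skew}(X+Y) = \text{Skew}(X)/\sqrt{2}$, and excess kurtosis halves as well.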

The uniform distribution has excess kurtosis $-1.2$ so we require $X$ to have excess kurtosis $-2.4$ . But the smallest possible excess kurtosis is $-2$ , which is achieved by the $\text{Binomial}(1, \frac{1}{2})$ Bernoulli distribution.
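The numbers quoted above can be reproduced with exact rational arithmetic. The sketch below assumes the standard raw moments $\mathbb{E}(U^j) = 1/(j+1)$ for the uniform distribution on $[0,1]$ and $\mathbb{E}(B^j) = 1/2$ for a Bernoulli$(1/2)$ variable; the helper names are illustrative.

```python
from fractions import Fraction
from math import comb

def central_moment_from_raw(raw, k):
    """k-th central moment from raw moments, where raw[j] = E[X^j] and raw[0] = 1."""
    mu = raw[1]
    return sum(comb(k, j) * raw[j] * (-mu) ** (k - j) for j in range(k + 1))

def excess_kurtosis(raw):
    m2 = central_moment_from_raw(raw, 2)
    m4 = central_moment_from_raw(raw, 4)
    return m4 / m2 ** 2 - 3

uniform_raw = [Fraction(1, j + 1) for j in range(5)]   # Uniform[0, 1]
bernoulli_raw = [Fraction(1)] + [Fraction(1, 2)] * 4   # Bernoulli(1/2)

kurt_u = excess_kurtosis(uniform_raw)
kurt_bern = excess_kurtosis(bernoulli_raw)
required_kurt_x = 2 * kurt_u  # from Kurt(U) = Kurt(X) / 2

print(kurt_u, kurt_bern, required_kurt_x)
```

The required value for $X$ falls below the Bernoulli value, which is the lower bound cited in the answer.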

— Silverfish

(+1) This is a quite clever approach, which was new to me. Thanks. Note that some of your analysis could have been streamlined by considering a uniform centered at zero. (The equivalence of the problem is immediate.) That would have immediately told you that considering skew was a dead-end.

— cardinal

@cardinal: I knew the skew was a dead-end before I worked on it. The purpose was expository: it's a self-study question so I didn't want to solve it in full! Rather I wanted to leave a hint on how to deal with the next level up...

— Silverfish

@cardinal: I was in two minds whether to center or not. I did back-of-envelope calculations more conveniently, but in the final analysis we just need (1) a simple case of the general result that $\text{Kurt}(X_1 + \dots + X_n) = \frac{1}{n}\text{Kurt}(X)$ for iid $X_i$, (2) that $\text{Kurt}(U) = -1.2$ for any uniform distribution, and (3) that $\text{Kurt}(X)$ exists since $X$ is bounded and $\sigma_X \neq 0$ (which is trivial, else $\sigma_U = 0$). So none of the key results actually required centering, though bits may have looked less ugly!

— Silverfish

Yes, the word "streamlined" was carefully chosen. :-) I did not intend my comment to be read as criticism of your exposition. Cheers.

— cardinal

@cardinal Incidentally, variance considerations alone almost worked, but the uniform isn't quite spread out enough. With a bit more probability mass nearer the extremes, e.g. $f_T(t)=12t^2$ on $[-0.5, 0.5]$, then $\text{Var}(T)=0.15$ and if $T = X_1 + X_2$ then $\sigma_X = \sqrt{0.15/2} \approx 0.27 > 0.25$, which is impossible as $X$ is bounded by $-0.25$ and $0.25$. Of course, you will see immediately how this relates to the present example! I wonder if the approach generalises; I'm sure other bounded RVs can't be decomposed into sums but require even higher moments to be investigated to find the contradiction.

— Silverfish
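The variance comparison in the last comment can be reproduced exactly. This is a sketch: the polynomial-integration helpers are illustrative, and the bound used is that a mean-zero variable confined to $[-1/4, 1/4]$ has variance at most $(1/4)^2$.

```python
from fractions import Fraction

def poly_integral(coeffs, lo, hi):
    """Exact integral over [lo, hi] of sum(c * t**k), where coeffs[k] multiplies t**k."""
    return sum(c * (hi ** (k + 1) - lo ** (k + 1)) / (k + 1)
               for k, c in enumerate(coeffs))

def variance_of_poly_density(coeffs, lo, hi):
    """Variance of a density given as a polynomial on [lo, hi]."""
    mean = poly_integral([0] + list(coeffs), lo, hi)       # integrates t * f(t)
    second = poly_integral([0, 0] + list(coeffs), lo, hi)  # integrates t^2 * f(t)
    return second - mean ** 2

half = Fraction(1, 2)
var_t = variance_of_poly_density([0, 0, 12], -half, half)  # f_T(t) = 12 t^2
var_u = variance_of_poly_density([1], -half, half)         # centered uniform

# If the sum of two iid copies of X lives on [-1/2, 1/2], then X lives on
# [-1/4, 1/4] with mean zero, so Var(X) cannot exceed (1/4)^2.
max_var_x = Fraction(1, 4) ** 2
t_possible = var_t / 2 <= max_var_x
u_possible = var_u / 2 <= max_var_x

print(var_t, var_u, t_possible, u_possible)
```

The variance test rules out the $12t^2$ density but not the uniform one, which is why the kurtosis argument was needed for the uniform case.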