These are reading notes on Introduction to Probability.

Joint PMF

Consider two discrete random variables $X$ and $Y$ associated with the same experiment. The probabilities of the values that $X$ and $Y$ can take are captured by the joint PMF of $X$ and $Y$, denoted $p_{X,Y}$ and defined by
$p_{X,Y}(x,y)=P(X=x,Y=y)$
The joint PMF determines the PMFs of $X$ and $Y$ through the formulas
$p_X(x)=\sum_yp_{X,Y}(x,y),\qquad p_Y(y)=\sum_xp_{X,Y}(x,y)$
We sometimes refer to $p_X$ and $p_Y$ as the marginal PMFs, to distinguish them from the joint PMF.
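
The marginalization formulas can be sketched in a few lines of Python. The joint PMF below is a made-up example (not from the book), stored as a dictionary keyed by $(x, y)$ pairs; the helper name `marginals` is likewise only illustrative.

```python
from collections import defaultdict
from fractions import Fraction

# Hypothetical joint PMF p_{X,Y}(x, y), stored as {(x, y): probability}.
joint_pmf = {
    (1, 1): Fraction(1, 10), (1, 2): Fraction(2, 10),
    (2, 1): Fraction(3, 10), (2, 2): Fraction(4, 10),
}

def marginals(joint):
    """Return (p_X, p_Y) by summing the joint PMF over the other variable."""
    p_x, p_y = defaultdict(Fraction), defaultdict(Fraction)
    for (x, y), p in joint.items():
        p_x[x] += p  # sum over y for fixed x
        p_y[y] += p  # sum over x for fixed y
    return dict(p_x), dict(p_y)

p_x, p_y = marginals(joint_pmf)
```

Exact fractions are used so that the marginals can be compared without rounding error.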

Problem 26. PMF of the minimum of several random variables.
On a given day, your golf score takes values from the range 101 to 110, each with probability 0.1, independent of other days. Determined to improve your score, you decide to play on three different days and declare as your score the minimum $X$ of the scores $X_1$, $X_2$, and $X_3$ on the different days. Calculate the PMF of $X$.

SOLUTION
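We have $P(X > 100) = 1$ and, for $k = 101, \ldots, 110$,
$P(X>k)=P(X_1>k,X_2>k,X_3>k)=P(X_1>k)P(X_2>k)P(X_3>k)=\frac{(110-k)^3}{10^3}$
It follows that
$p_X(k)=P(X>k-1)-P(X>k)=\frac{(111-k)^3-(110-k)^3}{10^3}$ for $k=101,\ldots,110$, and $p_X(k)=0$ otherwise.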
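
As a sanity check, the PMF of the minimum of the three scores can be computed by brute force. The sketch below enumerates all $10^3$ equally likely score triples and compares the result with the closed form $p_X(k)=\frac{(111-k)^3-(110-k)^3}{10^3}$; the variable names are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

scores = range(101, 111)  # each daily score has probability 1/10

# Enumerate all 10^3 equally likely triples and tally the minimum.
counts = Counter(min(triple) for triple in product(scores, repeat=3))
pmf_min = {k: Fraction(counts[k], 1000) for k in scores}

# Closed form: P(X > k) = ((110 - k) / 10)^3, so
# p_X(k) = P(X > k - 1) - P(X > k).
closed_form = {
    k: Fraction((111 - k) ** 3 - (110 - k) ** 3, 1000) for k in scores
}
```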

Functions of Multiple Random Variables

A function $Z = g (X, Y)$ of the random variables $X$ and $Y$ defines another random variable. Its PMF can be calculated from the joint PMF $p_{X,Y}$ according to
$p_Z(z)=\sum_{\{(x,y)|g(x,y)=z\}}p_{X,Y}(x,y)$
Furthermore, the expected value rule for functions naturally extends and takes the form
$E[g(X,Y)]=\sum_x\sum_y g(x,y)p_{X,Y}(x,y)$
In the special case where $g$ is linear and of the form $a X + b Y + c$, where $a$, $b$, and $c$ are given scalars, we have
$E [a X + b Y + c] = a E [X] + b E [Y] + c$
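
A minimal sketch of both rules, assuming a made-up joint PMF (two independent fair four-sided dice) and illustrative helper names `pmf_of_function` and `expectation`:

```python
from collections import defaultdict
from fractions import Fraction

# Hypothetical joint PMF: two independent fair four-sided dice.
joint_pmf = {(x, y): Fraction(1, 16) for x in range(1, 5) for y in range(1, 5)}

def pmf_of_function(joint, g):
    """PMF of Z = g(X, Y): add p_{X,Y}(x, y) over all pairs with g(x, y) = z."""
    p_z = defaultdict(Fraction)
    for (x, y), p in joint.items():
        p_z[g(x, y)] += p
    return dict(p_z)

def expectation(joint, g):
    """Expected value rule: E[g(X, Y)] = sum of g(x, y) * p_{X,Y}(x, y)."""
    return sum(g(x, y) * p for (x, y), p in joint.items())

p_max = pmf_of_function(joint_pmf, max)            # Z = max(X, Y)
e_x = expectation(joint_pmf, lambda x, y: x)
e_y = expectation(joint_pmf, lambda x, y: y)
e_linear = expectation(joint_pmf, lambda x, y: 2 * x + 3 * y + 1)
```

Here `e_linear` is computed from the joint PMF directly, so comparing it with `2 * e_x + 3 * e_y + 1` checks the linearity formula on this example.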

More than Two Random Variables

The joint PMF of three random variables $X$, $Y$, and $Z$ is defined in analogy with the above as
$p_{X,Y,Z}(x, y, z) = P(X = x, Y = y, Z = z)$
for all possible triplets of numerical values $(x, y, z)$. Corresponding marginal PMFs are analogously obtained by equations such as
$p_{X,Y}(x,y)=\sum_zp_{X,Y,Z}(x, y, z)$
and
$p_X(x)=\sum_y\sum_zp_{X,Y,Z}(x, y, z)$
The expected value rule for functions is given by
$E[g(X,Y,Z)]=\sum_x\sum_y\sum_zg(x,y,z)p_{X,Y,Z}(x, y, z)$
and if $g$ is linear and has the form $a X + b Y + c Z + d$, then
$E [a X + b Y + c Z + d] = a E [X] + b E [Y] + c E [Z] + d$
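
The same bookkeeping extends to three variables. The sketch below assumes a made-up joint PMF (three independent fair coin flips coded as 0 or 1) and an illustrative helper `marginal` that sums out every coordinate not listed in `keep`.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Hypothetical joint PMF of (X, Y, Z): three independent fair coin flips.
joint_pmf = {xyz: Fraction(1, 8) for xyz in product((0, 1), repeat=3)}

def marginal(joint, keep):
    """Sum out every coordinate whose index is not listed in `keep`."""
    result = defaultdict(Fraction)
    for outcome, p in joint.items():
        result[tuple(outcome[i] for i in keep)] += p
    return dict(result)

p_xy = marginal(joint_pmf, keep=(0, 1))  # p_{X,Y}, keyed by (x, y)
p_x = marginal(joint_pmf, keep=(0,))     # p_X, keyed by (x,)

# Expected value rule with a linear g, compared against linearity.
a, b, c, d = 2, -1, 3, 5
lhs = sum((a * x + b * y + c * z + d) * p for (x, y, z), p in joint_pmf.items())
half = Fraction(1, 2)                    # E[X] = E[Y] = E[Z] = 1/2 here
rhs = a * half + b * half + c * half + d
```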

Example 2.10. Mean of the Binomial
Your probability class has 300 students and each student has probability $1 / 3$ of getting an A, independent of any other student. What is the mean of $X$ , the number of students that get an A?

Let
$X_i=\begin{cases}1, & \text{if the } i\text{th student gets an A,}\\ 0, & \text{otherwise.}\end{cases}$
Thus $X_1, X_2, \ldots, X_n$ are Bernoulli random variables with common mean $p = 1/3$. Their sum
$X=X_1+X_2+\cdots+X_n$
is the number of students that get an A. Since $X$ is the number of “successes” in $n$ independent trials, it is a binomial random variable with parameters $n$ and $p$.
Using the linearity of $X$ as a function of the $X_i$, we have
$E[X]=\sum_{i=1}^{300}E[X_i]=\sum_{i=1}^{300}\frac{1}{3}=100$
If we repeat this calculation for a general number of students $n$ and probability of A equal to $p$, we obtain
$E[X]=\sum_{i=1}^nE[X_i]=\sum_{i=1}^np=np$
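
The result $E[X]=np$ can be cross-checked without linearity by summing $k\,p_X(k)$ over the binomial PMF. A minimal sketch using exact fractions (the function name `binomial_mean` is illustrative):

```python
from fractions import Fraction
from math import comb

def binomial_mean(n, p):
    """E[X] computed directly from the binomial PMF, without using linearity."""
    return sum(k * comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1))

# The class of 300 students, each getting an A with probability 1/3.
mean_class = binomial_mean(300, Fraction(1, 3))
```

The direct sum needs all 301 terms, whereas the linearity argument gives the same value in one line.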

Example 2.11. The Hat Problem.
Suppose that $n$ people throw their hats in a box and then each picks one hat at random. (Each hat can be picked by only one person, and each assignment of hats to persons is equally likely.) What is the expected value of $X$ , the number of people that get back their own hat?

For the $i$th person, we introduce a random variable $X_i$ that takes the value 1 if the person selects his/her own hat, and takes the value 0 otherwise. Since $P(X_i = 1) = 1/n$ and $P(X_i = 0) = 1 - 1/n$, the mean of $X_i$ is $1/n$.
We now have
$X = X_1 + X_2 + \cdots + X_n$
so that
$E[X] = E[X_1] + E[X_2] + \cdots + E[X_n] = n\cdot \frac{1}{n}= 1$
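
Note that the $X_i$ are not independent, yet linearity still applies. For small $n$ the answer can be confirmed by enumerating all $n!$ equally likely hat assignments; the sketch below does this with an illustrative helper `expected_matches`.

```python
from fractions import Fraction
from itertools import permutations

def expected_matches(n):
    """Average number of people who get their own hat, over all n! assignments."""
    total, count = 0, 0
    for assignment in permutations(range(n)):
        # assignment[person] is the hat that person picks.
        total += sum(1 for person, hat in enumerate(assignment) if person == hat)
        count += 1
    return Fraction(total, count)

results = {n: expected_matches(n) for n in range(1, 8)}
```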

Problem 27. The multinomial distribution.
A die with $r$ faces, numbered $1, \ldots, r$, is rolled a fixed number of times $n$. The probability that the $i$th face comes up on any one roll is denoted $p_i$, and the results of different rolls are assumed independent. Let $X_i$ be the number of times that the $i$th face comes up. Find $E[X_iX_j]$ for $i\neq j$.

SOLUTION

Let $Y_{i,k}$ (or $Y_{j,k}$) be the Bernoulli random variable that takes the value 1 if face $i$ (respectively, $j$) comes up on the $k$th roll, and the value 0 otherwise. Note that $Y_{i,k}Y_{j,k}= 0$, since faces $i$ and $j$ cannot both come up on the same roll, and that for $l\neq k$, $Y_{i,k}$ and $Y_{j,l}$ are independent, so that $E[Y_{i,k}Y_{j,l}] = p_ip_j$. Of the $n^2$ terms in the expanded product below, the $n$ terms with equal roll indices vanish and the remaining $n(n-1)$ terms all have the same expectation. Therefore,
$\begin{aligned}E[X_iX_j]&= E[(Y_{i,1}+\cdots+Y_{i,n})(Y_{j,1}+\cdots+Y_{j,n})] \\&=n(n -1)E[Y_{i,1}Y_{j,2}] \\&=n(n-1)p_ip_j \end{aligned}$
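
The formula $E[X_iX_j]=n(n-1)p_ip_j$ can be checked by enumerating every roll sequence for a small die. The die below (three faces with probabilities $1/2$, $1/3$, $1/6$, rolled four times) is a made-up example, and faces are 0-indexed in the code.

```python
from fractions import Fraction
from itertools import product

def expected_product(probs, n, i, j):
    """E[X_i X_j] by enumerating all r^n roll sequences (faces are 0-indexed)."""
    r = len(probs)
    total = Fraction(0)
    for rolls in product(range(r), repeat=n):
        prob = Fraction(1)
        for face in rolls:
            prob *= probs[face]  # rolls are independent
        total += rolls.count(i) * rolls.count(j) * prob
    return total

# Hypothetical three-faced die rolled n = 4 times.
probs = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]
n = 4
value = expected_product(probs, n, 0, 1)
```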

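Problem 29. The inclusion-exclusion formula.
Let $A_1, A_2, \ldots, A_n$ be events. Let $S_1 = \{ i|1 \leq i\leq n\}$, $S_2 = \{ (i_1 , i_2 ) |1\leq i_1 < i_2\leq n\}$, and more generally, let $S_m$ be the set of all $m$-tuples $(i_1, \ldots, i_m)$ of indices that satisfy $1\leq i_1< i_2<\cdots< i_m\leq n$. Show that
$P(\cup_{k=1}^nA_k)=\sum_{i\in S_1}P(A_i)-\sum_{(i_1,i_2)\in S_2}P(A_{i_1}\cap A_{i_2})+\sum_{(i_1,i_2,i_3)\in S_3}P(A_{i_1}\cap A_{i_2}\cap A_{i_3})-\cdots+(-1)^{n-1}P(\cap_{k=1}^nA_k)$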

Hint: Let $X_i$ be a binary random variable which is equal to 1 when $A_i$ occurs, and equal to 0 otherwise. Relate the event of interest to the random variable $(1 -X_1)(1 -X_2)\cdots(1 - X_n)$.

SOLUTION

Let us express the event $B=\cup_{k=1}^n A_k$ in terms of the random variables $X_1, \ldots, X_n$. The event $B^C$ occurs when all of the $X_i$ are zero, that is, when $Y = (1 -X_1)(1 -X_2)\cdots(1 - X_n)$ is equal to $1$. Since $Y$ takes only the values 0 and 1,
$P(B^C) = P(Y = 1) = E[Y]$
Therefore,
$\begin{aligned}P(B)&=1-E[(1 -X_1)(1 -X_2)\cdots(1 - X_n)]\\ &=E[X_1+\cdots+X_n]-E[\sum_{(i_1,i_2)\in S_2}X_{i_1}X_{i_2}]+\cdots+(-1)^{n-1}E[X_1\cdots X_n]\end{aligned}$
We note that
$E[X_i]=P(A_i),\qquad E[X_{i_1}X_{i_2}]=P(A_{i_1}\cap A_{i_2})$
$E[X_{i_1}X_{i_2}X_{i_3}]=P(A_{i_1}\cap A_{i_2}\cap A_{i_3}),\qquad E[X_1X_2\cdots X_n]=P(\cap_{k=1}^nA_k)$
etc., from which the desired formula follows.
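
The inclusion-exclusion formula is easy to test numerically on a finite sample space. The sketch below assumes a made-up setup (a fair 12-sided die and three events given as sets of outcomes) and compares the probability of the union with the alternating sum over all $m$-element index sets.

```python
from fractions import Fraction
from itertools import combinations

# Hypothetical setup: a fair 12-sided die and three events (sets of outcomes).
outcomes = range(1, 13)
events = [
    {w for w in outcomes if w % 2 == 0},  # A_1: even
    {w for w in outcomes if w % 3 == 0},  # A_2: multiple of 3
    {w for w in outcomes if w <= 4},      # A_3: at most 4
]

def prob(event):
    """Probability of an event under the uniform law on `outcomes`."""
    return Fraction(len(event), len(outcomes))

# Left-hand side: probability of the union.
p_union = prob(set().union(*events))

# Right-hand side: alternating sum over all m-element subsets of the events.
p_formula = Fraction(0)
for m in range(1, len(events) + 1):
    sign = (-1) ** (m - 1)
    for subset in combinations(events, m):
        p_formula += sign * prob(set.intersection(*subset))
```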

Problem 30.
Alvin’s database of friends contains $n$ entries, but due to a software glitch, the addresses correspond to the names in a totally random fashion. Alvin writes a holiday card to each of his friends and sends it to the (software-corrupted) address. What is the probability that at least one of his friends will get the correct card?

Hint: Use the inclusion-exclusion formula.

SOLUTION

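Let $A_k$ be the event that the $k$th card is sent to the correct address. We have for any distinct $k, j, i$,
$P(A_k)=\frac{1}{n}=\frac{(n-1)!}{n!}$
$P(A_k\cap A_j)=P(A_k)P(A_j|A_k)=\frac{1}{n}\cdot\frac{1}{n-1}=\frac{(n-2)!}{n!}$
$P(A_k\cap A_j\cap A_i)=\frac{1}{n}\cdot\frac{1}{n-1}\cdot\frac{1}{n-2}=\frac{(n-3)!}{n!}$
etc., and
$P(\cap_{k=1}^nA_k)=\frac{1}{n!}$
Applying the inclusion-exclusion formula, we obtain the desired probability
$\begin{aligned}P(\cup_{k=1}^nA_k)&=\binom{n}{1}\frac{(n-1)!}{n!}-\binom{n}{2}\frac{(n-2)!}{n!}+\binom{n}{3}\frac{(n-3)!}{n!}-\cdots+(-1)^{n-1}\frac{1}{n!}\\&=1-\frac{1}{2!}+\frac{1}{3!}-\cdots+(-1)^{n-1}\frac{1}{n!}\end{aligned}$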

When $n$ is large, this probability can be approximated by $1 - e^{-1}$ .
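
A short sketch of how quickly the approximation sets in. It evaluates the alternating series $1-\frac{1}{2!}+\frac{1}{3!}-\cdots+(-1)^{n-1}\frac{1}{n!}$ exactly, cross-checks it against brute-force enumeration of all $n!$ address assignments for small $n$, and measures the gap to $1-e^{-1}$ at $n=10$. The function names are illustrative.

```python
from fractions import Fraction
from itertools import permutations
from math import exp, factorial

def p_at_least_one_match(n):
    """Alternating series 1 - 1/2! + 1/3! - ... + (-1)^(n-1)/n!."""
    return sum(Fraction((-1) ** (k - 1), factorial(k)) for k in range(1, n + 1))

def p_by_enumeration(n):
    """Fraction of the n! address assignments with at least one correct card."""
    hits = sum(
        any(card == friend for friend, card in enumerate(perm))
        for perm in permutations(range(n))
    )
    return Fraction(hits, factorial(n))

# Distance between the exact value for n = 10 and the limit 1 - 1/e.
gap = abs(float(p_at_least_one_match(10)) - (1 - exp(-1)))
```

Because the series alternates with factorially shrinking terms, the error after $n$ terms is below $\frac{1}{(n+1)!}$, so the limit is already a good approximation for modest $n$.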

Chapter 2 (Discrete Random Variables): Joint PMFs of Multiple Random Variables
