Chapter 3 (General Random Variables): Continuous Random Variables and PDFs

This post contains reading notes for *Introduction to Probability*.

Continuous Random Variables and PDFs

  • A random variable $X$ is called continuous if there is a nonnegative function $f_X$, called the probability density function of $X$, or PDF for short, such that
    $$P(X\in B)=\int_B f_X(x)\,dx$$
    for every subset $B$ of the real line. Note that to qualify as a PDF, a function $f_X$ must be nonnegative, and must also have the normalization property
    $$\int_{-\infty}^\infty f_X(x)\,dx=P(-\infty<X<\infty)=1$$
  • In particular, the probability that the value of $X$ falls within an interval is
    $$P(a\leq X\leq b)=\int_a^b f_X(x)\,dx$$
    For any single value $a$, we have
    $$P(X=a)=\int_a^a f_X(x)\,dx=0$$
    For this reason, including or excluding the endpoints of an interval has no effect on its probability:
    $$P(a\leq X\leq b)=P(a< X<b)=P(a\leq X< b)=P(a< X\leq b)$$
  • To interpret the PDF, note that for an interval $[x,x+\delta]$ with a very small length $\delta$, we have
    $$P([x,x+\delta])=\int_{x}^{x+\delta}f_X(t)\,dt\approx f_X(x)\cdot\delta$$
    so we can view $f_X(x)$ as the "probability mass per unit length" near $x$. It is important to realize that even though a PDF is used to calculate event probabilities, $f_X(x)$ is not the probability of any particular event. In particular, it is not restricted to be less than or equal to one.
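As a quick numerical illustration of these definitions (not from the book), the sketch below uses SciPy's `quad` on a density of my own choosing, $f(x)=2x$ on $[0,1]$: it checks the normalization property, computes an interval probability, and shows the $f_X(x)\cdot\delta$ approximation for a small interval.

```python
# Minimal sketch: treating f(x) = 2x on [0, 1] as a PDF and checking
# the defining properties numerically (f is my own example density).
from scipy.integrate import quad

def f(x):
    return 2 * x if 0 <= x <= 1 else 0.0

total, _ = quad(f, 0, 1)               # normalization: should be 1
p_ab, _ = quad(f, 0.2, 0.5)            # P(0.2 <= X <= 0.5) = 0.25 - 0.04
delta = 1e-4
p_tiny, _ = quad(f, 0.5, 0.5 + delta)  # P(X in [x, x + delta]) for tiny delta

print(total)                   # ~1.0
print(p_ab)                    # ~0.21
print(p_tiny, f(0.5) * delta)  # both ~1e-4: the "mass per unit length" reading
```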

Example 3.2. Piecewise Constant PDF.
Alvin’s driving time to work is between 15 and 20 minutes if the day is sunny, and between 20 and 25 minutes if the day is rainy, with all times being equally likely in each case. Assume that a day is sunny with probability $2/3$ and rainy with probability $1/3$. What is the PDF of the driving time, viewed as a random variable $X$?

SOLUTION

  • The PDF has the piecewise-constant form
    $$f_X(x)=\begin{cases}c_1,&\text{if }15\leq x<20,\\ c_2,&\text{if }20\leq x\leq 25,\\ 0,&\text{otherwise,}\end{cases}$$
    where $c_1$ and $c_2$ are some constants. We can determine these constants by using the given probabilities of a sunny and of a rainy day:
    $$\frac{2}{3}=P(\text{sunny day})=\int_{15}^{20}f_X(x)\,dx=5c_1,\qquad c_1=\frac{2}{15}$$
    $$\frac{1}{3}=P(\text{rainy day})=\int_{20}^{25}f_X(x)\,dx=5c_2,\qquad c_2=\frac{1}{15}$$
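Assuming SciPy is available, here is a small sanity check of this piecewise-constant PDF (the function name `f_X` is mine): it recovers the sunny and rainy probabilities and the normalization.

```python
# Sketch: the piecewise-constant driving-time PDF of Example 3.2.
from scipy.integrate import quad

def f_X(x):
    if 15 <= x < 20:
        return 2 / 15   # sunny-day slab, c1
    if 20 <= x <= 25:
        return 1 / 15   # rainy-day slab, c2
    return 0.0

print(quad(f_X, 15, 20)[0])               # ~2/3 = P(sunny day)
print(quad(f_X, 20, 25)[0])               # ~1/3 = P(rainy day)
print(quad(f_X, 15, 25, points=[20])[0])  # ~1.0, normalization (jump at 20)
```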

Example 3.3. A PDF Can Take Arbitrarily Large Values.

  • Consider a random variable $X$ with PDF
    $$f_X(x)=\begin{cases}\dfrac{1}{2\sqrt{x}},&\text{if }0<x\leq 1,\\ 0,&\text{otherwise.}\end{cases}$$
  • Even though $f_X(x)$ becomes infinitely large as $x$ approaches zero, this is still a valid PDF, because
    $$\int_{-\infty}^\infty f_X(x)\,dx=\int_0^1\frac{1}{2\sqrt x}\,dx=\sqrt{x}\,\Big|_0^1=1$$
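A numerical check (my own sketch): `quad` copes with the integrable singularity at 0, and the density near 0 is far above 1 without violating anything.

```python
# Sketch: an unbounded but valid PDF, f(x) = 1/(2*sqrt(x)) on (0, 1].
from scipy.integrate import quad

f = lambda x: 1 / (2 * x ** 0.5)

total, _ = quad(f, 0, 1)   # integrable singularity at the left endpoint
print(total)               # ~1.0, so this is a legitimate PDF
print(f(1e-6))             # ~500: a PDF value may far exceed 1
```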

Expectation

  • The expected value or expectation or mean of a continuous random variable $X$ is defined by
    $$E[X]=\int_{-\infty}^\infty xf_X(x)\,dx$$
  • If $X$ is a continuous random variable with a given PDF, any real-valued function $Y = g(X)$ of $X$ is also a random variable, and its mean satisfies the expected value rule
    $$E[g(X)]=\int_{-\infty}^\infty g(x)f_X(x)\,dx$$
  • The variance of $X$ is defined by
    $$\begin{aligned}\text{var}(X)&=E\big[(X-E[X])^2\big]=\int_{-\infty}^\infty (x-E[X])^2f_X(x)\,dx \\&=E[X^2]-(E[X])^2\end{aligned}$$
  • If $Y = aX + b$, where $a$ and $b$ are given scalars, then
    $$E[Y]=aE[X]+b,\qquad \text{var}(Y)=a^2\,\text{var}(X)$$
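The sketch below (my own check, reusing the toy density $f(x)=2x$ on $[0,1]$ from earlier) verifies the expected value rule, the variance formula, and the linear-transformation rules by direct quadrature.

```python
# Sketch: E[g(X)], var(X), and the rules for Y = aX + b, with f(x) = 2x on [0, 1].
from scipy.integrate import quad

f = lambda x: 2 * x
EX  = quad(lambda x: x * f(x), 0, 1)[0]       # E[X] = 2/3
EX2 = quad(lambda x: x**2 * f(x), 0, 1)[0]    # E[X^2] = 1/2
var = EX2 - EX**2                             # 1/18

a, b = 3.0, -1.0
EY   = quad(lambda x: (a * x + b) * f(x), 0, 1)[0]        # rule with g(x) = ax + b
varY = quad(lambda x: (a * x + b - EY)**2 * f(x), 0, 1)[0]

print(EX, var)            # 0.666..., 0.0555...
print(EY, a * EX + b)     # both 1.0: E[Y] = aE[X] + b
print(varY, a**2 * var)   # both 0.5: var(Y) = a^2 var(X)
```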

One has to deal with the possibility that the integral $\int_{-\infty}^\infty xf_X(x)\,dx$ is infinite or undefined. More concretely, we will say that the expectation is well-defined if $\int_{-\infty}^\infty |x|f_X(x)\,dx<\infty$. In that case, it is known that the integral $\int_{-\infty}^\infty xf_X(x)\,dx$ takes a finite and unambiguous value. Throughout this book, in the absence of an indication to the contrary, we implicitly assume that the expected value of any random variable of interest is well-defined.
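To see how this caveat can bite, consider the standard Cauchy density $f(x)=1/(\pi(1+x^2))$ (my example, not from this section): $\int|x|f(x)\,dx$ grows without bound, so $E[X]$ is not well-defined.

```python
# Sketch: for the Cauchy density, the integral of |x| f(x) diverges,
# so the expectation is not well-defined.
from math import pi
from scipy.integrate import quad

f = lambda x: 1 / (pi * (1 + x ** 2))

for L in (1e2, 1e4, 1e6):
    val, _ = quad(lambda x: abs(x) * f(x), -L, L, limit=200)
    print(L, val)   # grows like (2/pi) * ln(L): no finite limit
```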


Example 3.4. Mean and Variance of the Uniform Random Variable.

  • We can consider a random variable $X$ that takes values in an interval $[a, b]$, and again assume that any two subintervals of the same length have the same probability. We refer to this type of random variable as uniform or uniformly distributed. Its PDF has the form
    $$f_X(x)=\begin{cases}\dfrac{1}{b-a},&\text{if }a\leq x\leq b,\\ 0,&\text{otherwise.}\end{cases}$$
    $$E[X]=\int_{-\infty}^\infty xf_X(x)\,dx=\int_a^b\frac{x}{b-a}\,dx=\frac{a+b}{2}$$
    $$E[X^2]=\int_a^b\frac{x^2}{b-a}\,dx=\frac{a^2+ab+b^2}{3}$$
    $$\text{var}(X)=E[X^2]-(E[X])^2=\frac{(b-a)^2}{12}$$
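A Monte Carlo sanity check of these formulas (my sketch; the endpoints $a=2$, $b=8$ are arbitrary):

```python
# Sketch: uniform mean and variance against (a + b)/2 and (b - a)^2 / 12.
import numpy as np

rng = np.random.default_rng(0)
a, b = 2.0, 8.0
x = rng.uniform(a, b, size=1_000_000)

print(x.mean(), (a + b) / 2)       # both ~5.0
print(x.var(), (b - a) ** 2 / 12)  # both ~3.0
```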

Problem 3.
Show that the expected value of a discrete or continuous random variable $X$ satisfies
$$E[X] = \int_0^\infty P(X > x)\, dx - \int_0^\infty P(X < -x)\, dx$$

SOLUTION

  • Suppose that $X$ is continuous. We then have
    $$\begin{aligned}\int_0^\infty P(X > x)\,dx &=\int_0^\infty \Big(\int_x^\infty f_X(y)\,dy\Big)dx \\&=\int_0^\infty \Big(\int_0^y f_X(y)\,dx\Big)dy \\&=\int_0^\infty f_X(y)\Big(\int_0^y dx\Big)dy \\&=\int_0^\infty yf_X(y)\,dy\end{aligned}$$
    where for the second equality we have reversed the order of integration by writing the set $\{(x, y) \mid 0\leq x <\infty,\ x\leq y <\infty\}$ as $\{(x, y) \mid 0\leq x\leq y,\ 0\leq y <\infty\}$. Similarly, we can show that
    $$\int_0^\infty P(X < -x)\, dx = - \int^0_{-\infty} yf_X(y)\, dy$$
    Combining the two relations above, we obtain the desired result. (A numerical check of this identity appears after this list.)
  • If $X$ is discrete, we have
    $$\begin{aligned}\int_0^\infty P(X > x)\,dx &=\int_0^\infty\Big(\sum_{y>x}p_X(y)\Big)dx \\&=\sum_{y>0}\Big(\int_0^y p_X(y)\,dx\Big) \\&=\sum_{y>0}p_X(y)\Big(\int_0^y dx\Big) \\&=\sum_{y>0} p_X(y)\,y\end{aligned}$$
    and the rest of the argument is similar to the continuous case.
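As noted above, here is the promised numerical check of the identity (my sketch, using a normal random variable as the test case):

```python
# Sketch: E[X] = ∫ P(X > x) dx − ∫ P(X < −x) dx for X ~ N(1.5, 2^2).
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

X = norm(loc=1.5, scale=2.0)   # E[X] should be 1.5

pos, _ = quad(X.sf, 0, np.inf)                 # ∫ P(X > x) dx over [0, ∞)
neg, _ = quad(lambda x: X.cdf(-x), 0, np.inf)  # ∫ P(X < -x) dx over [0, ∞)

print(pos - neg)   # ~1.5, matching E[X]
```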

Problem 4.
Establish the validity of the expected value rule
$$E[g(X)]=\int_{-\infty}^\infty g(x)f_X(x)\,dx$$
where $X$ is a continuous random variable with PDF $f_X$.

SOLUTION

  • Let us express the function $g$ as the difference of two nonnegative functions,
    $$g(x) =g^+(x)-g^-(x)$$
    where $g^+(x)= \max\{g(x),0\}$ and $g^-(x) = \max\{-g(x),0\}$. We will use the result
    $$E[g(X)] = \int_0^\infty P(g(X)> x)\, dx - \int_0^\infty P(g(X)< -x)\, dx$$
    from the preceding problem. The first term on the right-hand side is equal to
    $$\int_0^\infty \int_{\{x\mid g(x)>t\}}f_X(x)\, dx\, dt = \int_{-\infty}^\infty\int_{\{t\mid 0\leq t<g(x)\}}f_X(x)\,dt\,dx=\int_{-\infty}^\infty f_X(x)g^+(x)\,dx$$
    By a symmetrical argument, the second term on the right-hand side is given by
    $$\int_{-\infty}^\infty f_X(x)g^-(x)\,dx$$
  • Combining the above equalities, we obtain
    $$E[g(X)] =\int_{-\infty}^\infty f_X(x)g^+(x)\,dx-\int_{-\infty}^\infty f_X(x)g^-(x)\,dx=\int_{-\infty}^\infty f_X(x)g(x)\,dx$$
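A two-way numerical check of the rule (my sketch): quadrature of $g(x)f_X(x)$ against a direct simulation of $g(X)$, for $g(x)=x^2$ and $X$ uniform on $[0,1]$.

```python
# Sketch: the expected value rule vs. simulation for g(x) = x^2, X ~ U[0, 1].
import numpy as np
from scipy.integrate import quad

f = lambda x: 1.0        # uniform density on [0, 1]
g = lambda x: x ** 2

rule, _ = quad(lambda x: g(x) * f(x), 0, 1)   # ∫ g(x) f_X(x) dx = 1/3

rng = np.random.default_rng(0)
sim = g(rng.uniform(0, 1, size=1_000_000)).mean()

print(rule, sim)   # both ~0.333
```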

Exponential Random Variable


  • An exponential random variable has a PDF of the form
    $$f_X(x)=\begin{cases}\lambda e^{-\lambda x},&\text{if }x\geq 0,\\ 0,&\text{otherwise,}\end{cases}$$
    where $\lambda$ is a positive parameter characterizing the PDF. This is a legitimate PDF because
    $$\int_{-\infty}^\infty f_X(x)\,dx=\int_0^\infty\lambda e^{-\lambda x}\,dx=1$$
  • Note that the probability that $X$ exceeds a certain value decreases exponentially. Indeed, for any $a \geq 0$, we have
    $$P(X\geq a)=\int_a^\infty\lambda e^{-\lambda x}\,dx=e^{-\lambda a}$$
    so that, for an interval, $P(a\leq X \leq b)=P(X\geq a)-P(X\geq b)=e^{-\lambda a}-e^{-\lambda b}$.
  • The mean and the variance can be calculated to be
    $$E[X]=\frac{1}{\lambda},\qquad \text{var}(X)=\frac{1}{\lambda^2}$$
    using integration by parts:
    $$\begin{aligned}E[X]&=\int_0^\infty x\lambda e^{-\lambda x}\,dx\\ &=\big(-xe^{-\lambda x}\big)\Big|^\infty_0+\int_0^\infty e^{-\lambda x}\,dx \\&=0-\frac{e^{-\lambda x}}{\lambda}\Big|^\infty_0 \\&=\frac{1}{\lambda}\end{aligned}$$
    $$\begin{aligned}E[X^2]&=\int_0^\infty x^2\lambda e^{-\lambda x}\,dx\\ &=\big(-x^2e^{-\lambda x}\big)\Big|^\infty_0+\int_0^\infty 2xe^{-\lambda x}\,dx \\&=0+\frac{2}{\lambda}E[X] \\&=\frac{2}{\lambda^2}\end{aligned}$$
    $$\text{var}(X)=E[X^2]-(E[X])^2=\frac{1}{\lambda^2}$$
    (A numerical check of these facts follows at the end of this section.)

  • An exponential random variable can, for example, be a good model for the amount of time until an incident of interest takes place. We will see that it is closely connected to the geometric random variable, which also relates to the (discrete) time that will elapse until an incident of interest takes place.
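Finally, the exponential facts above can be checked numerically (my sketch; note that NumPy's `exponential` is parameterized by the mean $1/\lambda$, not the rate):

```python
# Sketch: normalization, tail formula, mean and variance of exponential(λ).
import numpy as np
from scipy.integrate import quad

lam = 0.5
f = lambda x: lam * np.exp(-lam * x)

print(quad(f, 0, np.inf)[0])                    # ~1.0: legitimate PDF
a = 3.0
print(quad(f, a, np.inf)[0], np.exp(-lam * a))  # both ~0.223: P(X >= a) = e^{-λa}

rng = np.random.default_rng(0)
x = rng.exponential(1 / lam, size=1_000_000)    # NumPy takes the mean 1/λ
print(x.mean(), 1 / lam)        # ~2.0: E[X] = 1/λ
print(x.var(), 1 / lam ** 2)    # ~4.0: var(X) = 1/λ²
```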

Reposted from blog.csdn.net/weixin_42437114/article/details/113694428