Roll a four-sided die twice and let
X equal the larger of the two outcomes if there are different and the common value if they are the same. The sample space for this experiment is
, where each of this 16 points has probability 1/16. Then
,
, and similarly
and
. That is, the p. d.f. of
X can be written simply as
We could add that
elsewhere; but if we do not, one should take
f(x) to equal zero when
.
A better understanding of a particular probability distribution can often be obtained with a graph that depicts the p.d.f. of
X .
the graph of the p.d.f. when
, would be simply the set of points {
}, where
R is the space of
X .
Two types of graphs can be used to give a better visual appreciation of the p.d.f., namely,
a bar graph and
a probability histogram . A bar graph of the p.d.f.
f(x) of the random variable
X is a graph having a vertical line segment drawn from
to
at each
x in
R , the space of
X . If
X can only assume integer values,
a probability histogram of the p.d.f. f(x) is a graphical representation that has a rectangle of height
f(x) and a base of length 1, centered at
x , for each
, the space of
X .
CUMULATIVE DISTRIBUTION FUNCTION
Let define the function
F(x) by
The function
F(x) is called
the distribution function (sometimes
cumulative distribution function ) of the discrete-type random variable
X .
Several properties of a distribution function
F(x) can be listed as a consequence of the fact that probability must be a value between 0 and 1, inclusive:
because
F(x) is a probability,
F(x) is a nondecreasing function of
x ,
, where
y is any value greater than or equal to the largest value in
R ; and
, where
z is any value less than the smallest value in
R ;
If
X is a random variable of the discrete type, then
F(x) is a step function, and the height at a step at
x ,
, equals the probability
.
It is clear that the probability distribution associated with the random variable
X can be described by either the distribution function
F(x) or by the probability density function
f(x) . The function used is a matter of convenience; in most instances,
f(x) is easier to use than
F(x) .
Graphical representation of the relationship between p.d.f. and c.d.f.
Area under p.d.f. curve to
a equal to a value of c.d.f. curve at a point
a .
MATHEMATICAL EXPECTATION
If
f(x) is the p.d.f. of the random variable
X of the discrete type with space
R and if the summation
exists, then the sum is called
the mathematical expectation or
the expected value of the function
u(X) , and it is denoted by
. That is,
We can think of the expected value
as a weighted mean of
u(x) ,
, where the weights are the probabilities
.
The usual definition of the mathematical expectation of
u(X) requires that the sum converges absolutely; that is,
exists.
There is another important observation that must be made about consistency of this definition. Certainly, this function
u(X) of the random variable
X is itself a random variable, say
Y . Suppose that we find the p.d.f. of
Y to be
g(y) on the support
. Then
E(Y) is given by the summation
In general it is true that
that is, the same expectation is obtained by either method.
Let
X be the random variable defined by the outcome of the cast of the die. Thus the p.d.f. of
X is
,
.
In terms of the observed value
x , the function is as follows