Discrete distribution

Roll a four-sided die twice and let X equal the larger of the two outcomes if they are different and the common value if they are the same. The sample space for this experiment is $S = \{(d_{1}, d_{2}) : d_{1} = 1,2,3,4; d_{2} = 1,2,3,4\}$ , where each of these 16 points has probability 1/16. Then $P (X = 1) = P [(1,1)] = 1 / 16$ , $P (X = 2) = P [(1,2), (2,1), (2,2)] = 3 / 16$ , and similarly $P (X = 3) = 5 / 16$ and $P (X = 4) = 7 / 16$ . That is, the p.d.f. of X can be written simply as $f (x) = P (X = x) = \frac{2 x - 1}{16}, x = 1,2,3,4.$

We could add that $f (x) = 0$ elsewhere; if we do not, f(x) should be taken to equal zero when $x \notin R$ , where R = {1, 2, 3, 4} is the space of X .
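The counting argument above can be checked by brute force. The following is a minimal sketch, assuming the die is fair so that all 16 ordered pairs are equally likely; the variable names (`outcomes`, `counts`, `pmf`) are illustrative only.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Enumerate the 16 equally likely outcomes (d1, d2) of two rolls of a four-sided die.
outcomes = list(product(range(1, 5), repeat=2))

# X is the larger of the two outcomes (or the common value when they match).
counts = Counter(max(d1, d2) for d1, d2 in outcomes)

# p.d.f. obtained by counting, kept as exact fractions.
pmf = {x: Fraction(counts[x], len(outcomes)) for x in sorted(counts)}

# Compare each value with the closed form f(x) = (2x - 1) / 16.
for x, p in pmf.items():
    assert p == Fraction(2 * x - 1, 16)
    print(x, p)
```

Exact fractions are used instead of floating-point numbers so that the comparison with (2x - 1)/16 is not affected by rounding.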


A better understanding of a particular probability distribution can often be obtained with a graph that depicts the p.d.f. of X .

The graph of the p.d.f., when $f (x) > 0$ , would be simply the set of points $\{[x, f (x)] : x \in R\}$ , where R is the space of X .

Two types of graphs can be used to give a better visual appreciation of the p.d.f., namely, a bar graph and a probability histogram . A bar graph of the p.d.f. f(x) of the random variable X is a graph having a vertical line segment drawn from $(x,0)$ to $[x, f (x)]$ at each x in R , the space of X . If X can only assume integer values, a probability histogram of the p.d.f. f(x) is a graphical representation that has a rectangle of height f(x) and a base of length 1, centered at x , for each $x \in R$ , the space of X .
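As a rough illustration of the bar graph, the sketch below draws a text version for the p.d.f. f(x) = (2x - 1)/16 from the die example. The helper `text_bar_graph` and its `scale` parameter are hypothetical names introduced here; a plotting library would normally be used instead. The last lines check that the rectangles of a probability histogram, each with base 1 and height f(x), have total area 1.

```python
from fractions import Fraction

# p.d.f. of the larger of two rolls of a four-sided die.
f = {x: Fraction(2 * x - 1, 16) for x in range(1, 5)}

def text_bar_graph(f, scale=16):
    """Return one text row per x in R, with a bar whose length is proportional to f(x)."""
    rows = []
    for x in sorted(f):
        bar = "#" * round(f[x] * scale)
        rows.append(f"{x} | {bar} {f[x]}")
    return rows

for row in text_bar_graph(f):
    print(row)

# Probability histogram: each rectangle has base 1 and height f(x),
# so the total area is the sum of the probabilities.
histogram_area = sum(1 * f[x] for x in f)
print(histogram_area)  # 1
```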

CUMULATIVE DISTRIBUTION FUNCTION

Let us define the function F(x) by

$F (x) = P (X \leq x) = \sum_{t \in A} f (t),$

where $A = \{t \in R : t \leq x\}$ .

The function F(x) is called the distribution function (sometimes cumulative distribution function ) of the discrete-type random variable X .

Several properties of a distribution function F(x) can be listed as a consequence of the fact that probability must be a value between 0 and 1, inclusive:

$0 \leq F (x) \leq 1$ because F(x) is a probability,
F(x) is a nondecreasing function of x ,
$F (y) = 1$ , where y is any value greater than or equal to the largest value in R ; and $F (z) = 0$ , where z is any value less than the smallest value in R ;
If X is a random variable of the discrete type, then F(x) is a step function, and the height of the step at x , $x \in R$ , equals the probability $P (X = x)$ .

It is clear that the probability distribution associated with the random variable X can be described by either the distribution function F(x) or by the probability density function f(x) . The function used is a matter of convenience; in most instances, f(x) is easier to use than F(x) .
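The listed properties of F(x) can be seen numerically. This is a minimal sketch that reuses the p.d.f. f(x) = (2x - 1)/16, x = 1, 2, 3, 4, from the die example; the function name `F` and the dictionary `cumulative` are illustrative choices.

```python
from fractions import Fraction
from itertools import accumulate

# p.d.f. of the larger of two rolls of a four-sided die.
support = [1, 2, 3, 4]
f = {x: Fraction(2 * x - 1, 16) for x in support}

# Running totals of f over the sorted support give F at each point of R.
cumulative = dict(zip(support, accumulate(f[x] for x in support)))

def F(x):
    """Distribution function F(x) = P(X <= x), defined for any real x."""
    return sum((p for t, p in f.items() if t <= x), Fraction(0))

# F is 0 below the smallest value in R and 1 at or above the largest value.
print(F(0), F(4), F(10))

# F is constant between points of R (a step function) ...
print(F(2), F(2.5))

# ... and the height of the step at x equals P(X = x).
print(F(3) - F(2), f[3])
```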

Graphical representation of the relationship between p.d.f. and c.d.f.

The area under the p.d.f. curve up to a point a equals the value of the c.d.f. curve at a .

MATHEMATICAL EXPECTATION

If f(x) is the p.d.f. of the random variable X of the discrete type with space R and if the summation

$\sum_{R} u (x) f (x) = \sum_{x \in R} u (x) f (x)$

exists, then the sum is called the mathematical expectation or the expected value of the function u(X) , and it is denoted by $E [u (X)]$ . That is,

$E [u (X)] = \sum_{R} u (x) f (x) .$

We can think of the expected value $E [u (X)]$ as a weighted mean of u(x) , $x \in R$ , where the weights are the probabilities $f (x) = P (X = x)$ .

The usual definition of the mathematical expectation of u(X) requires that the sum converges absolutely; that is,

$\sum_{x \in R} | u (x) | f (x)$

exists.

There is another important observation that must be made about consistency of this definition. Certainly, this function u(X) of the random variable X is itself a random variable, say Y . Suppose that we find the p.d.f. of Y to be g(y) on the support $R_{1}$ . Then E(Y) is given by the summation $\sum_{y \in R_{1}} y g (y)$ .

In general it is true that $\sum_{R} u (x) f (x) = \sum_{y \in R_{1}} y g (y);$ that is, the same expectation is obtained by either method.

Let X be the random variable defined by the outcome of the cast of a fair six-sided die. Thus the p.d.f. of X is

$f (x) = \frac{1}{6}$ , $x = 1,2,3,4,5,6$ .

In terms of the observed value x , the function u is as follows:

$u (x) = \begin{cases} 1, & x = 1,2,3, \\ 5, & x = 4,5, \\ 35, & x = 6. \end{cases}$

The mathematical expectation is equal to

$\sum_{x = 1}^{6} u (x) f (x) = 1 (\frac{1}{6}) + 1 (\frac{1}{6}) + 1 (\frac{1}{6}) + 5 (\frac{1}{6}) + 5 (\frac{1}{6}) + 35 (\frac{1}{6}) = 1 (\frac{3}{6}) + 5 (\frac{2}{6}) + 35 (\frac{1}{6}) = 8.$
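The same sum can be evaluated mechanically. The sketch below assumes a finite space R stored as a dictionary from x to f(x); the helper name `expectation` is a hypothetical choice, not notation from the text.

```python
from fractions import Fraction

def expectation(f, u):
    """E[u(X)] = sum over x in R of u(x) f(x), for a finite space R."""
    return sum(u(x) * p for x, p in f.items())

# p.d.f. of one cast of a six-sided die: f(x) = 1/6 for x = 1, ..., 6.
f = {x: Fraction(1, 6) for x in range(1, 7)}

def u(x):
    """The function u from the example: 1 for x = 1,2,3; 5 for x = 4,5; 35 for x = 6."""
    if x in (1, 2, 3):
        return 1
    if x in (4, 5):
        return 5
    return 35

print(expectation(f, u))  # 8
```

Passing `u` as an argument keeps the weighted-mean structure of the definition visible: only the function changes from one expectation to another, while the weights f(x) stay fixed.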


Let the random variable X have the p.d.f. $f (x) = \frac{1}{3}$ , $x \in R$ , where R ={-1,0,1}. Let $u (X) = X^{2}$ . Then

$\sum_{x \in R} x^{2} f (x) = {(- 1)}^{2} (\frac{1}{3}) + {(0)}^{2} (\frac{1}{3}) + {(1)}^{2} (\frac{1}{3}) = \frac{2}{3} .$

However, the support of the random variable $Y = X^{2}$ is $R_{1} = \{0,1\}$ and

$\begin{array}{l} P (Y = 0) = P (X = 0) = \frac{1}{3} \\ P (Y = 1) = P (X = - 1) + P (X = 1) = \frac{1}{3} + \frac{1}{3} = \frac{2}{3} . \end{array}$

That is, $g (y) = \begin{cases} \frac{1}{3}, & y = 0, \\ \frac{2}{3}, & y = 1; \end{cases}$ and $R_{1} = \{0,1\}$ . Hence

$\sum_{y \in R_{1}} y g (y) = 0 (\frac{1}{3}) + 1 (\frac{2}{3}) = \frac{2}{3},$ which illustrates the preceding observation.
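Both routes to the expectation can be compared in a few lines. This is a sketch under the assumptions of the example (X uniform on {-1, 0, 1} and u(x) = x²); the names `lhs`, `rhs` and `g` are illustrative.

```python
from collections import defaultdict
from fractions import Fraction

# p.d.f. of X: f(x) = 1/3 on R = {-1, 0, 1}.
f = {x: Fraction(1, 3) for x in (-1, 0, 1)}

def u(x):
    return x ** 2

# Method 1: sum u(x) f(x) over the space R of X.
lhs = sum(u(x) * p for x, p in f.items())

# Method 2: build the p.d.f. g of Y = u(X) by adding up the probabilities
# of all x that map to the same y, then sum y g(y) over R_1.
g = defaultdict(Fraction)
for x, p in f.items():
    g[u(x)] += p
rhs = sum(y * q for y, q in g.items())

print(dict(g))
print(lhs, rhs)  # 2/3 2/3
```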


When it exists, mathematical expectation E satisfies the following properties:

1. If c is a constant, E ( c ) = c ,
2. If c is a constant and u is a function, $E [c u (X)] = c E [u (X)]$ ,
3. If $c_{1}$ and $c_{2}$ are constants and $u_{1}$ and $u_{2}$ are functions, then $E [c_{1} u_{1} (X) + c_{2} u_{2} (X)] = c_{1} E [u_{1} (X)] + c_{2} E [u_{2} (X)]$ .

First, we have for the proof of (1) that

$E (c) = \sum_{R} c f (x) = c \sum_{R} f (x) = c$

because $\sum_{R} f (x) = 1.$

Next, to prove (2), we see that

$E [c u (X)] = \sum_{R} c u (x) f (x) = c \sum_{R} u (x) f (x) = c E [u (X)] .$

Finally, the proof of (3) is given by

$E [c_{1} u_{1} (X) + c_{2} u_{2} (X)] = \sum_{R} [c_{1} u_{1} (x) + c_{2} u_{2} (x)] f (x) = \sum_{R} c_{1} u_{1} (x) f (x) + \sum_{R} c_{2} u_{2} (x) f (x) .$

By applying (2), we obtain

$E [c_{1} u_{1} (X) + c_{2} u_{2} (X)] = c_{1} E [u_{1} (X)] + c_{2} E [u_{2} (X)] .$

Property (3) can be extended to more than two terms by mathematical induction; that is, we have

3'. $E [\sum_{i = 1}^{k} c_{i} u_{i} (X)] = \sum_{i = 1}^{k} c_{i} E [u_{i} (X)] .$

Because of property (3’), mathematical expectation E is called a linear or distributive operator .

Let X have the p.d.f. $f (x) = \frac{x}{10}$ , x = 1,2,3,4. Then

$\begin{array}{l} E (X) = \sum_{x = 1}^{4} x (\frac{x}{10}) = 1 (\frac{1}{10}) + 2 (\frac{2}{10}) + 3 (\frac{3}{10}) + 4 (\frac{4}{10}) = 3 \\ E (X^{2}) = \sum_{x = 1}^{4} x^{2} (\frac{x}{10}) = 1^{2} (\frac{1}{10}) + 2^{2} (\frac{2}{10}) + 3^{2} (\frac{3}{10}) + 4^{2} (\frac{4}{10}) = 10, \end{array}$

and

$E [X (5 - X)] = 5 E (X) - E (X^{2}) = (5) (3) - 10 = 5.$
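The linearity used in the last step can be confirmed by computing E[X(5 - X)] directly from the definition and comparing it with 5E(X) - E(X²). This is a minimal sketch; the helper `E` is a hypothetical name for the finite sum over R.

```python
from fractions import Fraction

# p.d.f. f(x) = x/10 on R = {1, 2, 3, 4}.
f = {x: Fraction(x, 10) for x in range(1, 5)}

def E(u):
    """E[u(X)] as the finite sum of u(x) f(x) over R."""
    return sum(u(x) * p for x, p in f.items())

mean = E(lambda x: x)                 # E(X)
second_moment = E(lambda x: x ** 2)   # E(X^2)

# Direct evaluation of E[X(5 - X)] from the definition ...
direct = E(lambda x: x * (5 - x))

# ... and the same quantity via linearity: 5 E(X) - E(X^2).
via_linearity = 5 * mean - second_moment

print(mean, second_moment, direct, via_linearity)  # 3 10 5 5
```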




Source: OpenStax, Introduction to statistics. OpenStax CNX. Oct 09, 2007 Download for free at http://cnx.org/content/col10343/1.3
