8.2 A single population mean using the Student's t-distribution


In practice, we rarely know the population standard deviation σ. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval, with results that were close enough. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.

William S. Gosset (1876–1937) of the Guinness brewery in Dublin, Ireland ran into this problem. His experiments with hops and barley produced very few samples. Just replacing σ with s did not produce accurate results when he tried to calculate a confidence interval. He realized that he could not use a normal distribution for the calculation; he found that the actual distribution depends on the sample size. This problem led him to "discover" what is called the Student's t-distribution. The name comes from the fact that Gosset wrote under the pen name "Student."

Up until the mid-1970s, some statisticians used the normal distribution approximation for large sample sizes and used the Student's t-distribution only for sample sizes of at most 30.

If you draw a simple random sample of size n from a population with mean μ and unknown population standard deviation σ and calculate the t-score $t = \frac{\bar{x} - \mu}{s / \sqrt{n}}$, then the t-scores follow a Student's t-distribution with n – 1 degrees of freedom. The t-score has the same interpretation as the z-score. It measures how far in standard deviation units $\bar{x}$ is from its mean μ. For each sample size n, there is a different Student's t-distribution.
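As a minimal sketch of the t-score formula above, the following Python snippet computes $\bar{x}$, s, and t for a small sample. The data values, the hypothesized mean of 5.0, and the function name `t_score` are made up for illustration and do not come from the text.

```python
import math
import statistics


def t_score(sample, mu):
    """Return (t, degrees of freedom) for a sample and a hypothesized mean mu."""
    n = len(sample)
    x_bar = statistics.mean(sample)
    s = statistics.stdev(sample)          # sample standard deviation (divides by n - 1)
    standard_error = s / math.sqrt(n)     # estimated standard deviation of x-bar
    return (x_bar - mu) / standard_error, n - 1


# Hypothetical measurements, assumed to come from a normal population
sample = [5.2, 5.5, 4.9, 5.3, 5.6]
t, df = t_score(sample, mu=5.0)
print(f"t = {t:.3f}, df = {df}")  # prints: t = 2.449, df = 4
```

Here the sample mean 5.3 lies about 2.45 estimated standard errors above the hypothesized mean, and the relevant Student's t-distribution has 4 degrees of freedom.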

The degrees of freedom, n – 1, come from the calculation of the sample standard deviation s. Remember when we first calculated a sample standard deviation we divided the sum of the squared deviations by n – 1, but we used n deviations ($x - \bar{x}$ values) to calculate s. Because the sum of the deviations is zero, we can find the last deviation once we know the other n – 1 deviations. The other n – 1 deviations can change or vary freely. We call the number n – 1 the degrees of freedom (df) in recognition that one is lost in the calculations. The effect of losing a degree of freedom is that the t-value increases and the confidence interval increases in width.
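The argument about degrees of freedom can be checked numerically. The sketch below, using made-up data, shows that the n deviations sum to zero, so the last deviation can be recovered from the other n – 1, and that dividing the sum of squared deviations by n – 1 reproduces the sample standard deviation.

```python
import math
import statistics

# Hypothetical sample, for illustration only
sample = [5.2, 5.5, 4.9, 5.3, 5.6]
n = len(sample)
x_bar = statistics.mean(sample)

# The n deviations from the sample mean
deviations = [x - x_bar for x in sample]

# They sum to zero (up to floating-point rounding) ...
total = sum(deviations)

# ... so the last deviation is fixed by the other n - 1
recovered_last = -sum(deviations[:-1])

# Sample standard deviation: divide by n - 1, the degrees of freedom
s = math.sqrt(sum(d ** 2 for d in deviations) / (n - 1))

print(f"sum of deviations: {total:.2e}")
print(f"last deviation: {deviations[-1]:.4f}, recovered: {recovered_last:.4f}")
print(f"s by hand: {s:.4f}, statistics.stdev: {statistics.stdev(sample):.4f}")
```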

Properties of the Student's t-distribution

The graph for the Student's t-distribution is similar to the standard normal curve and at infinite degrees of freedom it is the normal distribution. You can confirm this by reading the bottom line at infinite degrees of freedom for a familiar level of confidence, e.g. at column 0.05, 95% level of confidence, we find the t-value of 1.96 at infinite degrees of freedom.
The mean for the Student's t-distribution is zero and the distribution is symmetric about zero, again like the standard normal distribution.
The Student's t-distribution has more probability in its tails than the standard normal distribution because the spread of the t-distribution is greater than the spread of the standard normal. So the graph of the Student's t-distribution will be thicker in the tails and shorter in the center than the graph of the standard normal distribution.
The exact shape of the Student's t-distribution depends on the degrees of freedom. As the degrees of freedom increase, the graph of the Student's t-distribution becomes more like the graph of the standard normal distribution.
The underlying population of individual observations is assumed to be normally distributed with unknown population mean μ and unknown population standard deviation σ. This assumption comes from the Central Limit Theorem because the individual observations in this case are the $\bar{x}$s of the sampling distribution. The size of the underlying population is generally not relevant unless it is very small. If it is normal, then the assumption is met and doesn't need discussion. Random sampling is assumed, but that is a completely separate assumption from normality.
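The shape properties listed above can be illustrated numerically. The sketch below evaluates the standard closed-form density of the Student's t-distribution, using only Python's standard library, and compares it with the standard normal density at the center (0) and in the tail (3). The helper names `t_pdf` and `normal_pdf` and the chosen degrees of freedom are illustrative assumptions, not part of the text.

```python
import math
import statistics


def t_pdf(t, df):
    """Density of the Student's t-distribution with df degrees of freedom."""
    # Work with logarithms so that large df does not overflow the gamma function
    log_const = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2 * math.log1p(t * t / df))


def normal_pdf(z):
    """Density of the standard normal distribution."""
    return math.exp(-z * z / 2) / math.sqrt(2 * math.pi)


print("df      center f(0)   tail f(3)")
for df in (2, 5, 30, 1000):
    print(f"{df:<7} {t_pdf(0, df):.5f}       {t_pdf(3, df):.5f}")
print(f"normal  {normal_pdf(0):.5f}       {normal_pdf(3):.5f}")

# Symmetry about zero: the density is the same at -1.5 and 1.5
symmetric = math.isclose(t_pdf(-1.5, 5), t_pdf(1.5, 5))

# The familiar 1.96 is the value leaving 2.5% in each tail of the standard normal
z_975 = statistics.NormalDist().inv_cdf(0.975)
print(f"symmetric: {symmetric}, z for 95% confidence: {z_975:.2f}")
```

For small degrees of freedom the density is lower at the center and higher in the tail than the normal density, and the two columns approach the normal values as the degrees of freedom grow.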

Source: OpenStax, Business statistics -- bsta 200 -- humber college -- version 2016reva -- draft 2016-04-04. OpenStax CNX. Apr 05, 2016 Download for free at http://legacy.cnx.org/content/col11969/1.5
