Four-Step Process
- STATE
- PLAN
- Check conditions
- DO
- CONCLUDE
Unit 6 through 9 covers inference, which is the practice of analyzing statistics to infer properties of a population.
- Point estimates are single statistics based on sample data to estimate a parameter
| Name | Parameter | Point Estimate |
|---|---|---|
| Variance | ||
| Standard Deviation | ||
| Mean | ||
| Size | ||
| Proportion (categorical) |
- The standard deviation of a statistic is called the standard error
- The -score of a test statistic is calculated as:
- Standard deviation is calculated with ,
CONFIDENCE INTERVALS
- Confidence intervals are intervals expected to contain the true population parameter
- refers to the area outside of the confidence interval (critical level () is inv. CDF of )
- Margin of error refers to the portion inside the brackets
- Recall from Chapter 1 that
Written Answers
Respond to confidence interval questions in the following format:
We are [x]% confident that the true [parameter] of [context] is between [upper bound] and [lower bound].
Conditions for creating a confidence interval are that:
- the sample must be random
- the sample must be <10% of the population
- the sample must contain >10 p and q
Example β Create a Confidence Interval
A random sample of 200 taxi drivers found that 23% responded βYESβ to a monthly income of greater than $5000.
We will create a confidence interval of 98%.
Conditions:
- The sample is random
- 200 is likely less than 10% of all taxi drivers
- More than 10 answered βYESβ and βNOβ
Creating the interval:
- therefore the area at each tail = 0.01
InvNormCD(area=0.99, Ο=1, ΞΌ=0) >>> 2.326- Bounds (0.161, 0.299)
Answering in context: βWe are 98% confident that the true proportion of taxi drivers who have a monthly income greater than $5500 is between 16.1% and 29.9%.β
Example β Determine What Sample Size is Needed
The Delta School District wants to find the proportion of teachers who smoke by constructing a 95% CI with a margin of error of 2%.
We are determining sample size.
To find :
InvNormCD(area=0.975, Ο=1, ΞΌ=0) >>> 1.960Applying the formula for margin of error:
can be substituted with 0.50 since it is unknown:
Solving for sample size :
To make the MOE smaller:
- reduce the percent confidence to lower the confidence level
- obtain a sample with less variability
- obtain a larger sample
Null Hypothesis
- The hypothesis is always stated in terms of (not )
- Each test is conducted with the assumption of the null hypothesis ()
- has no effect on
- The tested statement that is contrary to the null is the alternate hypothesis ()
- The -value is the probability of the result occurring assuming the null is true
- fails to be rejected if
- is accepted if
Errors
- Type I errors () occur when is rejected but is actually true
- Type II errors () occur when fails to be rejected but is actually false
- Power () is the probability that a test will reject the null when it is false
- and are inversely related
| Increasing | P(Type I) | P(Type II) | Power |
|---|---|---|---|
| (sample size) | |||
| (observed difference) |
Choosing Inference Procesures
Qualitative Data
- homogeneity: multiple samples
- independence: one sample; one variable
- goodness-of-fit: one sample; two variables
Quantitative Data
Proportions:
- 1 proportion : one sample
- 2 proportion : two samples
Means:
- 1 sample : one sample
- 2 sample : two samples
- Matched pair : two grouped samples
Slopes:
- Linear regression