Hypothesis test and confidence interval
Procedures for Hypothesis Test
- Define context
- H0:parameter= ...
- Ha:parameter condition ...
- parameter is the ...
- α= ...
- Verify inference conditions
- Find p-value
- Conclusion
- p condition α
- H0 should/should not be rejected
- The data provides sufficient evidence that ...
Procedures for Confidence Interval
- Verify confidence interval conditions
- Find confidence interval
- CI=statistic±(critical value)×(standard error of statistic)
- We can be C% confident that the interval from lower limit to upper limit captures the actual value of the parameter
Population Proportions
One-sample z-test.
Conditions:
- Independence condition
- Random sampling/assignment
- If sampling without replacement, n<0.1N
- Approximately normally distributed
- {np0n(1−p0)≥10≥10
Test statistic:
- z=np0(1−p0)p^−p0
- standard error = np0(1−p0)
Differences in Population Proportions
Two-sample z-test .
Conditions:
- Independence condition
- Random sampling/assignment
- If sampling without replacement, n<0.1N
- Approximately normally distributed
- Combined proportion/pooled proportion p^c=n1+n2X1+X2, X=np^
- ⎩⎨⎧n1p^cn1(1−p^c)n2p^cn2(1−p^c)≥10≥10≥10≥10
Test statistic:
- z=n1p^1(1−p^1)+n2p^2(1−p^2)(p^1−p^2)−0
- standard error = n1p^1(1−p^1)+n2p^2(1−p^2)
Population Means
One-sample t-test.
Conditions:
- Independence condition
- Random sampling/assignment
- If sampling without replacement, n<0.1N
- Approximately normally distributed
- Approximately symmetric
- No outliers
- If very skewed, n≥30
Test statistic:
- t=nsx−μ
- standard error = ns
Differences in Population Means
Two-sample t-test.
Conditions:
- Independence condition
- Random sampling/assignment
- If sampling without replacement, n<0.1N
- Approximately normally distributed
- Approximately symmetric
- No outliers
- If very skewed, n≥30
Test statistic:
- t=n1s12+n2s22x1−x2
- standard error = n1s12+n2s22
Differences in Matched Pairs
One-sample t-test.
Conditions:
- Two measures come from the same items within the population
- Independence condition
- Random sampling/assignment
- If sampling without replacement, n<0.1N
- Approximately normally distributed
- Approximately symmetric
- No outliers
- If very skewed, n≥30
Test statistic:
- t=nsdxd−μd
- standard error = nsd
Goodness of Fit
χ2-test.
Conditions:
- Independence condition
- Random sampling
- If sampling without replacement, n<0.1N
- Large counts condition
Test statistic:
- χ2=∑expected(observed−expected)2
Independence
One-sample χ2-test.
Conditions:
- Independence condition
- Random sampling
- If sampling without replacement, n<0.1N
- Large counts condition
- Each expected value ≥ 5
- or ≥ 80\% expected values > 5 and all are ≥ 1
Test statistic:
- χ2=∑expected(observed−expected)2
- Expected value = Total number in sampleRow total×Column total
- dof=(nrow−1)(ncol−1)
Homogeneity
Multi-sample χ2-test.
Conditions:
- Independence condition
- Random sampling
- If sampling without replacement, n<0.1N
- Large counts condition
Test statistic:
- χ2=∑expected(observed−expected)2
- dof=(nrow−1)(ncol−1)
Regression Line
One-sample t-test.
Conditions:
- Relationship between x and y is linear
- σy cannot vary with x
- Independence condition
- Random sampling
- If sampling without replacement, n<0.1N
- For a given value of x, y-values follow an approximate normal distribution
- If n<30, y-values distribution have no strong skew and no outliers
Test statistic:
- t=sbb−β
- standard error sb=sxn−1s
- s=n−2∑(yi−yi^)2
- sx=n−1∑(xi−x)2
- t-distribution with dof=n−2
| Predictor |
Coef |
SE Coef |
T |
P |
| Constant |
a |
|
|
|
| x-variable |
b |
sb |
|
|