Skip to content

Michael Qiu's Blog

Inference for Means

gitmichaelqiu/blog

Michael Qiu's Blog

gitmichaelqiu/blog

Portfolio
Home
Home
- About
- Some Websites
My Projects
My Projects
- macOSer Series
  macOSer Series
- LexiGen
  LexiGen
- Other Projects
  Other Projects
  - Discord Spam Spoiler
  - Other Projects
Algorithms
Algorithms
- Number Theory
  Number Theory
  - GCD & LCM
  - Euler Sieve
- Search
  Search
  - A*
- Shortest Path
  Shortest Path
  - Bellman Ford
  - Dijkstra
  - Floyd
  - SPFA
- Sort
  Sort
- Quick Pow
- Data Structures
  Data Structures
  - Bignum
  - Binary Heap
  - Disjoint Set Union
  - Sparse Table
  - Stack
  - Tree
    Tree
    
    Binary Tree Traversals
    
    Segment Tree
- OI Tips
  OI Tips
Academic Notes
Academic Notes
- A-Level
  A-Level
  - Physics
    Physics
    
    AS Physics
    
    Copyright Claim
  - Chemistry
    Chemistry
    
    Bonding & Structures
    
    Isomerism
    
    Organic Compounds
  - Advanced Math
    Advanced Math
    
    Primes
    
    Polynomials
- AP
  AP
  - Calculus
    Calculus
    
    Limit
    
    Differentiation
    
    Fundamental Calculus
  - Microeconomics
    Microeconomics
    
    Basic Economic Concept
    
    Demand and Supply
    
    Production, Cost and Competition
  - Statistics
    Statistics
    
    Summaries
    Summaries
    
    Hypothesis Test & Confidence Interval
    
    Unit 1: Exploring One-Variable Data
    Unit 1: Exploring One-Variable Data
    
    1. Summary Statistics
    
    2. Graphical Representations
    
    3. The Normal Distribution
    
    Unit 2: Exploring Two-Variable Data
    Unit 2: Exploring Two-Variable Data
    
    1. Tables & Graphs
    
    2. Scatterplots & Regression
    
    3. Experimental Design
    
    Unit 3: Collecting Data
    Unit 3: Collecting Data
    
    1. Sampling Methods & Bias
    
    2. Experimental Design
    
    Unit 4: Probability, Random Variables, and Probability Distributions
    Unit 4: Probability, Random Variables, and Probability Distributions
    
    1. Discrete Random Variables
    
    2. Binomial & Geometric Distributions
    
    Unit 5: Sampling Distributions
    Unit 5: Sampling Distributions
    
    1. Sampling Distributions
    
    Unit 6: Inference for Categorical Data: Proportions
    Unit 6: Inference for Categorical Data: Proportions
    
    1. Introduction to Inferences
    
    2. Inference for Proportions
    
    3. Errors in Hypothesis Tests
    
    Unit 7: Inference for Quantitative Data: Means
    Unit 7: Inference for Quantitative Data: Means
    
    1. Inference for Means 1. Inference for Means
    On this page
    
    t-Distribution
    
    What Is t-Distribution
    
    When Is t-Distribution Used
    
    Hypothesis Tests for Population Means
    
    One-Sample t-test for Mean
    
    Conditions for One-Sample t-test
    
    Calculate t-value
    
    Calculate dof
    
    For Differences in Population
    
    t-scores VS z-scores
    
    Paired t-test
    
    Calculate t-value
    
    Unit 8: Inference for Categorical Data: Chi-Square
    Unit 8: Inference for Categorical Data: Chi-Square
    
    1. Goodness of Fit
    
    2. Independence and Homogeneity
    
    Unit 9: Inference for Quantitative Data: Slopes
    Unit 9: Inference for Quantitative Data: Slopes
    
    1. Inference for Regression Slopes
    
    Copyright Claim
- 体制内
  体制内
  - 初中地理
  - 学考生物
    学考生物
    
    细胞的分子组成
    
    细胞的结构
  - 学考政治
    学考政治
    
    必修一
    
    必修二
  - 学考中国古代史
  - 学考化学物质俗名
  - 学考抛物运动
  - 学考集合
Games Reviews
Games Reviews
- Alan Wake 2
- Red Dead Redemption 2

Inference for Means¶

t-Distribution¶

What Is t-Distribution¶

A continuous probability distribution similar to the normal distribution

The tails are thicker \(\implies\) more chances of getting extreme values

Degrees of freedom, dof
- \(\uparrow\) dof \(\implies\) peak sharper and tails thinner \(\implies\) closer to normal distribution
\(\mu = 0\)
\(\sigma > 1\), closer to 1 as dof increases

When Is t-Distribution Used¶

\(\sigma\) is unknown and population is approximately normally distributed
\(n < 30\)
t-distribution can be used to
1. Perform hypothesis tests for \(\mu\)
2. Form confidence intervals for \(\mu\)

Hypothesis Tests for Population Means¶

One-Sample t-test for Mean¶

Test whether the population mean of a normally distributed population has changed

\(\sigma\) is unknown

Conditions for One-Sample t-test¶

If the population is very skewed, a t-test can only be done when \(n \geq 30\)

Calculate t-value¶

\(t = \dfrac{\overline{x} - \mu}{standard\ error}\)
\(standard\ error = \dfrac{s}{\sqrt{n}}\)

Calculate dof¶

\(dof = n-1\), if there are multiple \(n\), choose the smallest one.

For Differences in Population¶

\(standard\ error = \sqrt{\dfrac{s_A^2}{n_A} + \dfrac{s_B^2}{n_B}}\)

t-scores VS z-scores¶

graph LR;

H(Start);
I{Normally distributed?};
H --> I;
I -->|Yes| G;
I -->|No| F;
G{Population variance known?};
G -->|Yes| B(z-score);
G -->|No| C{n < 30?};
C -->|Yes| D(t-score);
C -->|No| B;

F{n ≥ 30?} -->|"Yes (CLT)"| B;
F -->|No| J(Non-parametric tests);

Paired t-test¶

Test whether or not the population means of two pieces of data that are linked are equal by examining the differences between paired data

The data for a two-sample t-test is from two independent populations
The data for a paired t-test is linked and come from one population
Use \(d\) for the difference of two measures. For instance, \(\mu_d\)

Calculate t-value¶

\(t = \dfrac{\overline{x_d} - \mu_d}{\frac{s_d}{\sqrt{n}}}\)