Learning Objectives

By the end of this chapter, you will be able to:

  • Understand when to use paired t-test
  • Calculate the difference scores and their statistics
  • Perform paired t-test step by step
  • Interpret results in context
  • Distinguish between independent and paired samples

When to Use Paired t-Test

Use paired t-test when:

  1. Same subjects measured twice (before-after)
  2. Matched pairs (twins, siblings, matched characteristics)
  3. Related samples (husband-wife, left-right measurements)
flowchart TD
    A[Two groups to compare]
    B{Are observations<br/>paired/matched?}
    C[Independent t-test]
    D[Paired t-test]

    A --> B
    B -->|No| C
    B -->|Yes| D

Examples of Paired Data

Type Example
Before-After Weight before and after diet
Pre-Post Test scores before and after training
Matched Pairs Twin study comparing treatments
Same Subject Left eye vs right eye measurements
Related Samples Husband and wife income comparison

The Paired t-Test Procedure

Step 1: Calculate Differences

For each pair: $d_i = x_{1i} - x_{2i}$

Step 2: Calculate Statistics of Differences

Mean of differences: \(\bar{d} = \frac{\sum d_i}{n}\)

Standard deviation of differences: \(s_d = \sqrt{\frac{\sum(d_i - \bar{d})^2}{n-1}}\)

Step 3: Calculate Test Statistic

\[t = \frac{\bar{d} - 0}{s_d/\sqrt{n}}\]

Degrees of freedom: df = n - 1 (where n = number of pairs)


Step-by-Step Example 1: Before-After Training

Problem: A training program was conducted for 8 employees. Their performance scores (out of 100) are:

Employee Before After
1 65 72
2 70 78
3 55 60
4 80 85
5 68 75
6 72 80
7 58 65
8 75 82

Test at α = 0.05 if training improved performance.

Solution:

Step 1: State hypotheses

  • $H_0: \mu_d = 0$ (no improvement)
  • $H_1: \mu_d > 0$ (improvement - after is higher)

Let d = After - Before (so positive d means improvement)

Step 2: Calculate differences

Employee Before After d = After - Before
1 65 72 7 49
2 70 78 8 64
3 55 60 5 25
4 80 85 5 25
5 68 75 7 49
6 72 80 8 64
7 58 65 7 49
8 75 82 7 49
Sum     54 374

Step 3: Calculate mean and SD of differences

\[\bar{d} = \frac{54}{8} = 6.75\] \[s_d = \sqrt{\frac{374 - \frac{(54)^2}{8}}{8-1}} = \sqrt{\frac{374 - 364.5}{7}} = \sqrt{\frac{9.5}{7}} = \sqrt{1.357} = 1.165\]

Step 4: Calculate test statistic

\[t = \frac{6.75 - 0}{1.165/\sqrt{8}} = \frac{6.75}{0.412} = 16.38\]

Step 5: Find critical value

  • df = 8 - 1 = 7
  • Right-tailed, α = 0.05
  • From t-table: t* = 1.895

Step 6: Decision

  • t = 16.38 > 1.895
  • Reject H₀

Step 7: Conclusion At the 0.05 level of significance, there is strong evidence that the training program significantly improved employee performance scores.


Step-by-Step Example 2: Policy Impact Assessment

Problem: Blood pressure readings for 10 patients before and after a new medication:

Patient Before After d = Before - After
1 142 138 4
2 155 145 10
3 140 142 -2
4 148 140 8
5 165 155 10
6 150 148 2
7 138 130 8
8 160 152 8
9 145 140 5
10 158 150 8

Test at α = 0.01 if medication reduces blood pressure.

Solution:

Step 1: State hypotheses

  • $H_0: \mu_d = 0$ (no reduction)
  • $H_1: \mu_d > 0$ (reduction - before is higher)

Let d = Before - After (positive d means reduction)

Step 2: Calculate statistics

  • $\sum d = 61$
  • $\sum d^2 = 521$
  • n = 10
\[\bar{d} = \frac{61}{10} = 6.1\] \[s_d = \sqrt{\frac{521 - \frac{(61)^2}{10}}{9}} = \sqrt{\frac{521 - 372.1}{9}} = \sqrt{\frac{148.9}{9}} = \sqrt{16.544} = 4.07\]

Step 3: Calculate test statistic

\[t = \frac{6.1 - 0}{4.07/\sqrt{10}} = \frac{6.1}{1.287} = 4.74\]

Step 4: Find critical value

  • df = 10 - 1 = 9
  • Right-tailed, α = 0.01
  • From t-table: t* = 2.821

Step 5: Decision

  • t = 4.74 > 2.821
  • Reject H₀

Step 6: Conclusion At the 0.01 level of significance, there is strong evidence that the medication significantly reduces blood pressure.


Step-by-Step Example 3: Two-Tailed Paired Test

Problem: Reaction times (milliseconds) of 6 subjects using right and left hands:

Subject Right Left
1 350 380
2 320 340
3 400 390
4 280 310
5 360 350
6 310 325

Test at α = 0.05 if there is a difference.

Solution:

Step 1: State hypotheses

  • $H_0: \mu_d = 0$ (no difference)
  • $H_1: \mu_d \neq 0$ (two-tailed)

Step 2: Calculate differences (d = Right - Left)

Subject d
1 -30 900
2 -20 400
3 10 100
4 -30 900
5 10 100
6 -15 225
Sum -75 2625

Step 3: Calculate statistics \(\bar{d} = \frac{-75}{6} = -12.5\)

\[s_d = \sqrt{\frac{2625 - \frac{(-75)^2}{6}}{5}} = \sqrt{\frac{2625 - 937.5}{5}} = \sqrt{337.5} = 18.37\]

Step 4: Calculate test statistic \(t = \frac{-12.5 - 0}{18.37/\sqrt{6}} = \frac{-12.5}{7.50} = -1.67\)

Step 5: Find critical value

  • df = 6 - 1 = 5
  • Two-tailed, α = 0.05
  • From t-table: t* = ±2.571

Step 6: Decision

  • |t| = 1.67 < 2.571
  • Fail to Reject H₀

Step 7: Conclusion At the 0.05 level of significance, there is insufficient evidence to conclude that reaction times differ between right and left hands.


Confidence Interval for Mean Difference

\[\bar{d} \pm t^* \times \frac{s_d}{\sqrt{n}}\]

Example 4: 95% CI for Mean Improvement

Using Example 1 data:

  • $\bar{d}$ = 6.75
  • $s_d$ = 1.165
  • n = 8
  • df = 7, t* = 2.365 (two-tailed, α = 0.05)

\(95\% \text{ CI} = 6.75 \pm 2.365 \times \frac{1.165}{\sqrt{8}}\) \(= 6.75 \pm 2.365 \times 0.412 = 6.75 \pm 0.97\) \(= (5.78, 7.72)\)

Interpretation: We are 95% confident that the mean improvement in scores after training is between 5.78 and 7.72 points.


Decision Flow: Independent vs Paired

flowchart TD
    A[Comparing two means]
    B{Same subjects<br/>measured twice?}
    C{Subjects<br/>matched?}
    D[Paired t-test]
    E[Independent t-test]

    A --> B
    B -->|Yes| D
    B -->|No| C
    C -->|Yes| D
    C -->|No| E

Why Not Use Independent t-Test for Paired Data?

Aspect Paired t-test Independent t-test
Controls for individual differences Yes No
df n - 1 n₁ + n₂ - 2
Power Higher (removes variation) Lower
When to use Related samples Unrelated samples

Using independent t-test on paired data:

  • Ignores the pairing
  • Loses statistical power
  • May miss real differences

Summary Table

Component Formula
Difference $d_i = x_{1i} - x_{2i}$
Mean difference $\bar{d} = \frac{\sum d_i}{n}$
SD of differences $s_d = \sqrt{\frac{\sum(d_i - \bar{d})^2}{n-1}}$
Test statistic $t = \frac{\bar{d}}{s_d/\sqrt{n}}$
Degrees of freedom df = n - 1
Confidence interval $\bar{d} \pm t^* \times \frac{s_d}{\sqrt{n}}$

Practice Problems

Problem 1

A weight loss program measured 10 participants:

Person Before After
1 85 80
2 92 88
3 78 76
4 95 90
5 88 82
6 76 74
7 82 78
8 90 85
9 84 80
10 88 84

Test at α = 0.05 if the program reduced weight.

Problem 2

Calculate a 95% confidence interval for the mean weight loss in Problem 1.

Problem 3

Job satisfaction scores (1-10) before and after policy change:

Employee Before After
1 6 8
2 5 6
3 7 7
4 4 6
5 6 7
6 5 8

Test at α = 0.05 if satisfaction improved.

Problem 4

Test scores of students taught by two different methods (same students, both methods):

Student Method A Method B
1 75 78
2 82 85
3 68 70
4 90 88
5 85 86

Test at α = 0.05 if there is a significant difference.


Summary

  1. Paired t-test is used for dependent/matched samples
  2. Calculate differences first, then test if mean difference ≠ 0
  3. df = n - 1 (number of pairs minus 1)
  4. More powerful than independent t-test when pairing is relevant
  5. Common applications: before-after studies, matched designs

Next Topic

In the next chapter, we will study the Chi-Square Test for testing categorical data - association between variables and goodness of fit.