How to Run a Paired Samples T-Test in SPSS and Interpret Results

The paired samples t-test compares the means of two related measurements from the same group of participants. It is the standard test for pre-test/post-test designs, before-and-after studies, and matched-pair experiments. If you measured the same people twice under different conditions, this is the test you need.

When to Use the Paired Samples T-Test

Use this test when:

You have one group of participants measured at two time points (pre and post)
You are comparing two related conditions measured on the same participants (e.g., test performance under noise vs. silence)
You have matched pairs where each participant in one group is matched with a participant in the other group
The dependent variable is continuous (interval or ratio scale)

Example research questions:

Did the training program significantly improve employee performance scores?
Is there a significant difference in anxiety levels before and after the intervention?
Do students score differently on math versus reading assessments?

Assumptions

Before running the test, verify these assumptions:

1. Continuous Dependent Variable

The dependent variable must be measured on an interval or ratio scale (e.g., test scores, time, weight, rating scales treated as continuous).

2. Related Pairs

Each observation in one condition must be meaningfully paired with an observation in the other condition. Typically, this means the same participant is measured twice.

3. No Significant Outliers in the Difference Scores

Calculate the difference between paired observations. There should be no extreme outliers in these difference scores.

How to check: Create the difference variable (Transform → Compute Variable), then use Analyze → Explore to generate a boxplot. Cases beyond the whiskers are potential outliers.

4. Approximate Normality of the Difference Scores

The distribution of the difference scores should be approximately normal. This is especially important for small samples (n < 30).

How to check: Run the Shapiro-Wilk test on the difference scores (Analyze → Explore → Plots → Normality plots with tests). If p > .05, the normality assumption is met.

For samples larger than 30, the paired samples t-test is robust to mild violations of normality due to the Central Limit Theorem.

Step-by-Step in SPSS

Step 1: Set Up Your Data

Your data should have one row per participant and two columns — one for each measurement. For example:

Participant	Pre_Score	Post_Score
1	42	58
2	55	61
3	38	49
4	47	52

Step 2: Run the Test

Go to Analyze → Compare Means → Paired-Samples T Test
Select your two variables and move them into the Paired Variables box
- Variable 1: Pre-test measurement
- Variable 2: Post-test measurement
Click OK

Step 3: Optional — Enable Effect Size

In newer versions of SPSS (Version 27+):

Click Options before running the test
Check Effect size (Cohen's d)
Click Continue, then OK

Interpreting the Output

SPSS produces three tables. Here is what each one tells you.

Table 1: Paired Samples Statistics

This table shows the descriptive statistics for each variable:

Mean: The average score for each time point
N: The number of valid pairs
Std. Deviation: The standard deviation for each variable
Std. Error Mean: The standard error of the mean

Look at the means to see the direction of change. If the post-test mean is higher than the pre-test mean, scores increased on average.

Table 2: Paired Samples Correlations

This table shows the Pearson correlation between the two measurements.

A positive correlation means participants who scored high at pre-test tended to score high at post-test as well
The correlation should typically be positive and significant in a pre-test/post-test design, confirming that the measurements are indeed related

If the correlation is very low or negative, consider whether the paired design is appropriate.

Table 3: Paired Samples Test

This is the most important table. It tests whether the mean difference between the paired measurements is significantly different from zero.

Key values:

Mean (of the difference): The average difference between paired observations. If Pre was subtracted from Post, a positive mean indicates an increase
Std. Deviation: The standard deviation of the difference scores
Std. Error Mean: The standard error of the mean difference
95% Confidence Interval of the Difference: If this interval does not contain zero, the difference is significant at the .05 level
t: The t-statistic
df: Degrees of freedom = N − 1 (number of pairs minus 1)
Sig. (2-tailed): The p-value

Decision rule:

If Sig. (2-tailed) < .05, there is a statistically significant difference between the two measurements
If Sig. (2-tailed) > .05, the difference is not statistically significant

Calculating Effect Size

Effect size tells you how large the difference is in practical terms. The most common measure for paired samples t-tests is Cohen's d.

If SPSS Reports It

In SPSS Version 27 and later, Cohen's d appears in the output if you enabled it in Options.

Manual Calculation

If your version of SPSS does not report it, calculate it yourself:

Cohen's d = Mean difference ÷ Standard deviation of the difference scores

Both values are available in the Paired Samples Test table.

Interpretation:

d = 0.2: Small effect
d = 0.5: Medium effect
d = 0.8: Large effect

APA Reporting

Format

A paired samples t-test was conducted to compare [DV] before and after [intervention/condition]. There was a significant difference in scores between pre-test (M = [mean], SD = [SD]) and post-test (M = [mean], SD = [SD]); t(df) = [t-value], p = [p-value], d = [Cohen's d].

Example

A paired samples t-test was conducted to compare anxiety scores before and after the mindfulness intervention. There was a significant reduction in anxiety from pre-intervention (M = 42.30, SD = 8.15) to post-intervention (M = 35.60, SD = 7.42); t(29) = 4.87, p < .001, d = 0.89, indicating a large effect.

When Results Are Not Significant

A paired samples t-test indicated no significant difference in job satisfaction between the first assessment (M = 3.85, SD = 0.92) and the second assessment (M = 3.91, SD = 0.88); t(44) = 0.72, p = .476, d = 0.11.

One-Tailed vs. Two-Tailed Tests

By default, SPSS reports a two-tailed p-value. If you have a directional hypothesis (e.g., "scores will increase after the intervention"), you can use a one-tailed test:

One-tailed p-value = Two-tailed p-value ÷ 2

Only use a one-tailed test when your hypothesis was directional before you collected the data and this is stated in your proposal.

What to Do When Assumptions Are Violated

Outliers in Difference Scores

If you find extreme outliers:

Verify they are not data entry errors
Run the analysis with and without the outliers
If results differ substantially, report both and discuss the impact
Consider using a non-parametric alternative

Non-Normal Difference Scores

If the Shapiro-Wilk test is significant (p < .05), indicating non-normality:

Use the Wilcoxon Signed-Rank Test — the non-parametric equivalent of the paired samples t-test.

Go to Analyze → Nonparametric Tests → Legacy Dialogs → 2 Related Samples
Move your paired variables into the Test Pairs box
Check Wilcoxon
Click OK

Common Mistakes to Avoid

Using an independent samples t-test instead — If the same participants are measured twice, you must use the paired test. An independent test treats the observations as unrelated and produces incorrect results
Ignoring the direction of the difference — The sign of the mean difference depends on which variable SPSS subtracted from which. Always check the Paired Samples Statistics table to confirm the direction
Not checking assumptions — Especially normality of difference scores in small samples. Violations may require a non-parametric alternative
Reporting the wrong means — Report the means for each time point from Table 1, not the mean difference from Table 3
Forgetting effect size — Statistical significance tells you whether the difference is real, but effect size tells you whether it matters. Always report Cohen's d

Paired Samples T-Test vs. Other Tests

Scenario	Correct Test
Two related measurements, continuous DV, normal differences	Paired samples t-test
Two related measurements, ordinal DV or non-normal differences	Wilcoxon signed-rank test
Two independent groups, continuous DV	Independent samples t-test
Three or more related measurements	Repeated measures ANOVA
Two related categorical measurements	McNemar's test

Need help running a paired samples t-test or any pre-test/post-test analysis for your thesis? Our team handles the full analysis from assumption checking through APA reporting. Get a free quote.