Between-subject design and within-subject anlyses

Question

I have a between-subject design in which there are two groups (A, B) and two conditions (x, y). I ran an ANOVA and found no interaction whatsoever and no significant differences in the two conditions.
The reviewer is asking me to consider running two different analyses testing for x vs. y, but one analysis only on group A and one analysis on group B.
The reviewer believes that in B the comparison between x vs. y might be significant, but we have not enough power to capture this in a between-subject design.
I believe that this analysis cannot be done - unless you get a group*condition interaction - but is this true? Or is it possible to run within-subject analyses in between-subject designs?

BruceET · Answer

This is a crude exploration of my concern about false discovery:
Below I 'deconstruct' a 2-factor ANOVA into t tests, looking A vs B, x vs y, and interaction and finding nothing significant at the 5% level. Then looking at x vs y for B alone, I find
a significant result.
set.seed(2020)
ax = rnorm(12,100, 15)
ay = rnorm(12,101, 15)
bx = rnorm(12,102, 15)
by = rnorm(12, 98, 15)

t.test(c(ax,by), c(ay,bx))$p.val # "int"
[1] 0.05200111
t.test(c(ax,bx), c(ay,by))$p.val # x vs y
[1] 0.3837236
t.test(c(ax,ay), c(bx,by))$p.val # a vs b
[1] 0.8644972
t.test(bx,by)$p.val              # x vs y for b
[1] 0.006613897                  # **

So maybe the reviewer has a valid point.
But wait, there's more. Change the seed, but keep exactly the
same parameters--as if someone wants to replicate your work.
Same tests, but finding nothing anywhere.
set.seed(720)
ax = rnorm(12,100, 15)
ay = rnorm(12,101, 15)
bx = rnorm(12,102, 15)
by = rnorm(12, 98, 15)
t.test(c(ax,by), c(ay,bx))$p.val # "int"
[1] 0.1583043
t.test(c(ax,bx), c(ay,by))$p.val # x vs y
[1] 0.5041701
t.test(c(ax,ay), c(bx,by))$p.val # a vs b
[1] 0.668125
t.test(bx,by)$p.val              # x vs y for b
[1] 0.1737395

Maybe your reviewer is inviting you to a "P-hacking" party.
Addendum: With 12 observations from each distribution, the power of
a Welch t test to distinguish between $mathsf{Norm}(102,15)$
and $mathsf{Norm}(98,15)$ is less than 10%.
set.seed(1234)
pv = replicate(10^5, t.test(rnorm(12,102,15),rnorm(12,98,15))$p.val)
mean(pv <= .05)
[1] 0.09417

Between-subject design and within-subject anlyses

One Answer

Add your own answers!

Ask a Question