Two-sample comparison of proportions power calculation
n = 1486
p1 = 0.4975
p2 = 0.5025
sig.level = 0.05
power = 0.05855136
alternative = two.sided
NOTE: n is number in *each* group
Defense Against the Dark Arts
University of Oxford
If your only tool is a hammer, then every problem looks like a nail. – Anonymous
If philosophy is outlawed, only outlaws will do philosophy. – Andrew Gelman
Two researchers carried out independent studies to answer the same research question. The first reports an effect estimate and standard error of 25 ± 10. The second reports 10 ± 10.
True or False. If false, explain.
Inductive logic studies risky arguments. A risky argument can be a very good one, and yet its conclusion can be false, even when all the premises are true. Most of our arguments are risky.
Either an exceptionally rare chance has ocurred, or the theory [\(H_0\)] is not true. – Fisher
Warning
Alternatively: assumptions used to compute p are wrong and nothing rare happened!
If \(H_0\) is true, then this result (statistical significance) would probably not occur. This result has occurred. Then \(H_0\) is probably not true.
A logically equivalent argument:
If a person is an American [\(H_0\) is true], then he is probably not a member of Congress. (TRUE, RIGHT?) This person is a member of Congress. Therefore, he is probably not an American [\(H_0\) is probably not true].
A spectacular vindication of the principle that each individual coin spun individually (he spins one) is as likely to come down heads as tails and therefore it should cause no surprise each individual time it does.
– Rosencrantz & Guildenstern are Dead
P(HHHHHHHH)= P(TTTTHHTH) = P(HTHTHTHT) = 1/256
Inferring from an unlikely result to the presence of a significant effect presupposes that the observed result is much more likely under an implicitly conceived alternative than under the null. – Sprenger (2016)
One in a hundred women has breast cancer. If you have breast cancer (\(H_1\)), there is a 95% chance that you will test positive; if you do not have breast cancer \((H_0)\), there is a 2% chance that you will nonetheless test positive. We know nothing about Alice other than the fact that she tested positive. How likely is it that she has breast cancer?
Classical statistics is directed towards the use of sample information … in making inferences about \(\theta\). These classical inferences are for the most part without regard to the use to which they are put. In decision theory, on the other hand, an attempt is made to combine the sample information with other relevant aspects of the problem in order to make the best decision.
In the field of pure research no assessment of the cost of wrong conclusions … can conceivably be more than a pretence, and in any case such an assessment would be in admissible and irrelevant in judging the state of the scientific evidence
In other words: Fisher thinks Neyman is missing the point of science. It’s not generally about solving decision problems, even if such problems do genuinely arise in some areas.
Suppose that \(\widehat{\theta} \sim \text{N}(\theta, \text{SE})\) and define the shorthand \[ T \equiv \frac{\widehat{\theta} - \theta_0}{\text{SE}}, \quad \kappa \equiv (\theta - \theta_0) / \text{SE}, \quad c_p \equiv \texttt{qnorm}(1 - p) \] One-sided test of \(H_0\colon \theta = \theta_0\) versus \(H_0\colon \theta > \theta_0\) \[ \mathbb{P}(T>c_{\alpha}) = \mathbb{P}(Z + \kappa > c_\alpha) = 1 - \texttt{pnorm}(c_\alpha - \kappa) \] Two-sided test of \(H_0\colon \theta = \theta_0\) versus \(H_1 \colon \theta \neq \theta_0\) \[ \mathbb{P}(|T|>c_{\alpha/2}) = \mathbb{P}(|Z+\kappa|>c_{\alpha/2}) = \texttt{pnorm}(-c_{\alpha/2} - \kappa) + 1 - \texttt{pnorm}(c_{\alpha/2}-\kappa) \]
\(X_1, \dots, X_n \sim \text{N}(\mu, \sigma^2)\); estimate \(\mu\) using \(\bar{X}_n\)
power.t.test()
does both for t-tests.Kanazawa (2007): Beautiful people have more daughters
There is a large literature on variation in the sex ratio of human births, and the effects that have been found have been on the order of 1 percentage point (for example, the probability of a girl birth shifting from 48.5 percent to 49.5 percent). Variation attributable to factors such as race, parental age, birth order, maternal weight, partnership status and season of birth is estimated at from less than 0.3 percentage points to about 2 percentage points, with larger changes (as high as 3 percentage points) arising under economic conditions of poverty and famine.
If your test has low power and you reject the null: