The earth is flat (p > 0.05): significance thresholds and the crisis of unreplicable research

RT @vamrhein: @lewis_halsey @RSocPublishing Thanks, important paper! Of course all those alternatives could be misused, in the same way as…
“putting more emphasis on interpreting effect sizes and interval estimates, using non-automated informed judgment” https://t.co/xFTr70QHx5
2416 days ago
RT @vamrhein: @lewis_halsey @RSocPublishing Thanks, important paper! Of course all those alternatives could be misused, in the same way as…
RT @vamrhein: @lewis_halsey @RSocPublishing Thanks, important paper! Of course all those alternatives could be misused, in the same way as…
RT @vamrhein: @lewis_halsey @RSocPublishing Thanks, important paper! Of course all those alternatives could be misused, in the same way as…
@lewis_halsey @RSocPublishing Thanks, important paper! Of course all those alternatives could be misused, in the same way as P-values, for making "yes" or "no" decisions from single studies. Also model selection based on delta AIC thresholds will lead to inflated effect sizes. https://t.co/PvNvK2WOu2
2466 days ago
https://t.co/JFSIM9pkO1
2467 days ago
RT @vamrhein: The null hypothesis cannot be 'confirmed' nor 'strengthened' using a p-value, because very likely there are many better hypot…
RT @vamrhein: The null hypothesis cannot be 'confirmed' nor 'strengthened' using a p-value, because very likely there are many better hypot…
RT @laurentbusca: "Surely, God loves the .06 nearly as much as the .05” (Rosnow & Rosenthal, 1989). La littérature sur les tests statistiqu…
2474 days ago
RT @GUINALIU: Una excelente revisión del tema científico de moda: el uso del p-value. https://t.co/phLedjMjPY
Una excelente revisión del tema científico de moda: el uso del p-value. https://t.co/phLedjMjPY
RT @laurentbusca: "Surely, God loves the .06 nearly as much as the .05” (Rosnow & Rosenthal, 1989). La littérature sur les tests statistiqu…
RT @laurentbusca: "Surely, God loves the .06 nearly as much as the .05” (Rosnow & Rosenthal, 1989). La littérature sur les tests statistiqu…
"Surely, God loves the .06 nearly as much as the .05” (Rosnow & Rosenthal, 1989). La littérature sur les tests statistiques d'hypothèses est parfois merveilleuse, contrairement aux idées reçues. Lisez : https://t.co/HUlOaC1uEG
First, consider reading https://t.co/VqDMi6T2UE and then consider subscribe the comment. Researchers are capable to more profound discussion about science rather than p-value based binary decisions! https://t.co/LOemTrhzj6
Interesting paper on p-values and replication that is worth 30 minutes of your time https://t.co/Cey35VX5c3
RT @vamrhein: Selective reporting was encouraged since Fisher (1937): "it is usual and convenient for experimenters to take 5 per cent as a…
RT @vamrhein: Selective reporting was encouraged since Fisher (1937): "it is usual and convenient for experimenters to take 5 per cent as a…
Selective reporting was encouraged since Fisher (1937): "it is usual and convenient for experimenters to take 5 per cent as a standard level of significance, in the sense that they are prepared to ignore all results which fail to reach this standard." https://t.co/bWu0iag4pw
Enjoyed very much discussing @vamrhein's "The earth is flat (p > 0.05)" in our @IZWberlin journal group!
@PieterHog @robustgar @lakens @NeuroStats @learnfromerror The Neyman–Pearson decision procedure was particularly suitable for industrial quality control, or "sampling tests laid down in commercial specifications" (Neyman & Pearson 1933). https://t.co/FVD5GmX5Eo
2545 days ago
RT @vamrhein: A significance test often does not make a clear statement about an effect, but instead it "examines if the sample size is lar…
RT @vamrhein: A significance test often does not make a clear statement about an effect, but instead it "examines if the sample size is lar…
RT @vamrhein: A significance test often does not make a clear statement about an effect, but instead it "examines if the sample size is lar…