Inferential statistics are descriptive statistics
- Published
- Accepted
- Subject Areas
- Science and Medical Education, Science Policy, Statistics, Data Science
- Keywords
- NHST, Significance, Publication bias, Replicability, P-value, Descriptive statistics, Inferential statistics, Uncertainty, Reproducibility, Hypothesis testing
- Copyright
- © 2018 Amrhein et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
- Cite this article
- 2018. Inferential statistics are descriptive statistics. PeerJ Preprints 6:e26857v2 https://doi.org/10.7287/peerj.preprints.26857v2
Abstract
There has been much discussion of a "replication crisis" related to statistical inference, which has largely been attributed to overemphasis on and abuse of hypothesis testing. Much of the abuse stems from failure to recognize that statistical tests not only test hypotheses, but countless assumptions and the entire environment in which research takes place. Honestly reported results must vary from replication to replication because of varying assumption violations and random variation; excessive agreement itself would suggest deeper problems, such as failure to publish results in conflict with group expectations or desires. Considerable non-replication is thus to be expected even with the best reporting practices, and generalizations from single studies are rarely if ever warranted. Because of all the uncertain and unknown assumptions that underpin statistical inferences, we should treat inferential statistics as highly unstable local descriptions of relations between assumptions and data, rather than as generalizable inferences about hypotheses or models. And that means we should treat statistical results as being much more incomplete and uncertain than is currently the norm. Rather than focusing our study reports on uncertain conclusions, we should thus focus on describing accurately how the study was conducted, what data resulted, what analysis methods were used and why, and what problems occurred.
Author Comment
This is a revision for 'The American Statistician.'