Anastasi, A., & Urbina, S. (1997). Psychological testing (7th ed.). London: Prentice-Hall International.

Basu, D. (1981). On ancillary statistics, pivotal quantities, and confidence statements. In Y. P. Chaubey & T. D. Dwivedi (Eds.), Topics in applied statistics (pp. 1–29). Montreal: Concordia University.

Berger, J. O. (2006). Bayes factors. In S. Kotz, N. Balakrishnan, C. Read, B. Vidakovic, & N. L. Johnson (Eds.), Encyclopedia of statistical sciences (Second edition., Vol. 1, pp. 378–386). Hoboken, New Jersey: John Wiley & Sons.

Berger, J. O., & Wolpert, R. L. (1988). The likelihood principle (2nd ed.). Hayward, CA: Institute of Mathematical Statistics.

Blaker, H., & Spjøtvoll, E. (2000). Paradoxes and improvements in interval estimation. The American Statistician, 54(4), 242–247.

Bolstad, W. (2007). Introduction to Bayesian statistics. Hoboken, NJ: Wiley.

Bonett, D. G., & Price, R. M. (2002). Statistical inference for a linear function of medians: Confidence intervals, hypothesis testing, and sample size requirements. Psychological Methods, 7, 370–383.

Brown, L. (1967). The conditional level of Student’s \(t\) test. The Annals of Mathematical Statistics, 38(4), 1068–1071.

Buehler, R. J. (1959). Some validity criteria for statistical inferences. The Annals of Mathematical Statistics, 30(4), 845–863.

Buehler, R. J., & Feddersen, A. P. (1963). Note on a conditional property of Student’s \(t^1\). The Annals of Mathematical Statistics, 34(3), 1098–1100.

Casella, G. (1992). Conditional inference from confidence sets. Lecture Notes-Monograph Series, 17, 1–12.

Casella, G., & Berger, R. L. (2002). Statistical inference. Pacific Grove, CA: Duxbury.

Cronbach, L. J. (1990). Essentials of psychological testing (5th ed.). New York: Harper & Row.

Cumming, G. (2014). The new statistics: Why and how. Psychological Science, 25, 7–29.

Cumming, G., & Fidler, F. (2009). Confidence intervals: Better answers to better questions. Zeitschrift für Psychologie, 217, 15–26.

Cumming, G., & Finch, S. (2001). A primer on the understanding, use, and calculation of confidence intervals that are based on central and noncentral distributions. Educational and Psychological Measurement, 61, 532–574.

Cumming, G., & Finch, S. (2005). Inference by eye: Confidence intervals and how to read pictures of data. American Psychologist, 60(2), 170–180.

Cumming, S. P., Sherar, L. B., Gammon, C., Standage, M., & Malina, R. M. (2012). Physical Activity and Physical Self-Concept in Adolescence: A Comparison of Girls at the Extremes of the Biological Maturation Continuum. Journal of Research on Adolescence, 22(4), 746–757.

Dienes, Z. (2011). Bayesian versus orthodox statistics: Which side are you on? Perspectives on Psychological Science, 6, 274–290.

Dufour, J.-M. (1997). Some impossibility theorems in econometrics with applications to structural and dynamic models. Econometrica, 65(6), 1365–1387.

Fidler, F., & Loftus, G. R. (2009). Why figures with error bars should replace \(p\) values: Some conceptual arguments and empirical demonstrations. Zeitschrift fūr Psychologie, 217(1), 27–37.

Fidler, F., & Thompson, B. (2001). Computing correct confidence intervals for ANOVA fixed- and random-effects effect sizes. Educational and Psychological Measurement, 61, 575–604.

Finch, W. H., & French, B. F. (2012). A Comparison of Methods for Estimating Confidence Intervals for Omega-Squared Effect Size. Educational and Psychological Measurement, 72(1), 68–77.

Fisher, R. A. (1935). The fiducial argument in statistical inference. Annals of eugenics, 6, 391–398.

Fisher, R. A. (1955). Statistical methods and scientific induction. Journal of the Royal Statistical Society. Series B (Methodological), 17, 69–78.

Fisher, R. A. (1959). Statistical Methods and Scientific Inference (Second.). Edinburgh, UK: Oliver; Boyd.

Gelman, A. (2008). Rejoinder. Bayesian analysis, 3, 467–478.

Gelman, A. (2011, August). Why it doesn’t make sense in general to form confidence intervals by inverting hypothesis tests. [blog post]. Retrieved from

Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. (2004). Bayesian data analysis (2nd edition). London: Chapman; Hall.

Gilroy, K. E., & Pearce, J. M. (2014). The Role of Local, Distal, and Global Information in Latent Spatial Learning. Journal of Experimental Psychology, 40(2), 212–224.

Hamerman, E. J., & Morewedge, C. K. (2015). Reliance on Luck: Identifying Which Achievement Goals Elicit Superstitious Behavior. Personality and Social Psychology Bulletin, 41(3), 323–335.

Hoekstra, R., Finch, S., Kiers, H. A. L., & Johnson, A. (2006). Probability as certainty: Dichotomous thinking and the misuse of \(p\) values. Psychonomic Bulletin & Review, 13, 1033–1037.

Hoekstra, R., Morey, R. D., Rouder, J. N., & Wagenmakers, E.-J. (2014). Robust misinterpretation of confidence intervals. Psychonomic Bulletin & Review, 21(5), 1157–1164.

Hollingdale, J., & Greitemeyer, T. (2014). The Effect of Online Violent Video Games on Levels of Aggression. PLoS ONE, 9(11), e111790. Retrieved from

Howson, C., & Urbach, P. (2006). Scientific reasoning: The Bayesian approach. La Salle, Illinois: Open Court.

Jackman, S. (2009). Bayesian analysis for the social sciences. Chichester, United Kingdom: John Wiley & Sons.

Jaynes, E. (2003). Probability theory: The logic of science. Cambridge, UK: Cambridge University Press.

Jeffreys, H. (1961). Theory of probability (3rd edition). New York: Oxford University Press.

Kelley, K. (2007a). Confidence intervals for standardized effect sizes: Theory, application, and implementation. Journal of Statistical Software, 20(8).

Kelley, K. (2007b). Methods for the behavioral, educational, and social sciences: An R package. Behavioral Research Methods, 39(4), 979–984.

Kruschke, J. K. (2010). What to believe: Bayesian methods for data analysis. Trends in Cognitive Sciences, 14(7), 293–300.

Lahiri, D. K., Maloney, B., Rogers, J. T., & Ge, Y.-W. (2013). PuF, an antimetastatic and developmental signaling protein, interacts with the Alzheimer’s amyloid-beta precursor protein via a tissue-specific proximal regulatory element (PRE). Bmc Genomics, 14, 68.

Lee, M. D., & Wagenmakers, E.-J. (2013). Bayesian modeling for cognitive science: A practical course. Cambridge University Press.

Lehmann, E. H. (1959). Testing statistical hypotheses. New York: John Wiley & Sons.

Lindley, D. V. (1965). Introduction to probability and statistics from a Bayesian point of view, part 2: Inference. Cambridge, England: Cambridge University Press.

Lindley, D. V. (1985). Making decisions (2nd ed.). London: Wiley.

Loftus, G. R. (1993). A picture is worth a thousand \(p\)-values: On the irrelevance of hypothesis testing in the computer age. Behavior Research Methods, Instrumentation and Computers, 25, 250–256.

Loftus, G. R. (1996). Psychology will be a much better science when we change the way we analyze data. Current directions in psychological science, 5, 161–171.

Lynch, S. M. (2007). Introduction to applied Bayesian statistics and estimation for social scientists. New York: Springer.

Masson, M. E. J., & Loftus, G. R. (2003). Using confidence intervals for graphically based data interpretation. Canadian Journal of Experimental Psychology, 57, 203–220.

Mayo, D. G. (1981). In defense of the Neyman-Pearson theory of confidence intervals. Philosophy of Science, 48(2), 269–280.

Mayo, D. G. (1982). On after-trial criticisms of Neyman-Pearson theory of statistics. PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association, 1982, 145–158.

Mayo, D. G., & Cox, D. R. (2006). Frequentist statistics as a theory of inductive inference. Institute of Mathematical Statistics Lecture Notes - Monograph Series, 49, 77–97.

Mayo, D. G., & Spanos, A. (2006). Severe testing as a basic concept in a Neyman-Pearson philosophy of induction. British Journal for the philosophy of science, 57, 323–357.

McGrayne, S. B. (2011). The theory that would not die. New Haven: Yale University Press.

Morey, R. D., Rouder, J. N., Verhagen, J., & Wagenmakers, E.-J. (2014). Why hypothesis tests are essential for psychological science: A comment on Cumming. Psychological Science, 1289–1290.

Neyman, J. (1934). On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection. Journal of the Royal Statistical Society, 97(4), 558–625.

Neyman, J. (1937). Outline of a theory of statistical estimation based on the classical theory of probability. Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences, 236, 333–380.

Neyman, J. (1941). Fiducial argument and the theory of confidence intervals. Biometrika, 32(2), 128–150.

Neyman, J. (1952). Lectures and conferences on mathematical statistics and probability. Washington, D.C.: Graduate School, U.S. Department of Agriculture.

Neyman, J. (1977). Frequentist probability and frequentist statistics. Synthese, 36(1), 97–131.

Ntzoufras, I. (2009). Bayesian Modeling Using WinBUGS. Hoboken, New Jersey: Wiley.

Olive, D. J. (2008). Applied robust statistics. online electronic book. Retrieved from

Pratt, J. W. (1961). Book review: Testing Statistical Hypotheses, by E. L. Lehmann. Journal of the American Statistical Association, 56(293), pp. 163–167.

Pratt, J. W., Raiffa, H., & Schlaifer, R. (1995). Introduction to statistical decision theory. Cambridge, MA: MIT Press.

Psychonomics Society. (2012). Psychonomic Society guidelines on statistical issues. Retrieved from

Reichenbach, H. (1949). The theory of probability. Berkeley, University of California Press.

Robinson, G. K. (1979). Conditional properties of statistical procedures. The Annals of Statistics, 7(4), 742–755. Retrieved from

Rouder, J. N., Morey, R. D., Speckman, P. L., & Province, J. M. (2012). Default Bayes factors for ANOVA designs. Journal of Mathematical Psychology, 56, 356–374. Retrieved from

Rouder, J. N., Speckman, P. L., Sun, D., Morey, R. D., & Iverson, G. (2009). Bayesian \(t\)-tests for accepting and rejecting the null hypothesis. Psychonomic Bulletin and Review, 16, 225–237. Retrieved from

Rusu, F., & Dobra, A. (2008). Sketches for size of join estimation. ACM Transactions on Database Systems, 33, 15:1–15:46.

Spanos, A. (2011). Revisiting the Welch uniform model: A case for conditional inference? Advances and Applications in Statistical Science, 5, 33–52.

Steiger, J. H. (2004). Beyond the \(F\) test: Effect size confidence intervals and tests of close fit in the analysis of variance and contrast analysis. Psychological Methods, 9(2), 164–182.

Steiger, J. H., & Fouladi, R. T. (1997). Noncentrality interval estimation and the evaluation of statistical models. In L. Harlow, S. Mulaik, & J. Steiger (Eds.), What if there were no significance tests? (pp. 221–257). Mahwah, New Jersey: Erlbaum.

Stock, J. H., & Wright, J. H. (2000). GMM with weak identification. Econometrica, 68(5), 1055–1096.

Todd, T. P., Vurbic, D., & Bouton, M. E. (2014). Mechanisms of Renewal After the Extinction of Discriminated Operant Behavior. Journal of Experimental Psychology, 40(3), 355–368.

Velicer, W. F., Cumming, G., Fava, J. L., Rossi, J. S., Prochaska, J. O., & Johnson, J. (2008). Theory testing using quantitative predictions of effect size. Applied Psychology, 57(4), 589–608. Retrieved from

Venn, J. (1888). The logic of chance (third edition.). London: Macmillan. Retrieved from

Wagenmakers, E.-J., Lee, M. D., Lodewyckx, T., & Iverson, G. (2008). Bayesian versus frequentist inference. In H. Hoijtink, I. Klugkist, & P. Boelen (Eds.), Practical Bayesian approaches to testing behavioral and social science hypotheses (pp. 181–207). New York: Springer.

Wagenmakers, E.-J., Verhagen, J., Ly, A., Bakker, M., Lee, D., M. D. Matzke, Rouder, J. N., & Morey, R. D. (2014). A power fallacy. Behavioral Research Methods.

Wasserman, L. (2008). Comment on article by Gelman. Bayesian Analysis, 3, 463–466.

Welch, B. L. (1939). On confidence limits and sufficiency, with particular reference to parameters of location. The Annals of Mathematical Statistics, 10(1), 58–69.

Wetzels, R., Grasman, R. P., & Wagenmakers, E.-J. (2012). A default Bayesian hypothesis test for ANOVA designs. American Statistician, 66, 104–111.

Wilkinson, L., & Task Force on Statistical Inference. (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54, 594–604.

Winter, C., Van Acker, F., Bonduelle, M., Desmyttere, S., De Schrijver, F., & Nekkebroeck, J. (2014). Cognitive and psychomotor development of 5-to 6-year-old singletons born after PGD: A prospective case-controlled matched study. Human Reproduction, 29(9), 1968–1977.

Woods, C. M. (2007). Confidence intervals for gamma-family measures of ordinal association. Psychological Methods, 12(2), 185–204.

Young, K. D., & Lewis, R. J. (1997). What is confidence? Part 1: The use and interpretation of confidence intervals. Annals of Emergency Medicine, 30(3), 307–310.

Zou, G. Y. (2007). Toward using confidence intervals to compare correlations. Psychological Methods, 12(4), 399–413.