Search | VHL Regional Portal

1.

Subjective evidence evaluation survey for many-analysts studies.

Sarafoglou, Alexandra; Hoogeveen, Suzanne; van den Bergh, Don; Aczel, Balazs; Albers, Casper J; Althoff, Tim; Botvinik-Nezer, Rotem; Busch, Niko A; Cataldo, Andrea M; Devezer, Berna; van Dongen, Noah N N; Dreber, Anna; Fried, Eiko I; Hoekstra, Rink; Hoffman, Sabine; Holzmeister, Felix; Huber, Jürgen; Huntington-Klein, Nick; Ioannidis, John; Johannesson, Magnus; Kirchler, Michael; Loken, Eric; Mangin, Jan-Francois; Matzke, Dora; Menkveld, Albert J; Nilsonne, Gustav; van Ravenzwaaij, Don; Schweinsberg, Martin; Schulz-Kuempel, Hannah; Shanks, David R; Simons, Daniel J; Spellman, Barbara A; Stoevenbelt, Andrea H; Szaszi, Barnabas; Trübutschek, Darinka; Tuerlinckx, Francis; Uhlmann, Eric L; Vanpaemel, Wolf; Wicherts, Jelte; Wagenmakers, Eric-Jan.

R Soc Open Sci ; 11(7): 240125, 2024 Jul.

Article in English | MEDLINE | ID: mdl-39050728

ABSTRACT

Many-analysts studies explore how well an empirical claim withstands plausible alternative analyses of the same dataset by multiple, independent analysis teams. Conclusions from these studies typically rely on a single outcome metric (e.g. effect size) provided by each analysis team. Although informative about the range of plausible effects in a dataset, a single effect size from each team does not provide a complete, nuanced understanding of how analysis choices are related to the outcome. We used the Delphi consensus technique with input from 37 experts to develop an 18-item subjective evidence evaluation survey (SEES) to evaluate how each analysis team views the methodological appropriateness of the research design and the strength of evidence for the hypothesis. We illustrate the usefulness of the SEES in providing richer evidence assessment with pilot data from a previous many-analysts study.

2.

Testing Conditional Independence in Psychometric Networks: An Analysis of Three Bayesian Methods.

Sekulovski, Nikola; Keetelaar, Sara; Huth, Karoline; Wagenmakers, Eric-Jan; van Bork, Riet; van den Bergh, Don; Marsman, Maarten.

Multivariate Behav Res ; : 1-21, 2024 May 11.

Article in English | MEDLINE | ID: mdl-38733319

ABSTRACT

Network psychometrics uses graphical models to assess the network structure of psychological variables. An important task in their analysis is determining which variables are unrelated in the network, i.e., are independent given the rest of the network variables. This conditional independence structure is a gateway to understanding the causal structure underlying psychological processes. Thus, it is crucial to have an appropriate method for evaluating conditional independence and dependence hypotheses. Bayesian approaches to testing such hypotheses allow researchers to differentiate between absence of evidence and evidence of absence of connections (edges) between pairs of variables in a network. Three Bayesian approaches to assessing conditional independence have been proposed in the network psychometrics literature. We believe that their theoretical foundations are not widely known, and therefore we provide a conceptual review of the proposed methods and highlight their strengths and limitations through a simulation study. We also illustrate the methods using an empirical example with data on Dark Triad Personality. Finally, we provide recommendations on how to choose the optimal method and discuss the current gaps in the literature on this important topic.

3.

Power priors for replication studies.

Pawel, Samuel; Aust, Frederik; Held, Leonhard; Wagenmakers, Eric-Jan.

Test (Madr) ; 33(1): 127-154, 2024.

Article in English | MEDLINE | ID: mdl-38585622

ABSTRACT

The ongoing replication crisis in science has increased interest in the methodology of replication studies. We propose a novel Bayesian analysis approach using power priors: The likelihood of the original study's data is raised to the power of α, and then used as the prior distribution in the analysis of the replication data. Posterior distribution and Bayes factor hypothesis tests related to the power parameter α quantify the degree of compatibility between the original and replication study. Inferences for other parameters, such as effect sizes, dynamically borrow information from the original study. The degree of borrowing depends on the conflict between the two studies. The practical value of the approach is illustrated on data from three replication studies, and the connection to hierarchical modeling approaches explored. We generalize the known connection between normal power priors and normal hierarchical models for fixed parameters and show that normal power prior inferences with a beta prior on the power parameter α align with normal hierarchical model inferences using a generalized beta prior on the relative heterogeneity variance I2. The connection illustrates that power prior modeling is unnatural from the perspective of hierarchical modeling since it corresponds to specifying priors on a relative rather than an absolute heterogeneity scale.

4.

Footprint of publication selection bias on meta-analyses in medicine, environmental sciences, psychology, and economics.

Bartos, Frantisek; Maier, Maximilian; Wagenmakers, Eric-Jan; Nippold, Franziska; Doucouliagos, Hristos; Ioannidis, John P A; Otte, Willem M; Sladekova, Martina; Deresssa, Teshome K; Bruns, Stephan B; Fanelli, Daniele; Stanley, T D.

Res Synth Methods ; 15(3): 500-511, 2024 May.

Article in English | MEDLINE | ID: mdl-38327122

ABSTRACT

Publication selection bias undermines the systematic accumulation of evidence. To assess the extent of this problem, we survey over 68,000 meta-analyses containing over 700,000 effect size estimates from medicine (67,386/597,699), environmental sciences (199/12,707), psychology (605/23,563), and economics (327/91,421). Our results indicate that meta-analyses in economics are the most severely contaminated by publication selection bias, closely followed by meta-analyses in environmental sciences and psychology, whereas meta-analyses in medicine are contaminated the least. After adjusting for publication selection bias, the median probability of the presence of an effect decreased from 99.9% to 29.7% in economics, from 98.9% to 55.7% in psychology, from 99.8% to 70.7% in environmental sciences, and from 38.0% to 29.7% in medicine. The median absolute effect sizes (in terms of standardized mean differences) decreased from d = 0.20 to d = 0.07 in economics, from d = 0.37 to d = 0.26 in psychology, from d = 0.62 to d = 0.43 in environmental sciences, and from d = 0.24 to d = 0.13 in medicine.

Subject(s)

Economics , Meta-Analysis as Topic , Psychology , Publication Bias , Humans , Ecology , Research Design , Selection Bias , Probability , Medicine

5.

Interim design analysis using Bayes factor forecasts.

Stefan, Angelika M; Gronau, Quentin F; Wagenmakers, Eric-Jan.

Psychol Methods ; 2024 Feb 08.

Article in English | MEDLINE | ID: mdl-38330340

ABSTRACT

A fundamental part of experimental design is to determine the sample size of a study. However, sparse information about population parameters and effect sizes before data collection renders effective sample size planning challenging. Specifically, sparse information may lead research designs to be based on inaccurate a priori assumptions, causing studies to use resources inefficiently or to produce inconclusive results. Despite its deleterious impact on sample size planning, many prominent methods for experimental design fail to adequately address the challenge of sparse a-priori information. Here we propose a Bayesian Monte Carlo methodology for interim design analyses that allows researchers to analyze and adapt their sampling plans throughout the course of a study. At any point in time, the methodology uses the best available knowledge about parameters to make projections about expected evidence trajectories. Two simulated application examples demonstrate how interim design analyses can be integrated into common designs to inform sampling plans on the fly. The proposed methodology addresses the problem of sample size planning with sparse a-priori information and yields research designs that are efficient, informative, and flexible. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

6.

Exploring open science practices in behavioural public policy research.

Maier, Maximilian; Bartos, Frantisek; Raihani, Nichola; Shanks, David R; Stanley, T D; Wagenmakers, Eric-Jan; Harris, Adam J L.

R Soc Open Sci ; 11(2): 231486, 2024 Feb.

Article in English | MEDLINE | ID: mdl-38384774

ABSTRACT

In their book 'Nudge: Improving Decisions About Health, Wealth and Happiness', Thaler & Sunstein (2009) argue that choice architectures are promising public policy interventions. This research programme motivated the creation of 'nudge units', government agencies which aim to apply insights from behavioural science to improve public policy. We closely examine a meta-analysis of the evidence gathered by two of the largest and most influential nudge units (DellaVigna & Linos (2022 Econometrica 90, 81-116 (doi:10.3982/ECTA18709))) and use statistical techniques to detect reporting biases. Our analysis shows evidence suggestive of selective reporting. We additionally evaluate the public pre-analysis plans from one of the two nudge units (Office of Evaluation Sciences). We identify several instances of excellent practice; however, we also find that the analysis plans and reporting often lack sufficient detail to evaluate (unintentional) reporting biases. We highlight several improvements that would enhance the effectiveness of the pre-analysis plans and reports as a means to combat reporting biases. Our findings and suggestions can further improve the evidence base for policy decisions.

7.

"This behavior strikes us as ideal": assessment and anticipations of Huisman (2022).

Sarafoglou, Alexandra; Bartos, Frantisek; Stefan, Angelika; Haaf, Julia M; Wagenmakers, Eric-Jan.

Psychon Bull Rev ; 31(1): 242-248, 2024 Feb.

Article in English | MEDLINE | ID: mdl-37542014

ABSTRACT

Huisman (Psychonomic Bulletin & Review, 1-10. 2022) argued that a valid measure of evidence should indicate more support in favor of a true alternative hypothesis when sample size is large than when it is small. Bayes factors may violate this pattern and hence Huisman concluded that Bayes factors are invalid as a measure of evidence. In this brief comment we call attention to the following: (1) Huisman's purported anomaly is in fact dictated by probability theory; (2) Huisman's anomaly has been discussed and explained in the statistical literature since 1939; the anomaly was also highlighted in the Psychonomic Bulletin & Review article by Rouder et al. (2009), who interpreted the anomaly as "ideal": an interpretation diametrically opposed to that of Huisman. We conclude that when intuition clashes with probability theory, chances are that it is intuition that needs schooling.

Subject(s)

Bayes Theorem , Humans , Probability , Sample Size

8.

A tutorial on Bayesian model-averaged meta-analysis in JASP.

Berkhout, Sophie W; Haaf, Julia M; Gronau, Quentin F; Heck, Daniel W; Wagenmakers, Eric-Jan.

Behav Res Methods ; 56(3): 1260-1282, 2024 Mar.

Article in English | MEDLINE | ID: mdl-37099263

ABSTRACT

Researchers conduct meta-analyses in order to synthesize information across different studies. Compared to standard meta-analytic methods, Bayesian model-averaged meta-analysis offers several practical advantages including the ability to quantify evidence in favor of the absence of an effect, the ability to monitor evidence as individual studies accumulate indefinitely, and the ability to draw inferences based on multiple models simultaneously. This tutorial introduces the concepts and logic underlying Bayesian model-averaged meta-analysis and illustrates its application using the open-source software JASP. As a running example, we perform a Bayesian meta-analysis on language development in children. We show how to conduct a Bayesian model-averaged meta-analysis and how to interpret the results.

Subject(s)

Research Design , Software , Child , Humans , Bayes Theorem

9.

Meta-analyses in psychology often overestimate evidence for and size of effects.

Bartos, Frantisek; Maier, Maximilian; Shanks, David R; Stanley, T D; Sladekova, Martina; Wagenmakers, Eric-Jan.

R Soc Open Sci ; 10(7): 230224, 2023 Jul.

Article in English | MEDLINE | ID: mdl-37416830

ABSTRACT

Adjusting for publication bias is essential when drawing meta-analytic inferences. However, most methods that adjust for publication bias do not perform well across a range of research conditions, such as the degree of heterogeneity in effect sizes across studies. Sladekova et al. 2022 (Estimating the change in meta-analytic effect size estimates after the application of publication bias adjustment methods. Psychol. Methods) tried to circumvent this complication by selecting the methods that are most appropriate for a given set of conditions, and concluded that publication bias on average causes only minimal over-estimation of effect sizes in psychology. However, this approach suffers from a 'Catch-22' problem-to know the underlying research conditions, one needs to have adjusted for publication bias correctly, but to correctly adjust for publication bias, one needs to know the underlying research conditions. To alleviate this problem, we conduct an alternative analysis, robust Bayesian meta-analysis (RoBMA), which is not based on model-selection but on model-averaging. In RoBMA, models that predict the observed results better are given correspondingly larger weights. A RoBMA reanalysis of Sladekova et al.'s dataset reveals that more than 60% of meta-analyses in psychology notably overestimate the evidence for the presence of the meta-analytic effect and more than 50% overestimate its magnitude.

10.

Multibridge: an R package to evaluate informed hypotheses in binomial and multinomial models.

Sarafoglou, Alexandra; Aust, Frederik; Marsman, Maarten; Bartos, Frantisek; Wagenmakers, Eric-Jan; Haaf, Julia M.

Behav Res Methods ; 55(8): 4343-4368, 2023 12.

Article in English | MEDLINE | ID: mdl-37277644

ABSTRACT

The multibridge R package allows a Bayesian evaluation of informed hypotheses [Formula: see text] applied to frequency data from an independent binomial or multinomial distribution. multibridge uses bridge sampling to efficiently compute Bayes factors for the following hypotheses concerning the latent category proportions ð: (a) hypotheses that postulate equality constraints (e.g., ð1 = ð2 = ð3); (b) hypotheses that postulate inequality constraints (e.g., ð1 < ð2 < ð3 or ð1 > ð2 > ð3); (c) hypotheses that postulate combinations of inequality constraints and equality constraints (e.g., ð1 < ð2 = ð3); and (d) hypotheses that postulate combinations of (a)-(c) (e.g., ð1 < (ð2 = ð3),ð4). Any informed hypothesis [Formula: see text] may be compared against the encompassing hypothesis [Formula: see text] that all category proportions vary freely, or against the null hypothesis [Formula: see text] that all category proportions are equal. multibridge facilitates the fast and accurate comparison of large models with many constraints and models for which relatively little posterior mass falls in the restricted parameter space. This paper describes the underlying methodology and illustrates the use of multibridge through fully reproducible examples.

Subject(s)

Bayes Theorem , Humans , Statistical Distributions

11.

Inclusion Bayes factors for mixed hierarchical diffusion decision models.

Boehm, Udo; Evans, Nathan J; Gronau, Quentin F; Matzke, Dora; Wagenmakers, Eric-Jan; Heathcote, Andrew J.

Psychol Methods ; 2023 May 11.

Article in English | MEDLINE | ID: mdl-37166854

ABSTRACT

Cognitive models provide a substantively meaningful quantitative description of latent cognitive processes. The quantitative formulation of these models supports cumulative theory building and enables strong empirical tests. However, the nonlinearity of these models and pervasive correlations among model parameters pose special challenges when applying cognitive models to data. Firstly, estimating cognitive models typically requires large hierarchical data sets that need to be accommodated by an appropriate statistical structure within the model. Secondly, statistical inference needs to appropriately account for model uncertainty to avoid overconfidence and biased parameter estimates. In the present work, we show how these challenges can be addressed through a combination of Bayesian hierarchical modeling and Bayesian model averaging. To illustrate these techniques, we apply the popular diffusion decision model to data from a collaborative selective influence study. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

12.

Is the biggest challenge to scientific thinking science itself?

Wagenmakers, Eric-Jan.

Nature ; 617(7962): 669-670, 2023 05.

Article in English | MEDLINE | ID: mdl-37217667

Subject(s)

Thinking

13.

Bayes Factors for Mixed Models: Perspective on Responses.

van Doorn, Johnny; Aust, Frederik; Haaf, Julia M; Stefan, Angelika M; Wagenmakers, Eric-Jan.

Comput Brain Behav ; 6(1): 127-139, 2023.

Article in English | MEDLINE | ID: mdl-36879767

ABSTRACT

In van Doorn et al. (2021), we outlined a series of open questions concerning Bayes factors for mixed effects model comparison, with an emphasis on the impact of aggregation, the effect of measurement error, the choice of prior distributions, and the detection of interactions. Seven expert commentaries (partially) addressed these initial questions. Surprisingly perhaps, the experts disagreed (often strongly) on what is best practice-a testament to the intricacy of conducting a mixed effect model comparison. Here, we provide our perspective on these comments and highlight topics that warrant further discussion. In general, we agree with many of the commentaries that in order to take full advantage of Bayesian mixed model comparison, it is important to be aware of the specific assumptions that underlie the to-be-compared models.

14.

Reducing bias, increasing transparency and calibrating confidence with preregistration.

Hardwicke, Tom E; Wagenmakers, Eric-Jan.

Nat Hum Behav ; 7(1): 15-26, 2023 01.

Article in English | MEDLINE | ID: mdl-36707644

ABSTRACT

Flexibility in the design, analysis and interpretation of scientific studies creates a multiplicity of possible research outcomes. Scientists are granted considerable latitude to selectively use and report the hypotheses, variables and analyses that create the most positive, coherent and attractive story while suppressing those that are negative or inconvenient. This creates a risk of bias that can lead to scientists fooling themselves and fooling others. Preregistration involves declaring a research plan (for example, hypotheses, design and statistical analyses) in a public registry before the research outcomes are known. Preregistration (1) reduces the risk of bias by encouraging outcome-independent decision-making and (2) increases transparency, enabling others to assess the risk of bias and calibrate their confidence in research outcomes. In this Perspective, we briefly review the historical evolution of preregistration in medicine, psychology and other domains, clarify its pragmatic functions, discuss relevant meta-research, and provide recommendations for scientists and journal editors.

Subject(s)

Mental Processes , Research Design , Humans , Registries

15.

A tutorial on Bayesian single-test reliability analysis with JASP.

Pfadt, Julius M; Bergh, Don van den; Sijtsma, Klaas; Wagenmakers, Eric-Jan.

Behav Res Methods ; 55(3): 1069-1078, 2023 04.

Article in English | MEDLINE | ID: mdl-35581436

ABSTRACT

The current practice of reliability analysis is both uniform and troublesome: most reports consider only Cronbach's α, and almost all reports focus exclusively on a point estimate, disregarding the impact of sampling error. In an attempt to improve the status quo we have implemented Bayesian estimation routines for five popular single-test reliability coefficients in the open-source statistical software program JASP. Using JASP, researchers can easily obtain Bayesian credible intervals to indicate a range of plausible values and thereby quantify the precision of the point estimate. In addition, researchers may use the posterior distribution of the reliability coefficients to address practically relevant questions such as "What is the probability that the reliability of my test is larger than a threshold value of .80?". In this tutorial article, we outline how to conduct a Bayesian reliability analysis in JASP and correctly interpret the results. By making available a computationally complex procedure in an easy-to-use software package, we hope to motivate researchers to include uncertainty estimates whenever reporting the results of a single-test reliability analysis.

Subject(s)

Software , Humans , Bayes Theorem , Reproducibility of Results , Uncertainty

16.

Robust Bayesian meta-analysis: Addressing publication bias with model-averaging.

Maier, Maximilian; Bartos, Frantisek; Wagenmakers, Eric-Jan.

Psychol Methods ; 28(1): 107-122, 2023 Feb.

Article in English | MEDLINE | ID: mdl-35588075

ABSTRACT

Meta-analysis is an important quantitative tool for cumulative science, but its application is frustrated by publication bias. In order to test and adjust for publication bias, we extend model-averaged Bayesian meta-analysis with selection models. The resulting robust Bayesian meta-analysis (RoBMA) methodology does not require all-or-none decisions about the presence of publication bias, can quantify evidence in favor of the absence of publication bias, and performs well under high heterogeneity. By model-averaging over a set of 12 models, RoBMA is relatively robust to model misspecification and simulations show that it outperforms existing methods. We demonstrate that RoBMA finds evidence for the absence of publication bias in Registered Replication Reports and reliably avoids false positives. We provide an implementation in R so that researchers can easily use the new methodology in practice. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

Subject(s)

Publication Bias , Humans , Bayes Theorem

17.

A review of applications of the Bayes factor in psychological research.

Heck, Daniel W; Boehm, Udo; Böing-Messing, Florian; Bürkner, Paul-Christian; Derks, Koen; Dienes, Zoltan; Fu, Qianrao; Gu, Xin; Karimova, Diana; Kiers, Henk A L; Klugkist, Irene; Kuiper, Rebecca M; Lee, Michael D; Leenders, Roger; Leplaa, Hidde J; Linde, Maximilian; Ly, Alexander; Meijerink-Bosman, Marlyne; Moerbeek, Mirjam; Mulder, Joris; Palfi, Bence; Schönbrodt, Felix D; Tendeiro, Jorge N; van den Bergh, Don; Van Lissa, Caspar J; van Ravenzwaaij, Don; Vanpaemel, Wolf; Wagenmakers, Eric-Jan; Williams, Donald R; Zondervan-Zwijnenburg, Mariëlle; Hoijtink, Herbert.

Psychol Methods ; 28(3): 558-579, 2023 Jun.

Article in English | MEDLINE | ID: mdl-35298215

ABSTRACT

The last 25 years have shown a steady increase in attention for the Bayes factor as a tool for hypothesis evaluation and model selection. The present review highlights the potential of the Bayes factor in psychological research. We discuss six types of applications: Bayesian evaluation of point null, interval, and informative hypotheses, Bayesian evidence synthesis, Bayesian variable selection and model averaging, and Bayesian evaluation of cognitive models. We elaborate what each application entails, give illustrative examples, and provide an overview of key references and software with links to other applications. The article is concluded with a discussion of the opportunities and pitfalls of Bayes factor applications and a sketch of corresponding future research lines. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

Subject(s)

Bayes Theorem , Behavioral Research , Psychology , Humans , Behavioral Research/methods , Psychology/methods , Software , Research Design

18.

The Benefits, Barriers, and Risks of Big-Team Science.

Forscher, Patrick S; Wagenmakers, Eric-Jan; Coles, Nicholas A; Silan, Miguel Alejandro; Dutra, Natália; Basnight-Brown, Dana; IJzerman, Hans.

Perspect Psychol Sci ; 18(3): 607-623, 2023 05.

Article in English | MEDLINE | ID: mdl-36190899

ABSTRACT

Progress in psychology has been frustrated by challenges concerning replicability, generalizability, strategy selection, inferential reproducibility, and computational reproducibility. Although often discussed separately, these five challenges may share a common cause: insufficient investment of intellectual and nonintellectual resources into the typical psychology study. We suggest that the emerging emphasis on big-team science can help address these challenges by allowing researchers to pool their resources together to increase the amount available for a single study. However, the current incentives, infrastructure, and institutions in academic science have all developed under the assumption that science is conducted by solo principal investigators and their dependent trainees, an assumption that creates barriers to sustainable big-team science. We also anticipate that big-team science carries unique risks, such as the potential for big-team-science organizations to be co-opted by unaccountable leaders, become overly conservative, and make mistakes at a grand scale. Big-team-science organizations must also acquire personnel who are properly compensated and have clear roles. Not doing so raises risks related to mismanagement and a lack of financial sustainability. If researchers can manage its unique barriers and risks, big-team science has the potential to spur great progress in psychology and beyond.

Subject(s)

Interdisciplinary Research , Humans , Reproducibility of Results

19.

Robust Bayesian meta-analysis: Model-averaging across complementary publication bias adjustment methods.

Bartos, Frantisek; Maier, Maximilian; Wagenmakers, Eric-Jan; Doucouliagos, Hristos; Stanley, T D.

Res Synth Methods ; 14(1): 99-116, 2023 Jan.

Article in English | MEDLINE | ID: mdl-35869696

ABSTRACT

Publication bias is a ubiquitous threat to the validity of meta-analysis and the accumulation of scientific evidence. In order to estimate and counteract the impact of publication bias, multiple methods have been developed; however, recent simulation studies have shown the methods' performance to depend on the true data generating process, and no method consistently outperforms the others across a wide range of conditions. Unfortunately, when different methods lead to contradicting conclusions, researchers can choose those methods that lead to a desired outcome. To avoid the condition-dependent, all-or-none choice between competing methods and conflicting results, we extend robust Bayesian meta-analysis and model-average across two prominent approaches of adjusting for publication bias: (1) selection models of p-values and (2) models adjusting for small-study effects. The resulting model ensemble weights the estimates and the evidence for the absence/presence of the effect from the competing approaches with the support they receive from the data. Applications, simulations, and comparisons to preregistered, multi-lab replications demonstrate the benefits of Bayesian model-averaging of complementary publication bias adjustment methods.

Subject(s)

Models, Statistical , Bayes Theorem , Publication Bias , Computer Simulation , Bias

20.

A Bayesian perspective on severity: risky predictions and specific hypotheses.

van Dongen, Noah; Sprenger, Jan; Wagenmakers, Eric-Jan.

Psychon Bull Rev ; 30(2): 516-533, 2023 Apr.

Article in English | MEDLINE | ID: mdl-35969359

ABSTRACT

A tradition that goes back to Sir Karl R. Popper assesses the value of a statistical test primarily by its severity: was there an honest and stringent attempt to prove the tested hypothesis wrong? For "error statisticians" such as Mayo (1996, 2018), and frequentists more generally, severity is a key virtue in hypothesis tests. Conversely, failure to incorporate severity into statistical inference, as allegedly happens in Bayesian inference, counts as a major methodological shortcoming. Our paper pursues a double goal: First, we argue that the error-statistical explication of severity has substantive drawbacks; specifically, the neglect of research context and the specificity of the predictions of the hypothesis. Second, we argue that severity matters for Bayesian inference via the value of specific, risky predictions: severity boosts the expected evidential value of a Bayesian hypothesis test. We illustrate severity-based reasoning in Bayesian statistics by means of a practical example and discuss its advantages and potential drawbacks.

Subject(s)

Bayes Theorem , Humans

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL