TY - JOUR
T1 - Statistics in the Service of Science
T2 - Don’t Let the Tail Wag the Dog
AU - Singmann, Henrik
AU - Kellen, David
AU - Cox, Gregory E.
AU - Chandramouli, Suyog H.
AU - Davis-Stober, Clintin P.
AU - Dunn, John C.
AU - Gronau, Quentin F.
AU - Kalish, Michael L.
AU - McMullin, Sara D.
AU - Navarro, Danielle J.
AU - Shiffrin, Richard M.
PY - 2023/3
Y1 - 2023/3
N2 - Statistical modeling is generally meant to describe patterns in data in service of the broader scientific goal of developing theories to explain those patterns. Statistical models support meaningful inferences when models are built so as to align parameters of the model with potential causal mechanisms and how they manifest in data. When statistical models are instead based on assumptions chosen by default, attempts to draw inferences can be uninformative or even paradoxical—in essence, the tail is trying to wag the dog. These issues are illustrated by van Doorn et al. (this issue) in the context of using Bayes Factors to identify effects and interactions in linear mixed models. We show that the problems identified in their applications (along with other problems identified here) can be circumvented by using priors over inherently meaningful units instead of default priors on standardized scales. This case study illustrates how researchers must directly engage with a number of substantive issues in order to support meaningful inferences, of which we highlight two: The first is the problem of coordination, which requires a researcher to specify how the theoretical constructs postulated by a model are functionally related to observable variables. The second is the problem of generalization, which requires a researcher to consider how a model may represent theoretical constructs shared across similar but non-identical situations, along with the fact that model comparison metrics like Bayes Factors do not directly address this form of generalization. For statistical modeling to serve the goals of science, models cannot be based on default assumptions, but should instead be based on an understanding of their coordination function and on how they represent causal mechanisms that may be expected to generalize to other related scenarios.
AB - Statistical modeling is generally meant to describe patterns in data in service of the broader scientific goal of developing theories to explain those patterns. Statistical models support meaningful inferences when models are built so as to align parameters of the model with potential causal mechanisms and how they manifest in data. When statistical models are instead based on assumptions chosen by default, attempts to draw inferences can be uninformative or even paradoxical—in essence, the tail is trying to wag the dog. These issues are illustrated by van Doorn et al. (this issue) in the context of using Bayes Factors to identify effects and interactions in linear mixed models. We show that the problems identified in their applications (along with other problems identified here) can be circumvented by using priors over inherently meaningful units instead of default priors on standardized scales. This case study illustrates how researchers must directly engage with a number of substantive issues in order to support meaningful inferences, of which we highlight two: The first is the problem of coordination, which requires a researcher to specify how the theoretical constructs postulated by a model are functionally related to observable variables. The second is the problem of generalization, which requires a researcher to consider how a model may represent theoretical constructs shared across similar but non-identical situations, along with the fact that model comparison metrics like Bayes Factors do not directly address this form of generalization. For statistical modeling to serve the goals of science, models cannot be based on default assumptions, but should instead be based on an understanding of their coordination function and on how they represent causal mechanisms that may be expected to generalize to other related scenarios.
KW - Coordination function
KW - Default Bayes Factors
KW - Generalizability
KW - Mixed models
KW - Standardized effect sizes
KW - Statistics
UR - http://www.scopus.com/inward/record.url?scp=85130141439&partnerID=8YFLogxK
U2 - 10.1007/s42113-022-00129-2
DO - 10.1007/s42113-022-00129-2
M3 - Article
AN - SCOPUS:85130141439
SN - 2522-0861
VL - 6
SP - 64
EP - 83
JO - Computational Brain and Behavior
JF - Computational Brain and Behavior
IS - 1
ER -