-
45NewPerspectiveson (SomeOld) Problems of Frequentist StatisticsIn Deborah G. Mayo & Aris Spanos (eds.), Error and Inference: Recent Exchanges on Experimental Reasoning, Reliability, and the Objectivity and Rationality of Science, Cambridge University Press. pp. 247. 2009.
-
110Frequentist statistics as a theory of inductive inferenceIn Deborah G. Mayo & Aris Spanos (eds.), Error and Inference: Recent Exchanges on Experimental Reasoning, Reliability, and the Objectivity and Rationality of Science, Cambridge University Press. 2009.After some general remarks about the interrelation between philosophical and statistical thinking, the discussion centres largely on significance tests. These are defined as the calculation of p-values rather than as formal procedures for ‘acceptance‘ and ‘rejection‘. A number of types of null hypothesis are described and a principle for evidential interpretation set out governing the implications of p- values in the specific circumstances of each application, as contrasted with a long-run inter…Read more
-
77Objectivity and conditionality in frequentist inferenceIn Deborah G. Mayo & Aris Spanos (eds.), Error and Inference: Recent Exchanges on Experimental Reasoning, Reliability, and the Objectivity and Rationality of Science, Cambridge University Press. pp. 276. 2009.
-
181Ontology & MethodologySynthese 192 (11): 3413-3423. 2015.Philosophers of science have long been concerned with the question of what a given scientific theory tells us about the contents of the world, but relatively little attention has been paid to how we set out to build theories and to the relevance of pre-theoretical methodology on a theory’s interpretation. In the traditional view, the form and content of a mature theory can be separated from any tentative ontological assumptions that went into its development. For this reason, the target of inter…Read more
-
71Severe Testing: Error Statistics versus Bayes Factor TestsBritish Journal for the Philosophy of Science. forthcoming.
-
193Duhem's problem, the bayesian way, and error statistics, or "what's belief got to do with it?"Philosophy of Science 64 (2): 222-244. 1997.I argue that the Bayesian Way of reconstructing Duhem's problem fails to advance a solution to the problem of which of a group of hypotheses ought to be rejected or "blamed" when experiment disagrees with prediction. But scientists do regularly tackle and often enough solve Duhemian problems. When they do, they employ a logic and methodology which may be called error statistics. I discuss the key properties of this approach which enable it to split off the task of testing auxiliary hypotheses fr…Read more
-
'Peirce-pectives' on Metaphysics and the SciencesTransactions of the Charles S. Peirce Society 41 (2): 237-365. 2005.
-
33Science, Error Statistics, and Arguing from ErrorPoznan Studies in the Philosophy of the Sciences and the Humanities 71 95-111. 2000.
-
21Toward a More Objective Understanding of the Evidence of Carcinogenic RiskPSA Proceedings of the Biennial Meeting of the Philosophy of Science Association 1988 (2): 489-503. 1988.The field of quantified risk assessment is a new field, only about 20 years old, and already it is considered to be in a crisis. As Funtowicz and J.R. Ravetz (1985) put it:The concept of risk in terms of probability has proved to be so elusive, and statistical inference so problematic, that many experts in the field have recently either lost hope of finding a scientific solution or lost faith in Risk Analysis as a tool for decisionmaking. (p.219)Thus the ‘art’ of the assessment of risks… is at a…Read more
-
44Error and the Growth of Experimental KnowledgeUniversity of Chicago. 1996.This text provides a critique of the subjective Bayesian view of statistical inference, and proposes the author's own error-statistical approach as an alternative framework for the epistemology of experiment. It seeks to address the needs of researchers who work with statistical analysis.
-
22Cartwright, Causality, and CoincidencePSA Proceedings of the Biennial Meeting of the Philosophy of Science Association 1986 (1): 42-58. 1986.In How the Laws of Physics Lie (1983)2 Cartwright argues for being a realist about theoretical entities but non-realist about theoretical laws. Her reason for this distinction is that only the former involves causal explanation, and accepting causal explanations commits us to the existence of the causal entity invoked. “What is special about explanation by theoretical entity is that it is causal explanation, and existence is an internal characteristic of causal claims. There is nothing similar f…Read more
-
62Error, tests and theory confirmationIn Deborah G. Mayo & Aris Spanos (eds.), Error and Inference: Recent Exchanges on Experimental Reasoning, Reliability, and the Objectivity and Rationality of Science, Cambridge University Press. pp. 125-154. 2009.
-
123Statistical significance and its critics: practicing damaging science, or damaging scientific practice?Synthese 200 (3): 1-33. 2022.While the common procedure of statistical significance testing and its accompanying concept of p-values have long been surrounded by controversy, renewed concern has been triggered by the replication crisis in science. Many blame statistical significance tests themselves, and some regard them as sufficiently damaging to scientific practice as to warrant being abandoned. We take a contrary position, arguing that the central criticisms arise from misunderstanding and misusing the statistical tools…Read more
-
94Some surprising facts about surprising factsStudies in History and Philosophy of Science Part A 45 79-86. 2014.A common intuition about evidence is that if data x have been used to construct a hypothesis H, then x should not be used again in support of H. It is no surprise that x fits H, if H was deliberately constructed to accord with x. The question of when and why we should avoid such “double-counting” continues to be debated in philosophy and statistics. It arises as a prohibition against data mining, hunting for significance, tuning on the signal, and ad hoc hypotheses, and as a preference for prede…Read more
-
278How to discount double-counting when it counts: Some clarificationsBritish Journal for the Philosophy of Science 59 (4): 857-879. 2008.The issues of double-counting, use-constructing, and selection effects have long been the subject of debate in the philosophical as well as statistical literature. I have argued that it is the severity, stringency, or probativeness of the test—or lack of it—that should determine if a double-use of data is admissible. Hitchcock and Sober ([2004]) question whether this severity criterion' can perform its intended job. I argue that their criticisms stem from a flawed interpretation of the severity…Read more
-
88Introduction to recent issues in philosophy of statistics: evidence, testing, and applicationsSynthese 201 (4): 1-5. 2023.
-
55Increasing Public Participation in Controversies Involving Hazards: The Value of Metastatistical RulesScience, Technology and Human Values 10 (4): 55-65. 1985.
-
69Causal Modeling, Explanation and Severe TestingIn Deborah G. Mayo & Aris Spanos (eds.), Error and Inference: Recent Exchanges on Experimental Reasoning, Reliability, and the Objectivity and Rationality of Science, Cambridge University Press. pp. 331-375. 2009.
-
108Significance Tests: Vitiated or Vindicated by the Replication Crisis in Psychology?Review of Philosophy and Psychology 12 (1): 101-120. 2020.The crisis of replication has led many to blame statistical significance tests for making it too easy to find impressive looking effects that do not replicate. However, the very fact it becomes difficult to replicate effects when features of the tests are tied down actually serves to vindicate statistical significance tests. While statistical significance tests, used correctly, serve to bound the probabilities of erroneous interpretations of data, this error control is nullified by data-dredging…Read more
-
297Methodology in Practice: Statistical Misspecification TestingPhilosophy of Science 71 (5): 1007-1025. 2004.The growing availability of computer power and statistical software has greatly increased the ease with which practitioners apply statistical methods, but this has not been accompanied by attention to checking the assumptions on which these methods are based. At the same time, disagreements about inferences based on statistical research frequently revolve around whether the assumptions are actually met in the studies available, e.g., in psychology, ecology, biology, risk assessment. Philosophica…Read more
-
49Acceptable Evidence (edited book)Oxford University Press USA. 1994.Discussions of science and values in risk management have largely focused on how values enter into arguments about risks, that is, issues of acceptable risk. Instead this volume concentrates on how values enter into collecting, interpreting, communicating, and evaluating the evidence of risks, that is, issues of the acceptability of evidence of risk. By focusing on acceptable evidence, this volume avoids two barriers to progress. One barrier assumes that evidence of risk is largely a matter of o…Read more
-
172Some methodological issues in experimental economicsPhilosophy of Science 75 (5): 633-645. 2008.The growing acceptance and success of experimental economics has increased the interest of researchers in tackling philosophical and methodological challenges to which their work increasingly gives rise. I sketch some general issues that call for the combined expertise of experimental economists and philosophers of science, of experiment, and of inductive‐statistical inference and modeling. †To contact the author, please write to: 235 Major Williams, Virginia Tech, Blacksburg, VA 24061‐0126; e‐m…Read more
-
649Severe testing as a basic concept in a neyman–pearson philosophy of inductionBritish Journal for the Philosophy of Science 57 (2): 323-357. 2006.Despite the widespread use of key concepts of the Neyman–Pearson (N–P) statistical paradigm—type I and II errors, significance levels, power, confidence levels—they have been the subject of philosophical controversy and debate for over 60 years. Both current and long-standing problems of N–P tests stem from unclarity and confusion, even among N–P adherents, as to how a test's (pre-data) error probabilities are to be used for (post-data) inductive inference as opposed to inductive behavior. We ar…Read more
-
142Error and the Growth of Experimental KnowledgePhilosophical Review 107 (2): 324. 1998.Once upon a time, logic was the philosopher’s tool for analyzing scientific reasoning. Nowadays, probability and statistics have largely replaced logic, and their most popular application—Bayesianism—has replaced the qualitative deductive relationship between a hypothesis h and evidence e with a quantitative measure of h’s probability in light of e.
-
53Principles of inference and their consequencesIn David Corfield & Jon Williamson (eds.), Foundations of Bayesianism, Kluwer Academic Publishers. pp. 381--403. 2001.
-
190In defense of the Neyman-Pearson theory of confidence intervalsPhilosophy of Science 48 (2): 269-280. 1981.In Philosophical Problems of Statistical Inference, Seidenfeld argues that the Neyman-Pearson (NP) theory of confidence intervals is inadequate for a theory of inductive inference because, for a given situation, the 'best' NP confidence interval, [CIλ], sometimes yields intervals which are trivial (i.e., tautologous). I argue that (1) Seidenfeld's criticism of trivial intervals is based upon illegitimately interpreting confidence levels as measures of final precision; (2) for the situation which…Read more
-
35Explanation and testing exchanges with Clark GlymourIn Deborah G. Mayo & Aris Spanos (eds.), Error and Inference: Recent Exchanges on Experimental Reasoning, Reliability, and the Objectivity and Rationality of Science, Cambridge University Press. pp. 351. 2009.
-
180The New Experimentalism, Topical Hypotheses, and Learning from ErrorPSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association 1994 270-279. 1994.An important theme to have emerged from the new experimentalist movement is that much of actual scientific practice deals not with appraising full-blown theories but with the manifold local tasks required to arrive at data, distinguish fact from artifact, and estimate backgrounds. Still, no program for working out a philosophy of experiment based on this recognition has been demarcated. I suggest why the new experimentalism has come up short, and propose a remedy appealing to the practice of sta…Read more
-
387Behavioristic, evidentialist, and learning models of statistical testingPhilosophy of Science 52 (4): 493-516. 1985.While orthodox (Neyman-Pearson) statistical tests enjoy widespread use in science, the philosophical controversy over their appropriateness for obtaining scientific knowledge remains unresolved. I shall suggest an explanation and a resolution of this controversy. The source of the controversy, I argue, is that orthodox tests are typically interpreted as rules for making optimal decisions as to how to behave--where optimality is measured by the frequency of errors the test would commit in a long …Read more
Blacksburg, Virginia, United States of America