Pittsburgh, Pennsylvania, United States of America
  •  28
    We study the identifiability and estimation of functional causal models under selection bias, with a focus on the situation where the selection depends solely on the effect variable, which is known as outcome-dependent selection. We address two questions of identifiability: the identifiability of the causal direction between two variables in the presence of selection bias, and, given the causal direction, the identifiability of the model with outcome-dependent selection. Regarding the first, we …Read more
  •  13
    In recent papers we have described a framework for inferring causal structure from relations of statistical independence among a set of measured variables. Using Pearl's notion of the perfect representation of a set of independence relations by a directed acyclic graph we proved..
  •  11
    Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
    with J. Pearl, F. Bacchus, P. Spirtes, and R. Scheines
    Synthese 104 (1): 161-176. 1988.
  •  55
    Clark Glymour, Richard Scheines, Peter Spirtes, and Christopher Meek. Regression and Causation
  •  34
    Peter Spirtes, Clark Glymour, Richard Scheines, Christopher Meek, S. Fineberg, E. Slate. Prediction and Experimental Design with Graphical Causal Models
  •  42
    An Evaluation of Machine-Learning Methods for Predicting Pneumonia Mortality
    with Gregory F. Cooper, Constantin F. Aliferis, Richard Ambrosino, John Aronis, Bruce G. Buchanon, Richard Caruana, Michael J. Fine, Geoffrey Gordon, Barbara H. Hanusa, Janine E. Janosky, Christopher Meek, Tom Mitchell, Thomas Richardson, and Peter Spirtes
    This paper describes the application of eight statistical and machine-learning methods to derive computer models for predicting mortality of hospital patients with pneumonia from their findings at initial presentation. The eight models were each constructed based on 9847 patient cases and they were each evaluated on 4352 additional cases. The primary evaluation metric was the error in predicted survival as a function of the fraction of patients predicted to survive. This metric is useful in asse…Read more
  •  9
    Peter Spirtes, Richard Scheines and Clark Glymour. Simulated Studies of the Reliability of Computer-Aided Model Specification Using the TETRAD, EQS and LISREL Programs
  •  25
    Analysis of Microarray Data for Treated Fat Cells
    with Nicoleta Serban, Larry Wasserman, David Peters, Peter Spirtes, Robert O'Doherty, Daniel Handley, and Richard Scheines
    DNA microarrays are perfectly suited for comparing gene expression in different populations of cells. An important application of microarray techniques is identifying genes which are activated by a particular drug of interest. This process will allow biologists to identify therapies targeted to particular diseases, and, eventually, to gain more knowledge about the biological processes in organisms. Such an application is described in this paper. It is focused on diabetes and obesity, which is a …Read more
  •  9
    The ability to identify the mineral composition of rocks and softs is an important tool for the exploration of geological sites. For instance, NASA intends to design robots that are sufficiently autonomous to perform this task on planetary missions. Spectrometer readings provide one important source of data for identifying sites with minerals of interest. Reflectance spectrometers measure intensities of light reflected from surfaces over a range of wavelengths. Spectral intensity patterns may in…Read more
  •  12
    Peter Spirtes and Clark Glymour. Casual Structure Among Measured Variables Preserved with Unmeasured Variables
  •  17
    Cartwright, N. 42
    with L. J. Cohen, R. G. Collingwood, R. Colodny, R. Giere, E. M. Gold, R. Goldblatt, and W. Goldfarb
    In Wolfgang Balzer & Carles Ulises Moulines (eds.), Structuralist theory of science: focal issues, new results, Walter De Gruyter. pp. 287. 1996.
  •  82
    Data analysis that merely fits an empirical covariance matrix or that finds the best least squares linear estimator of a variable is not of itself a reliable guide to judgements about policy, which inevitably involve causal conclusions. The policy implications of empirical data can be completely reversed by alternative hypotheses about the causal relations of variables, and the estimates of a particular causal influence can be radically altered by changes in the assumptions made about other depe…Read more
  •  41
    Researchers routinely face the problem of inferring causal relationships from large amounts of data, sometimes involving hundreds of variables. Often, it is the causal relationships between "latent" (unmeasured) variables that are of primary interest. The problem is how causal relationships between unmeasured variables can be inferred from measured data. For example, naval manpower researchers have been asked to infer the causal relations among psychological traits such as job satisfaction and j…Read more
  •  26
    For most of the contributions to this volume, the project is this: Fill out “Event X is a cause of event Y if and only if……” where the dots on the right are to be filled in by a claims formulated in terms using any of (1) descriptions of possible worlds and their relations; (2) a special predicate, “is a law;” (3) “chances;” and (4) anything else one thinks one needs. The form of analysis is roughly the same as that sought in the Meno, and the methodology is likewise Socratic—proposals, examples…Read more
  •  47
    It is “well known” that in linear models: (1) testable constraints on the marginal distribution of observed variables distinguish certain cases in which an unobserved cause jointly influences several observed variables; (2) the technique of “instrumental variables” sometimes permits an estimation of the influence of one variable on another even when the association between the variables may be confounded by unobserved common causes; (3) the association (or conditional probability distribution of…Read more
  •  36
    Tianjaou Chu, David Danks, and Clark Glymour. Data Driven Methods for Nonlinear Granger Causality: Climate Teleconnection Mechanisms
  •  44
    nature of modern data collection and storage techniques, and the increases in the speed and storage capacities of computers. Statistics books from 30 years ago often presented examples with fewer than 10 variables, in domains where some background knowledge was plausible. In contrast, in new domains, such as climate research where satellite data now provide daily quantities of data unthinkable a few decades ago, fMRI brain imaging, and microarray measurements of gene expression, the number of va…Read more
  •  58
    It has been shown in Spirtes(1995) that X and Y are d-separated given Z in a directed graph associated with a recursive or non-recursive linear model without correlated errors if and only if the model entails that ρXY.Z = 0. This result cannot be directly applied to a linear model with correlated errors, however, because the standard graphical representation of a linear model with correlated errors is not a directed graph. The main result of this paper is to show how to associate a directed grap…Read more
  •  45
    Comment: Statistics and metaphysics
    Journal of the American Statistical Association 81 964-966. 2012.
  •  31
    Various algorithms have been proposed for learning (partial) genetic regulatory networks through systematic measurements of differential expression in wild type versus strains in which expression of specific genes has been suppressed or enhanced, as well as for determining the most informative next experiment in a sequence. While the behavior of these algorithms has been investigated for toy examples, the full computational complexity of the problem has not received sufficient attention. We show…Read more
  •  40
    Reflectance spectroscopy is a standard tool for studying the mineral composition of rock and soil samples and for remote sensing of terrestrial and extraterrestrial surfaces. We describe research on automated methods of mineral identification from reflectance spectra and give evidence that a simple algorithm, adapted from a well-known search procedure for Bayes nets, identifies the most frequently occurring classes of carbonates with reliability equal to or greater than that of human experts. We…Read more