skip to main content
research-article
Open Access

The seven tools of causal inference, with reflections on machine learning

Published:21 February 2019Publication History
Skip Abstract Section

Abstract

The kind of causal inference seen in natural human thought can be "algorithmitized" to help produce human-level machine intelligence.

References

  1. Balke, A. and Pearl, J. Probabilistic evaluation of counterfactual queries. In Proceedings of the 12<sup>th</sup> National Conference on Artificial Intelligence (Seattle, WA, July 31-Aug. 4). MIT Press, Menlo Park, CA, 1994, 230--237. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bareinboim, E. and Pearl, J. Causal inference by surrogate experiments: z-identifiability. In Proceedings of the 28<sup>th</sup> Conference on Uncertainty in Artificial Intelligence, N. de Freitas and K. Murphy, Eds. (Catalina Island, CA, Aug. 14--18). AUAI Press, Corvallis, OR, 2012, 113--120. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Bareinboim, E. and Pearl, J. Causal inference and the data-fusion problem. Proceedings of the National Academy of Sciences 113, 27 (2016), 7345--7352.Google ScholarGoogle ScholarCross RefCross Ref
  4. Chen, Z. and Liu, B. Lifelong Machine Learning. Morgan and Claypool Publishers, San Rafael, CA, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Darwiche, A. Human-Level Intelligence or Animal-Like Abilities? Technical Report. Department of Computer Science, University of California, Los Angeles, CA, 2017; https://arxiv.org/pdf/1707.04327.pdfGoogle ScholarGoogle Scholar
  6. Graham, J. Missing Data: Analysis and Design (Statistics for Social and Behavioral Sciences). Springer, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  7. Halpern, J.H. and Pearl, J. Causes and explanations: A structural-model approach: Part I: Causes. British Journal of Philosophy of Science 56 (2005), 843--887.Google ScholarGoogle ScholarCross RefCross Ref
  8. Hutson, M. AI researchers allege that machine learning is alchemy. Science (May 3, 2018); https://www.sciencemag.org/news/2018/05/ai-researchers-allege-machine-learning-alchemyGoogle ScholarGoogle Scholar
  9. Jaber, A., Zhang, J.J., and Bareinboim, E. Causal identification under Markov equivalence. In Proceedings of the 34<sup>th</sup> Conference on Uncertainty in Artificial Intelligence, A. Globerson and R. Silva, Eds. (Monterey, CA, Aug. 6--10). AUAI Press, Corvallis, OR, 2018, 978--987.Google ScholarGoogle Scholar
  10. Lake, B.M., Salakhutdinov, R., and Tenenbaum, J.B. Human-level concept learning through probabilistic program induction. Science 350, 6266 (Dec. 2015), 1332--1338.Google ScholarGoogle ScholarCross RefCross Ref
  11. Marcus, G. Deep Learning: A Critical Appraisal. Technical Report. Departments of Psychology and Neural Science, New York University, New York, 2018; https://arxiv.org/pdf/1801.00631.pdfGoogle ScholarGoogle Scholar
  12. Mohan, K. and Pearl, J. Graphical Models for Processing Missing Data. Technical Report R-473. Department of Computer Science, University of California, Los Angeles, CA, 2018; forthcoming, Journal of American Statistical Association; http://ftp.cs.ucla.edu/pub/stat_ser/r473.pdfGoogle ScholarGoogle Scholar
  13. Mohan, K., Pearl, J., and Tian, J. Graphical models for inference with missing data. In Advances in Neural Information Processing Systems 26, C.J.C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K.Q. Weinberger, Eds. Curran Associates, Inc., Red Hook, NY, 2013, 1277--1285; http://papers.nips.cc/paper/4899-graphical-models-for-inference-with-missing-data.pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Morgan, S.L. and Winship, C. Counterfactuals and Causal Inference: Methods and Principles for Social Research (Analytical Methods for Social Research), Second Edition. Cambridge University Press, New York, 2015.Google ScholarGoogle Scholar
  15. Pearl, J. Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Mateo, CA, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Pearl, J. Comment: Graphical models, causality, and intervention. Statistical Science 8, 3 (1993), 266--269.Google ScholarGoogle ScholarCross RefCross Ref
  17. Pearl, J. Causal diagrams for empirical research. Biometrika 82, 4 (Dec. 1995), 669--710.Google ScholarGoogle ScholarCross RefCross Ref
  18. Pearl, J. Causality: Models, Reasoning, and Inference. Cambridge University Press, New York, 2000; Second Edition, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Pearl, J. Direct and indirect effects. In Proceedings of the 17<sup>th</sup> Conference on Uncertainty in Artificial Intelligence (Seattle, WA, Aug. 2--5). Morgan Kaufmann, San Francisco, CA, 2001, 411--420. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Pearl, J. Causes of effects and effects of causes. Journal of Sociological Methods and Research 44, 1 (2015a), 149--164.Google ScholarGoogle Scholar
  21. Pearl, J. Trygve Haavelmo and the emergence of causal calculus. Econometric Theory 31, 1 (2015b), 152--179; special issue on Haavelmo centennialGoogle ScholarGoogle ScholarCross RefCross Ref
  22. Pearl, J. and Bareinboim, E. External validity: From do-calculus to transportability across populations. Statistical Science 29, 4 (2014), 579--595.Google ScholarGoogle ScholarCross RefCross Ref
  23. Pearl, J. and Mackenzie, D. The Book of Why: The New Science of Cause and Effect. Basic Books, New York, 2018. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Peters, J., Janzing, D. and Schölkopf, B. Elements of Causal Inference: Foundations and Learning Algorithms. MIT Press, Cambridge, MA, 2017. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Porta, M. The deconstruction of paradoxes in epidemiology. OUPblog, Oct. 17, 2014; https://blog.oup.com/2014/10/deconstruction-paradoxes-sociology-epidemiology/Google ScholarGoogle Scholar
  26. Ribeiro, M.T., Singh, S., and Guestrin, C. Why should I trust you?: Explaining the predictions of any classifier. In Proceedings of the 22<sup>nd</sup> ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Francisco, CA, Aug. 13--17). ACM Press, New York, 2016, 1135--1144. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Robins, J.M. and Greenland, S. Identifiability and exchangeability for direct and indirect effects. Epidemiology 3, 2 (Mar. 1992), 143--155.Google ScholarGoogle ScholarCross RefCross Ref
  28. Rosenbaum, P. and Rubin, D. The central role of propensity score in observational studies for causal effects. Biometrika 70, 1 (Apr. 1983), 41--55.Google ScholarGoogle ScholarCross RefCross Ref
  29. Shimizu, S., Hoyer, P.O., Hyvärinen, A., and Kerminen, A.J. A linear non-Gaussian acyclic model for causal discovery. Journal of the Machine Learning Research 7 (Oct. 2006), 2003--2030. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Shpitser, I. and Pearl, J. Complete identification methods for the causal hierarchy. Journal of Machine Learning Research 9 (2008), 1941--1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Spirtes, P., Glymour, C.N., and Scheines, R. Causation, Prediction, and Search, Second Edition. MIT Press, Cambridge, MA, 2000.Google ScholarGoogle Scholar
  32. Tian, J. and Pearl, J. A general identification condition for causal effects. In Proceedings of the 18<sup>th</sup> National Conference on Artificial Intelligence (Edmonton, AB, Canada, July 28-Aug. 1). AAAI Press/MIT Press, Menlo Park, CA, 2002, 567--573. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. van der Laan, M.J. and Rose, S. Targeted Learning: Causal Inference for Observational and Experimental Data. Springer, New York, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  34. VanderWeele, T.J. Explanation in Causal Inference: Methods for Mediation and Interaction. Oxford University Press, New York, 2015.Google ScholarGoogle Scholar
  35. Zhang, J. and Bareinboim, E. Transfer learning in multi-armed bandits: A causal approach. In Proceedings of the 26<sup>th</sup> International Joint Conference on Artificial Intelligence (Melbourne, Australia, Aug. 19--25). AAAI Press, Menlo Park, CA, 2017, 1340--1346. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The seven tools of causal inference, with reflections on machine learning

          Recommendations

          Reviews

          Jonathan P. E. Hodgson

          There are three obstacles to meeting the increasing expectations for artificial intelligence (AI), according to this article: the lack of adaptability or robustness; the lack of explainability; and "the lack of understanding of cause-effect connections." This article is mainly concerned with the latter. The author asserts that an intelligent system should be able to answer such questions as "What would have happened if I had acted differently " The author's claim is that all three obstacles and the answering of such questions can be overcome by using causal reasoning. The article gives an overview of causal reasoning and structured causal models (SCM), emphasizing a three-level hierarchy in which each level is capable of answering different types of questions. The first level manages association-questions such as "What does a symptom tell me about a disease " The second level, "intervention," manages questions like "What if we ban cigarettes " The third level manages counterfactuals, dealing with questions like "Would Kennedy be alive had Oswald not shot him " The tools used for causal reasoning include graphical models that show the causal relationships between variables and the " do -calculus," which simulates physical interventions where the distribution resulting from a specific action is predicted. The article's extended example illustrates the process and presents "a bird's-eye view of seven tasks accomplished through the SCM framework." A critical property of Pearl's system is that it is effectively computable. The article provides a lucid introduction to the ideas and is recommended to all AI workers.

          Access critical reviews of Computing literature here

          Become a reviewer for Computing Reviews.

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image Communications of the ACM
            Communications of the ACM  Volume 62, Issue 3
            March 2019
            109 pages
            ISSN:0001-0782
            EISSN:1557-7317
            DOI:10.1145/3314328
            Issue’s Table of Contents

            Copyright © 2019 Owner/Author

            Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 21 February 2019

            Check for updates

            Qualifiers

            • research-article
            • Popular
            • Refereed

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format