Causal AICausal AI is a technique in artificial intelligence that builds a causal model and can thereby make inferences using causality rather than just correlation. One practical use for causal AI is for organisations to explain decision-making and the causes for a decision.[1][2] Systems based on causal AI, by identifying the underlying web of causality for a behaviour or event, provide insights that solely predictive AI models might fail to extract from historical data.[3] An analysis of causality may be used to supplement human decisions in situations where understanding the causes behind an outcome is necessary, such as quantifying the impact of different interventions, policy decisions or performing scenario planning.[4] A 2024 paper from Google DeepMind demonstrated mathematically that "Any agent capable of adapting to a sufficiently large set of distributional shifts must have learned a causal model".[5] The paper offers the interpretation that learning to generalise beyond the original training set requires learning a causal model, concluding that causal AI is necessary for artificial general intelligence. HistoryThe concept of causal AI and the limits of machine learning were raised by Judea Pearl, the Turing Award-winning computer scientist and philosopher, in 2018's The Book of Why: The New Science of Cause and Effect. Pearl asserted: “Machines' lack of understanding of causal relations is perhaps the biggest roadblock to giving them human-level intelligence.”[6][7] In 2020, Columbia University established a Causal AI Lab under Director Elias Bareinboim. Professor Bareinboim’s research focuses on causal and counterfactual inference and their applications to data-driven fields in the health and social sciences as well as artificial intelligence and machine learning.[8] Technological research and consulting firm Gartner for the first time included causal AI in its 2022 Hype Cycle report, citing it as one of five critical technologies in accelerated AI automation.[9][10] One significant advance in the field is the concept of Algorithmic Information Dynamics:[11] a model-driven approach for causal discovery using Algorithmic Information Theory and perturbation analysis. It solves inverse causal problems by studying dynamical systems computationally. A key application is causal deconvolution, which separates generative mechanisms in data with algorithmic models rather than traditional statistics. [12] This method identifies causal structures in networks and sequences, moving away from probabilistic and regression-based techniques, marking one of the first practical Causal AI approaches using algorithmic complexity and algorithmic probability in Machine Learning. [13] References
|