artificial-intelligence | Asking Why.

Archive of posts with category 'artificial-intelligence'

Causal RL Wrapping Up: Where to?

Coming soon.

St John
16 Jan 2021

CRL Task 6: Causal Imitation Learning

And as quickly as that, we’re at task 6 of our quest. Causal imitation learning is perhaps the most fanciful-sounding, but at its core it remains as simple a goal...

St John
16 Jan 2021

CRL Task 5: Learning Causal Models

We’ve now come to one of the most vital aspects of this theory - how can we learn causal models? Learning models is often an exceptionally computationally intensive process, so...

St John
15 Jan 2021

CRL Task 4: Generalisability and Robustness

At this point we’ve developed a good sense of the technical theory of causal reinforcement learning. This next section brings together many important ideas and generalises notions of data transfer...

St John
15 Jan 2021

CRL Task 3: Counterfactual Decision Making

In the previous blog post we discussed some theory of how to select optimal and possibly optimal interventions in a causal framework. For those interested in the decision science, this...

St John
17 Dec 2020

CRL Task 2: Interventions - When and Where?

In the previous blog post we discussed the gory details of generalised policy learning - the first task of CRL. We went into some very detailed mathematical description of dynamic...

St John
16 Dec 2020

CRL Task 1: Generalised Policy Learning

In the previous blog post we developed some ideas and theory needed to discuss a causal approach to reinforcement learning. We formalised notions of multi-armed bandits (MABs), Markov Decision Processes...

St John
14 Dec 2020

Preliminaries for CRL

In the previous blog post we discussed and motivated the need for a causal approach to reinforcement learning. We argued that reinforcement learning naturally falls on the interventional rung of...

St John
10 Dec 2020

Causal Reinforcement Learning: A Primer

As part of any honours degree at the University of Cape Town, one is obliged to write a thesis droning on about some topic. Luckily for me, applied mathematics can...

St John
09 Dec 2020

The Do-Calculus

This Series

St John
10 Sep 2020

Faithfulness

In the last discussion we sought to rigorously define counterfactual statements and distributions in terms of our DAG formalism of causal inference. This appeared fruitful but the theory is certainly...

St John
07 Sep 2020

Reaching Rung 3: Counterfactual Reasoning

In our last discussion we covered the so-called ‘rung two’ of the ladder of causation: interventions and randomisation in control trials. This is an incredibly important field in the...

St John
06 Sep 2020

Interventions and Multivariate SCMs

Last time we discussed how we can learn causal structure from data and thought about how this relates to machine learning. Specifically, we noticed that having more data in a...

St John
01 Sep 2020

Causality and Machine Learning

Last time we briefly discussed the theory needed to start thinking about how we can learn, in the statistical sense, causal information from ‘dumb’ data. Some key points were that...

St John
10 Aug 2020

Learning Causal Models

In the last episode we developed the first tools for building the theory needed to formalise interventions and counterfactual reasoning. In this article we’ll discuss how we can...

St John
09 Aug 2020

Causal Models

Last time we discussed and motivated the need for a modern theory of causal inference. We developed some of the basic principles necessary to develop this theory, but we have...

St John
07 Aug 2020

A Causal Perspective

What’s the first thing a statistician will say when you dare say the word cause? If you’ve ever taken a statistics class, I have little doubt it was the classic...

St John
05 Aug 2020

Uncertainty in Model Based RL

Perhaps we could use uncertainty estimation to detect where the model may be wrong and then correct for these potential errors without having to collect much more data. This uncertainty...

St John
13 Jun 2020

Exploration vs Exploitation

One of the inherent problems an agent faces in some arbitrary environment is how to decide whether to explore and discover more of the world around it, or to rather...

St John
10 Jun 2020

Free Energy of Expected Future

The active inference framework proposes agents act to maximise the evidence for a biased generative model, whereas in reinforcement learning the agent seeks to maximise the expected discounted cumulative reward....

St John
08 Jun 2020

Introduction to Reinforcement Learning

But what is reinforcement learning? The field of reinforcement learning is at the crossroads between optimal control, animal psychology, artificial intelligence and game theory and has seen a surge of...

St John
07 Jun 2020

Learning with a Model

Where we are: up to this point we have discussed methods primarily relying on the learning of value functions, usually approximating these with some neural network. That is, our focus...

St John
04 Jun 2020

World Models: Learning by Imagination

The World Models (Ha et al., 2018) paper presented at NIPS in 2018 exploits the idea of having an agent train entirely within its latent representation of the world it...

St John
28 May 2020

Learning with a Policy

Model-free reinforcement learning algorithms are a class of algorithms which do not use the transition probability information to train and make decisions. In a sense, they are a class of...

St John
25 May 2020
