# Stochastic Representation of a Reaction Model

```{admonition} Part of a series: Uncertainty Quantification for a Dynamical Model.
Follow reading [here](dynamical_model_intro.md)
```

Real world phenomena that are described by [reaction systems](dynamical_model_reactions.md) typically evolve over time stochastically. It is hardly possible to predict the spread of a disease or the variation of gene expression exactly. The dynamics are often subject to randomness induced by internal and external factors {cite}`Warne2019`. In order to capture this behaviour, the dynamics can be characterised by a [stochastic process](stoch_process.md) if certain assumptions are satisfied. Within this article, if not stated otherwise, we adopt the notation from the preceding article about [reaction systems](dynamical_model_reactions.md).

According to {cite}`Gillespie1992`, we impose the assumptions that the system is well-mixed and in thermal equilibrium. It implies that spatial positions of individuals can be ignored and that the dynamics only depend on the current number of each object type. Besides, external parameters, such as temperature, can be neglected. Then, {cite}`Gillespie1992` shows that the system dynamics follow a Markov jump process (MJP) $X=(X(t))_{t\ge 0}$, with $X(t)=(X_1(t),\dots,X_N(t))$. An MJP is a special case of a [Markov process](markov_process.md) that is continuous in time with discrete state space. So, note that $X=(X(t))_{t\ge 0}$ now defines a [stochastic process](stoch_process.md), i.e. a family of multi-dimensional random variables $X(t)$ for time points $t\ge0$. The integer-valued random variable $X_i(t)$ depicts the random number of individuals of object $i$ at time point $t$. In the previous [article](dynamical_model_reactions.md), the temporal dimension plays a minor role and the state variable $X$ is considered to be deterministic. Thorough and formal introductions to Markov processes and stochastic processes in general can be found, for example, in {cite}`Bass2011,Kallenberg2021`.


## Chemical Master Equation
The chemical master equation (CME) is a partial differential equation that describes the time evolution of the state transition distributions of a MJP $X$. The CME is a special case for MJPs of the more general [Chapman-Kolmogorov equation](markov_process.md) (CKE) for arbitrary Markov processes. Another analogous and popular example of a CKE is the Fokker-Planck equation for diffusion processes {cite}`Fuchs2013`. 

Before we are able to formulate the CME, we need to introduce some more notation. Given that $X(t)=x$, for some fixed $t\ge0,~x\in\mathbb{N}_0^N$, the probability that reaction $R_r$ occurs within an infinitesimal time interval $[t,t+dt)$ is denoted by $a_r(x)dt$, where $a_r\colon\mathbb{N}_0^N\to\mathbb{R}_+$ is the propensity function of reaction $R_r$.

This function classically obeys "the law of mass action" that goes back to {cite}`Waage1986` (English translation of the original article from 1864). It states that the probability of reaction $R_r$ to happen depends on its reaction rate $c_r$ and the number of combinations of reacting objects. Thus, a common choice of $a_r$ is {cite}`Wilkinson2019,Schnoerr2017`

$$
\begin{align}
	a_r(x)=c_r\prod_{i=1}^N \binom{x_i}{s_{ir}}=c_r\prod_{i=1}^N\frac{x_i!}{(x_i-s_{ir})!s_{ir}!}.
\end{align}
$$ (eqn:1_propensity)

Moreover, we denote the probability of the system to be in $x\in\mathbb{N}_0^N$ at time $t\ge0$ by $P(x,t)=P(x,t|x(0),t_0)$ conditioned on the event that it was in state $x(0)\in\mathbb{N}_0^N$ at time $t_0\le t$. The change in $P(x,t)$ over time is described by the CME which states {cite}`Gillespie1992`

$$
\begin{align}
\frac{\partial}{\partial t}P(x,t)=\sum_{r=1}^M \left(a_r(x-v_r)P(x-v_r,t)-a_r(x)P(x,t)\right).
\end{align}
$$ (eqn:2_cme)

In plain words, the probability of the system of being in $x$ at time $t$ after an infinitesimal time step consists of two parts. The first part $a_r(x-v_r)P(x-v_r,t)$ on the right-hand side of Equation {eq}`eqn:2_cme` represents the probabilities of reaching state $x$ at $t$ due to a reaction $r$ after having been in a different state. The second part $a_r(x)P(x,t)$, however, corresponds to transitions to another state due to a reaction $r$ and has to be subtracted.

The following example might clarify the meaning of the CME on the basis of the running example from the preceding [article](dynamical_model_reactions.md). 

### Example
In the context of this example of a simple reaction system, the propensity functions defined in Equation {eq}`eqn:1_propensity` are given by

$$
\begin{align*}
	a_1(x)=c_1,\qquad a_2(x)=c_2x_1,\qquad a_3(x)=c_3x_2.
\end{align*}
$$

Consequently, the CME reads in this case

$$
\begin{align*}
	\frac{\partial}{\partial t}P(x,t)=&\left(c_1P(x-v_1,t)-c_1P(x,t)\right)+\left(c_2(x_1+1)P(x-v_2,t)-c_2x_1P(x,t)\right)\\
	&+\left(c_3(x_2+1)P(x-v_3,t)-c_3x_2P(x,t)\right).
\end{align*}
$$

In general, the CME defined in Equation {eq}`eqn:2_cme` might not admit an analytical solution. In order to study the behaviour of such a system, simulation and approximation methods are commonly used instead. 


## Simulation
Stochastic simulation methods are able to simulate sample paths of any MJP. The most fundamental algorithm is the stochastic simulation algorithm (SSA). The algorithm was introduced in {cite}`Gillespie1976,Gillespie1977` and it is also known as Gillespie's algorithm according to its originator. The following formulations are based on {cite}`Fuchs2013,Wilkinson2019`.

The exact simulation of any MJP relies on the fact that the waiting time $\tau$ from a reaction occurring at time $t>0$ to a next reaction at time $t+\tau$ is a random variable that is exponentially distributed with rate

$$
\begin{align*}
	a_0(x(t))=\sum_{r=1}^M a_r(x(t)).
\end{align*}
$$

The probability of reaction $R_r$ to happen at $t+\tau$ is then given by $\frac{a_r(x(t))}{a_0(x(t))}$. Algorithm 1 depicts the complete iterative procedure of simulated reactions on a time interval $[t_0,T]$ yielding a sample path of a MJP on this interval.

```{admonition} Algorithm 1: Gillespies's algorithm
**Inputs** Initial parameters $t_0>0$ and $x(0)\in\mathbb{N}_0^N$, final time point $T>t_0$, propensity functions $a_r,~r=1,\dots,M$.

Initialization with $t=t_0$ and $x(t)=x(0)$.<br>
**While** $t<T$<br>
&ensp; Calculate $a_r(x(t))$, for $r=1,\dots,M$, and $a_0(x(t))$.<br>
&ensp; Draw $\tau\sim Exp(a_0(x(t)))$ and set $\tau^*=\min\{\tau,T-t\}$.<br>
&ensp; Draw index $j\in\{1,\dots,M\}$ with probabilities $\frac{a_j(x(t))}{a_0(x(t))}$.<br>
&ensp; Set $x(s)=x(t)$ for all $s\in(t,t+\tau^*)$ and $x(t+\tau^*)=x(t)+v_j1_{(\tau^*=\tau)}$.<br>
&ensp; Set $t=t+\tau$.<br>
**End**
	
**Output** A sample path $x(t),~t\in[t_0,T]$, of a MJP $(X_t)_{t\in[0,T]}$.
```

Gillespie's algorithm produces a sample path of an underlying MJP. Since it is a probabilistic algorithm, two simulated path of the same system never coincide. In {numref}`fig_mjp`, trajectories from the above expample are depicted. The sample path fully obey the probability distributions of the process that are determined by the CME. Gillespie's algorithm is therefore referred to as an "exact" {cite}`Gillespie1992` algorithm. The algorithm is still subject of ongoing research and extensions have been developed. Some important modifications are outlined in {cite}`Warne2019`. All these exact methods usually go along with high computational costs for larger systems that involve numerous reactions that need to be simulated. Approximate algorithms, as the "tau-leaping method" {cite}`Gillespie2001`, are more efficient in that regard. 

```{figure} figures/dynamical_model/dynamical_model_mjp.png
:alt: Simulated trajectories
:align: center
:class: bg-primary mb-1
:name: fig_mjp
10 trajetories of the 2-dimensional MJP from the example above generated by Gillespie's algorithm with $c_1=20,~c_2=0.1,~c_3=0.15,~x_1(0)=100$ and $x_2(0)=100$
```

The CME admits an analytic solution only in a few special cases {cite}`Schnoerr2017`. Therefore, the likelihood function of an MJP is, in general, not available in order to infer model parameters statistically. This problem can be circumvented by approximations of MJPs or likelihood-free approaches. The former is covered in more detail in the next article in this series.


## References 
```{bibliography}
:filter: docname in docnames  
```

## Authors
Julian Wäsche

## Contributors
Houda Yaqine