# Chance Constrained Optimization

```{admonition} Part of a series: Optimization Under Uncertainty.
Follow reading [here](optimization_under_uncertainty.md)
```

## Description
Chance constrained optimization is an approach to solve optimization problems under uncertainty where the uncertainty is also present in to the inequality constraints. We need a formulation on how to restrict values and processes described by random variables in a meaningful way. The idea of the method is to require that the constraints are satisfied for sufficiently many realizations of the random variables, i.e., with a high enough probability.

Doing this is reasonable due to two aspects: 
1. Soft constraints: When applying optimization to real-life problems we often have the case that constraints do not actually need to be adhered to at any cost all the time. They rather allow for small violations and are known as soft constraints. To give an example: in power systems engineering, there are constraints that limit how much power can be transferred via a power transmission line. In reality however, it is possible to overload these lines for a short period of time without damaging them. Hence, it is acceptable to violate the constraint on this line flow in rare occasions.
2. Long tails: Many common probability distributions in modeling have long tails, such as the normal or exponential distribution. Random variables that follow these distributions take on values above or below a certain threshold with such a small probability that one can assume the possibility of these scenarios to happen to be negligible. Certainly, a mindful consideration between this simplifying assumption and the criticality of the constraint has to be made.

With this method we can restrict the feasible region of a stochastic optimization problem such that the confidence level of the solution to be feasible is high enough.


### Formulation
Consider an optimization problem under uncertainty

$$
    \begin{align}
        \min\ & f(x,\xi) \\
        \text{s.t.}\ & g(x,\xi) = 0 \\
        & h(x,\xi) \geq 0
    \end{align}
$$

where $f$ is the objective function, $g$ denotes the equality constraints and $h$ the inequality constraints. The (deterministic) decision variables are given by vector $x$, while $\xi$ is the vector containing all uncertainties in terms of random variables.

We now introduce *chance constraints* to restate the inequality constraints as

$$
    P(h(x,\xi) \geq 0) \geq 1 - \epsilon_p, \quad \epsilon_p \in [0,1].
$$

With this the requirement is changed such that the constraints $h$ have to be satisfied with a probability $P$ of at least $1 - \epsilon_p$ in order for a solution to be feasible. The smaller we set $\epsilon_p$ the more we restrict the set of feasible solutions which provides a better confidentiality for the results but makes it harder to compute them. Therefore, $\epsilon_p$ serves as a design parameter for a trade-off between security and computational cost. It is often referred to as *violation parameter*. Common values are $\epsilon_p \in \{0.1, 0.05, 0.01\}$.


### Types of chance constraints
There are different ways chance constraints can be stated. The formulations have certain benefits and drawbacks regarding solution precision and problem feasibility.

**Single chance constraints** are the direct reformulation of each of the original constraints $h_1, h_2, \dots, h_n$:

$$
    \begin{align}
        P\left(h_1(x, \xi) \geq 0 \right) & \geq 1 - \epsilon_1 \\
        P\left(h_2(x, \xi) \geq 0 \right) & \geq 1 - \epsilon_2 \\
        \vdots & \\
        P\left(h_n(x, \xi) \geq 0 \right) & \geq 1 - \epsilon_n
    \end{align}
$$

where each chance constraint $h_i$ gets its own tuning parameter $\epsilon_i$.

The advantage of this formulation is that each of the constraints can get assigned a probability according to the criticality of these constraints being adhered to. Furthermore, in case of a violation one directly has information on which bound was not satisfied. However, single chance constraints only enforce that each constraint is satisfied individually at a certain confidentiality level but not all constraints as a whole.

**Joint chance constraints** on the other hand require the conjunction of all inequality constraints to be met by a certain probability

$$
    P\left(h_1(x, \xi) \geq 0 \wedge h_2(x, \xi) \geq 0 \wedge \dots \wedge h_n(x, \xi) \geq 0 \right) \geq 1 - \epsilon_p
$$

for a joint parameter $\epsilon_p$.

Their advantage is that they assure the satisfaction of all constraint at a certain level. The reasoning behind this is the following: when we think of the deterministic constraints we also expect all of them to hold with the same probability of 1, all at once. This leads to a more conservative problem formulation. The problem, however, is that these formulations tend to be really hard to solve even numerically and are hence can only be used in very simple or special scenarios in practice. <!-- cite ?-->


### Solution Approaches
Chance constrained problems in general tend to be hard to solve. This is because the computation of the integral for

$$
    P(h(x,\xi) \geq 0) = \int_{\{\xi \mid h(x,\xi) \geq 0\}} f_\Xi(\xi) d\xi
$$

, where $f_\Xi$ is the probability density function of $\xi$, is often times difficult or even impossible. This is especially true if the model creates a non-linear relationship between input and output uncertainties. Hence, the fallback are approximation strategies.

Some of the most prevalent methods are: 
* **Robust / Scenario-based optimization**<!-- TODO: add reference once that article is written -->: Transfer the problem into a deterministic one. For this, we consider the worst case problem, i.e., require the chance constraints $h(x,\xi) \geq 0$ to be satisfied for as many realizations $\xi^{(i)} \in \Omega, i = 1, \dots, N$ as possible, where $\Omega$ is the set of all realizations. This is done by replacing the probabilistic constraint with new (deterministic) constraints for each sample: $h(x,\xi^{(i)}) \geq 0$. 
By adding enough constraints the new problem solution is likely to be a solution in the feasible region of the original problem as well. The number of samples required depends on the size of the violation parameter $\epsilon_p$. A lower bound can be given mathematically {cite}`Calafiore2005`. In case the violation parameter $\epsilon_p$ is very small, however, a large amount of samples is required to cover the worst case.

* **Back-mapping**: Instead of the direct computation of the probability density, we can derive an equivalent representation. This is done by finding a monotone relation between input and output uncertainty. By performing integration in the probability subspace of the input uncertainties and then transforming the result back one can more easily compute the required integrals. These monotone relations, however, can be demanding to find, if they exist at all. {cite}`Wendt2002`

* **[Polynomial chaos expansion](polynomial-chaos-expansion.md)**: In this more recent approach the uncertainties in the model are replaced by deterministic representations, similar to a Fourier analysis of random variables. By doing so  the chance constraints are also transformed into a deterministic version. The benefit of this specific representation is that it allows one to directly obtain the moments of the represented random variables. No further relaxations of the optimization problem are necessary here. A drawback of this method is that the deterministic models can blow up rapidly in the number of model uncertainties. {cite}`Muhlpfordt2018`, {cite}`Muhlpfordt2019`


<!-- Further approaches: Sampling based approaches --> <!-- Convex approximation --> <!-- moment-based -->


## Applications
Some areas in which stochastic optimization problems are solved with the help of chance constraints are:
* [Power systems engineering](energy_systems_main.md): Computation of optimal power generation and distribution under the probabilistic influence of renewable energy sources. Chance constraints are used for example to describe the engineering limits on the power transmission lines. A solution is feasible if the transmission lines are not overloaded "most of the time" {cite}`Muhlpfordt2019`.
* Financial risk management: Unpredictable and volatile market situations lead to many uncertainties when investing. When aiming to maximize revenue, chance constraints can be used in order to require probabilities for certain outcomes of investments. They can be used redundantly to common risk metrics like [value at risk](financial-risk-optimization.md).
* Unmanned autonomous vehicle navigation: Used for optimal navigation and obstacle avoidance. Uncertainty can stem from random obstacles or sensor measurement errors. With only estimations available for the distributions of the objects and errors one can use chance constraints in order to require a certain confidence level to avoid collisions when navigating through a certain region.


## Variants and Related Methods
An extension of the chance constrained method are *weighted chance constraints*. While small violations of a constraint may be acceptable, larger ones can lead to infeasible problem solutions. In order to avoid this, a weight function is introduced in order to penalize larger constraint violations. {cite}`Roald2015`

Related methods in the field of optimization under uncertainty are:
* [Risk management](financial_risk_optimization.md)
* [Stochastic Programming](stochastic_programming.md)
* Robust Optimization 


<!-- Do you have a general Lehrbuch/ Introduction to chance constraint book, you can cite here?-->
## Literature
```{bibliography}
:filter: docname in docnames
```


## Authors
Adrian Grupp

## Contributor
Sonja Germscheid