\(\DeclarePairedDelimiterX{\Set}[2]{\{}{\}}{#1 \nonscript\;\delimsize\vert\nonscript\; #2}\) \( \DeclarePairedDelimiter{\set}{\{}{\}}\) \( \DeclarePairedDelimiter{\parens}{\left(}{\right)}\) \(\DeclarePairedDelimiterX{\innerproduct}[1]{\langle}{\rangle}{#1}\) \(\newcommand{\ip}[1]{\innerproduct{#1}}\) \(\newcommand{\bmat}[1]{\left[\hspace{2.0pt}\begin{matrix}#1\end{matrix}\hspace{2.0pt}\right]}\) \(\newcommand{\barray}[1]{\left[\hspace{2.0pt}\begin{matrix}#1\end{matrix}\hspace{2.0pt}\right]}\) \(\newcommand{\mat}[1]{\begin{matrix}#1\end{matrix}}\) \(\newcommand{\pmat}[1]{\begin{pmatrix}#1\end{pmatrix}}\) \(\newcommand{\mathword}[1]{\mathop{\textup{#1}}}\)
Needs:
Approximators
Empirical Distribution of a Dataset
Needed by:
Tree Distribution Approximators
Links:
Sheet PDF
Graph PDF

Distribution Approximators

Why

We have a distribution to approximate. It is, for some reason, unsuitable to our needs and we want to replace it with one more suitable.

Definition

A distribution approximator is an approximator of a probability distribution. It is also a distribution. The criterion of approximation is any similarity function on distributions over the same space.

Notation

Let $A$ be a non-empty set and $q: A \to \R $ be a distribution. Let $p: A \to \R $ be a distribution. Then $p: A \to \R $ is a distribution approximator of $q$.

Reasons for approximation

Infeasible to represent. If there are many outcomes, many numbers are required to specify the distribution. If $p: A^n \to \R $ where $\mid A\mid = k$, then there are $k^n$ outcomes; take, for example, $k = 2$ and $n = 100$. So we might want to find a distribution which requires fewer numbers to express. In other words, we want a different distribution, selected from the set of those which is easier to express, which is close to the original.

Unreasonable from common sense. The distribution may be unreasonable as a result of our common sense. For example, it may give zero probability to an outcome which we know to be possible, and would like to model with non-zero probability. This may happen when working with an empirical distribution: a particular outcome does not appear in the dataset, however, our common sense suggests is possible. In this case, we want to find a different distribution, selected from the set of those which is more reasonable based on common sense, which is close to the original.

Copyright © 2023 The Bourbaki Authors — All rights reserved — Version 13a6779cc About Show the old page view