\(\DeclarePairedDelimiterX{\Set}[2]{\{}{\}}{#1 \nonscript\;\delimsize\vert\nonscript\; #2}\) \( \DeclarePairedDelimiter{\set}{\{}{\}}\) \( \DeclarePairedDelimiter{\parens}{\left(}{\right)}\) \(\DeclarePairedDelimiterX{\innerproduct}[1]{\langle}{\rangle}{#1}\) \(\newcommand{\ip}[1]{\innerproduct{#1}}\) \(\newcommand{\bmat}[1]{\left[\hspace{2.0pt}\begin{matrix}#1\end{matrix}\hspace{2.0pt}\right]}\) \(\newcommand{\barray}[1]{\left[\hspace{2.0pt}\begin{matrix}#1\end{matrix}\hspace{2.0pt}\right]}\) \(\newcommand{\mat}[1]{\begin{matrix}#1\end{matrix}}\) \(\newcommand{\pmat}[1]{\begin{pmatrix}#1\end{pmatrix}}\) \(\newcommand{\mathword}[1]{\mathop{\textup{#1}}}\)
Needs:
Typed Graphs
Directed Graph Distributions
Needed by:
Parameterized Distribution Graphs
Links:
Sheet PDF
Graph PDF

Distribution Graphs

Why

How can we construct distributions which factor according to a directed graph?1 Related: how can we compactly represent complex distributions over high-dimensional spaces?

Definition

Let $\bar{G} = (G, A)$ be a typed graph (see Typed Graphs) with directed and acyclic $G$. For source vertices $i$, let $g_i: A_i \to [0, 1]$ be a distribution and otherwise let $g_{i}: A \times A_{\pa_i} \to [0, 1]$ denote a function satisfying $g_i(\cdot , x)$ is a distribution for every $x \in X_{\pa_i}$.

We call the ordered pair $(\bar{G}, g)$ a distribution graph .

The distribution of $(\bar{G}, g)$ is the function $p: \prod_{i} A_i \to [0, 1]$ defined by

\[ p(a) = \prod_{\pa_i = \varnothing} g_i(a_i) \prod_{\pa_i \neq \varnothing} g_i(a_i, a_{\pa_i}). \]

It is, of course, a distribution. And it factors according to the directed and acyclic graph $G$. Also, the $g_i$, $i = 1, \dots , n$, are the conditionals.2

In other words, a distribution graph represents a probability distributions via products of smaller, “local”, conditional probability distributions.

Other terminology

Other terminology includes distribution network, conditional distribution network, conditional distribution graph, bayesian network,3 bayes net, directed probabilistic graphical model, directed graphical model.

Necessity of acyclicity

If $G$ above is not taken to be acyclic, then the “distribution” of the distribution graph need not be a proper probability distribution (the condition which will fail is normalization).


  1. Future editions might flip the order of this sheet with that of directed graph distributions since, in the genetic approach, in may be more natural to think of constructing such distributions before analyzing them. This is partially motivated by the acyclic constraint here, which restricts the graphs according to which a distribution can factor. ↩︎
  2. Future editions will elaborate and give a proof. ↩︎
  3. Indeed, this term is near universal in certain literatures. We avoid it in these sheets as a result of the Bourbaki project’s policy on naming. ↩︎
Copyright © 2023 The Bourbaki Authors — All rights reserved — Version 13a6779cc About Show the old page view