Gaussian distribution

It makes only two assumptions about a distribution, that it has a mean and variance.

f(x,μ,σ)=1σ2πe(xμ2σ)2
  • μ determines the center o the distribution

  • σ2 determines the variance

  • σ2π is the normalization

Properties

  • mean and variance are independent

  • symmetrical p(x)=p(x) *unbounded

  • Governed by the central limit theorem, where average tend to converge to a central limit.

  • Symmetry for tail areas for CDF: Φ(z)=1Φ(z)

  • Symmetry of X,X. If XN(μ,σ) than XN(μ,σ)

Sum difference of two Gaussians:

  1. N(μ1,σ21)N(μ2,σ22)=N(μ1μ2,σ21+σ22)

  2. N(μ1,σ21)+N(μ2,σ22)=N(μ1+μ2,σ21+σ22)

Moments

Mean

E[X]=μ

Variance

var[X]=σ2

Standard normal

Gaussian that has zero mean an unit variance. XN(0,1)

Degenerate

In the limit $limσ20N(x|μ,σ2)=δ(xμ)$

Pecission parametrization

We can parametrize the Gaussian by its precision instead of variance. Precision is given by λ=1/σ2. This means the higher the precission the tighter the distribution.

N(x;μ,Λ)=det(Λ)2πexp(12(xμ)TΛ(xμ))

This may be practical since we do not need to invert the Covariance matrix.

Multivariate

An n dimensional Gaussian is parametrized by a mean vector μ and covariance matrix Σ or precision matrix Λ.

N(x|μ,Σ)1(2π)D/2|Σ|1/2exp[12(xμ)TΣ1(xμ)]

CDF

We integrate the pdf: $Φ(x;μ,σ2)=xN(x|μ,σ2)dx$

This unfortunately has no closed form, so it is built into most software packages.

Usage

Many common biological measurements can be occasionally approximately described by a Gaussian, especially if the samples are very large, of if we are dealing with an average count.

Many common statistical procedures have been derived where de assume that the distribution from which the data has arose is Normal. Specially, inference and hypothesis testing from simple parametric tests like OLS regression, ANOVA, assume that the residuals are Normal distributed with mean of zero. The reliability of these test depend on the degree of conformity to this assumption of normality.

Isotropic Gaussian

A Gaussian where covariance matrix is just a scalar times the identity matrix.

Σ=σI