Difference between revisions of "Theory of Stochastic Signals/Binomial Distribution"

Revision as of 12:01, 11 December 2021

General description of the binomial distribution

$\text{Definition:}$ The binomial distribution represents an important special case for the occurrence probabilities of a discrete random variable.

To derive the binomial distribution, we assume that $I$ binary and statistically independent random variables $b_i$ each can achieve

the value $1$ with probability ${\rm Pr}(b_i = 1) = p$, and
the value $0$ with probability ${\rm Pr}(b_i = 0) = 1-p$.

Then the sum $z$ is also a discrete random variable with the symbol set $\{0, \ 1, \ 2,\hspace{0.1cm}\text{ ...} \hspace{0.1cm}, \ I\}$, which is called binomially distributed:

$$z=\sum_{i=1}^{I}b_i.$$

Thus, the symbol set size is $M = I + 1.$

$\text{Example 1:}$ The binomial distribution finds manifold applications in Communications Engineering as well as in other disciplines:

It describes the distribution of rejects in statistical quality control.
It allows the calculation of the residual error probability in blockwise coding.
The bit error rate of a digital transmission system obtained by simulation is actually a binomially distributed random quantity.

Probabilities of the binomial distribution

$\text{Calculation rule:}$ For the probabilities of the binomial distribution with $μ = 0, \hspace{0.1cm}\text{...} \hspace{0.1cm}, \ I$:

$$p_\mu = {\rm Pr}(z=\mu)={I \choose \mu}\cdot p\hspace{0.05cm}^\mu\cdot ({\rm 1}-p)\hspace{0.05cm}^{I-\mu}.$$

The first term here indicates the number of combinations $($read: $I\ \text{ over }\ μ)$:

$${I \choose \mu}=\frac{I !}{\mu !\cdot (I-\mu) !}=\frac{ {I\cdot (I- 1) \cdot \ \cdots \ \cdot (I-\mu+ 1)} }{ 1\cdot 2\cdot \ \cdots \ \cdot \mu}.$$

Additional notes:

For very large values of $I$, the binomial distribution can be approximated by the Poisson distribution described in the next section.
If at the same time the product $I · p \gg 1$, then according to de Moivre–Laplace's (central limit) theorem, the Poisson distribution (and hence the binomial distribution) transitions to a discrete Gaussian distribution.

$\text{Example 2:}$ The graph shows the probabilities of the binomial distribution are for $I =6$ and $p =0.4$.

Binomial distribution probabilites

Thus $M = I+1=7$ probabilities are different from zero.

In contrast, for $I = 6$ and $p = 0.5$, the probabilities of the binomial distribution are as follows:

$$\begin{align*}{\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}0) & = {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}6)\hspace{-0.05cm} =\hspace{-0.05cm} 1/64\hspace{-0.05cm} = \hspace{-0.05cm}0.015625 ,\\ {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}1) & = {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}5) \hspace{-0.05cm}= \hspace{-0.05cm}6/64 \hspace{-0.05cm}=\hspace{-0.05cm} 0.09375,\\ {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}2) & = {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}4)\hspace{-0.05cm} = \hspace{-0.05cm}15/64 \hspace{-0.05cm}= \hspace{-0.05cm}0.234375 ,\\ {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}3) & = 20/64 \hspace{-0.05cm}= \hspace{-0.05cm} 0.3125 .\end{align*}$$

These are symmetrical with respect to the abscissa value $\mu = I/2 = 3$.

$\text{Example 3:}$ Another example of the application of the binomial distribution is the Calculation of the block error probability in Digital Signal Transmission.

If one transmits blocks each of $I =10$ binary symbols over a channel

with probability $p = 0.01$ that one symbol is corrupted ⇒ random variable $e_i = 1$, and
correspondingly with probability $1 - p = 0.99$ for a uncorrupted symbol ⇒ random variable $e_i = 0$,

then the new random variable $f$ ⇒ "number of block error" is:

$$f=\sum_{i=1}^{I}e_i.$$

This random variable $f$ can take all integer values between $0$ (no symbol is corrupted) and $I$ (all symbols symbol are corrupted) .

We denote the probabilities for $\mu$ corruptions by $p_μ$.
The case where all $I$ symbols are correctly transmitted occurs with probability $p_0 = 0.99^{10} ≈ 0.9044$ .
This also follows from the binomial formula for $μ = 0$ considering the definition $10\, \text{ over }\, 0 = 1$.
A single symbol error $(f = 1)$ occurs with the following probability:

$$p_1 = \rm 10\cdot 0.01\cdot 0.99^9\approx 0.0914.$$

The first factor considers that there are exactly $10\, \text{ over }\, 1 = 10$ possibilities for the position of a single error.

The other two factors take into account that one symbol must be corrupted and nine must be transmitted correctly if $f =1$ is to hold.

For $f =2$ there are clearly more combinations, namely $10\, \text{ over }\, 2 = 45$, and we get

$$p_2 = \rm 45\cdot 0.01^2\cdot 0.99^8\approx 0.0041.$$

If a block code can correct up to two errors, the block error probability is

$$p_{\rm block} = \it p_{\rm 3} \rm +\hspace{0.1cm}\text{ ...} \hspace{0.1cm} \rm + \it p_{\rm 10}\approx \rm 10^{-4},$$

or

$$p_{\rm block} = \rm 1-\it p_{\rm 0}-\it p_{\rm 1}-p_{\rm 2}\approx \rm 10^{-4}.$$

One can see that for large values of $I$ the second possibility of calculation via the complement leads faster to the goal.
However, one could also consider that for these numerical values $p_{\rm block} ≈ p_3$ holds as an approximation.

Use the interactive HTML5/JS applet Binomial and Poisson distribution to find the binomial probabilities for any $I$ and $p$ .

Moments of the binomial distribution

You can calculate the moments in general using the equations in the chapters Moments of a discrete random variable and probabilities of the binomial distribution .

$\text{Calculation rules:} $ For the $k$-th order moment of a binomially distributed random variable, the general rule is:

$$m_k={\rm E}\big[z^k\big]=\sum_{\mu={\rm 0} }^{I}\mu^k\cdot{I \choose \mu}\cdot p\hspace{0.05cm}^\mu\cdot ({\rm 1}-p)\hspace{0.05cm}^{I-\mu}.$$

From this, after some transformations, we obtain for

the linear mean value:

$$m_1 ={\rm E}\big[z\big]= I\cdot p,$$

the rms value:

$$m_2 ={\rm E}\big[z^2\big]= (I^2-I)\cdot p^2+I\cdot p.$$

The variance and standard deviation are obtained by applyings "Steiner's theorem":

$$\sigma^2 = {m_2-m_1^2} = {I \cdot p\cdot (1-p)} \hspace{0.3cm}\Rightarrow \hspace{0.3cm} \sigma = \sqrt{I \cdot p\cdot (1-p)}.$$

The maximum variance $σ^2 = I/4$ is obtained for the "characteristic probability" $p = 1/2$. In this case, the probabilities are symmetric around the mean $m_1 = I/2 \ ⇒ \ p_μ = p_{I–μ}$.

The more the characteristic probability $p$ deviates from the value $1/2$ ,

the smaller is the standard deviation $σ$, and
the more asymmetric the probabilities become around the mean $m_1 = I · p$.

$\text{Example 4:}$ As in $\text{example 3}$ , we consider a block of $I =10$ binary symbols, each of which is independently corrupted with probability $p = 0.01$ . Then holds:

The mean number of block errors is equal to $m_f = {\rm E}\big[ f\big] = I · p = 0.1$.
Der standard deviation of the random variable $f$ is $σ_f = \sqrt{0.1 \cdot 0.99}≈ 0.315$.

In contrast, in the completely corrupted channel ⇒ falsification probability (*besseres Wort als falsification für Verfälschung?*) $p = 1/2$ results in the values

$m_f = 5$ ⇒ on average, five of the ten bits within a block are wrong,
$σ_f = \sqrt{I}/2 ≈1.581$ ⇒ maximum standard deviation $I = 10$.

Exercises for the chapter

Exercise 2.3: Algebraic Sum of Binary Numbers

Exercise 2.4: Number Lottery (6 from 49)

@@ Line 11: / Line 11: @@
 The&nbsp; '''binomial distribution'''&nbsp; represents an important special case for the occurrence probabilities of a discrete random variable.
+To derive the binomial distribution,&nbsp; we assume that&nbsp; $I$&nbsp; binary and statistically independent random variables&nbsp; $b_i$&nbsp; each can achieve
-To derive the binomial distribution, we assume that&nbsp; $I$&nbsp;binary and statistically independent random variables&nbsp; $b_i$&nbsp; each can achieve
 *the value&nbsp; $1$&nbsp; with probability&nbsp; ${\rm Pr}(b_i = 1) = p$,&nbsp; and
 *the value&nbsp;  $0$&nbsp; with probability&nbsp; ${\rm Pr}(b_i = 0) = 1-p$.
+Then the sum&nbsp; $z$&nbsp; is also a discrete random variable with the symbol set &nbsp; $\{0, \ 1, \ 2,\hspace{0.1cm}\text{ ...} \hspace{0.1cm}, \ I\}$,&nbsp; which is called binomially distributed:
-Then the sum&nbsp; $z$&nbsp; is also a discrete random variable with the symbol set &nbsp; $\{0, \ 1, \ 2,\hspace{0.1cm}\text{ ...} \hspace{0.1cm}, \ I\}$,&nbsp;, which is called binomially distributed:
 :$$z=\sum_{i=1}^{I}b_i.$$
-Thus, the symbol size is&nbsp; $M = I + 1.$ }}
+Thus,&nbsp; the symbol set size is&nbsp; $M = I + 1.$ }}
 {{GraueBox|TEXT=
 $\text{Example 1:}$&nbsp;
-The binomial distribution finds manifold applications in communications engineering as well as in other disciplines:
+The binomial distribution finds manifold applications in Communications Engineering as well as in other disciplines:
 #&nbsp; It describes the distribution of rejects in statistical quality control.
 #&nbsp; It allows the calculation of the residual error probability in blockwise coding.
-#&nbsp;Also the bit error rate of a digital transmission system obtained by simulation is actually a binomially distributed random quantity.
+#&nbsp;The bit error rate of a digital transmission system obtained by simulation is actually a binomially distributed random quantity.}}
-Probabilities of the binomial distribution.}}
 ==Probabilities of the binomial distribution==
@@ Line 38: / Line 35: @@
 For the&nbsp;  '''probabilities of the binomial distribution'''&nbsp;  with&nbsp;  $μ = 0, \hspace{0.1cm}\text{...} \hspace{0.1cm}, \ I$:
 :$$p_\mu = {\rm Pr}(z=\mu)={I \choose \mu}\cdot p\hspace{0.05cm}^\mu\cdot ({\rm 1}-p)\hspace{0.05cm}^{I-\mu}.$$
-The first term here indicates the number of combinations &nbsp; $($read:&nbsp;  $I\ \text{  over  }\ μ)$:
+*The first term here indicates the number of combinations &nbsp; $($read:&nbsp;  $I\ \text{  over  }\ μ)$:
 :$${I \choose \mu}=\frac{I !}{\mu !\cdot (I-\mu) !}=\frac{ {I\cdot (I- 1) \cdot \ \cdots \ \cdot (I-\mu+ 1)} }{ 1\cdot  2\cdot \ \cdots \ \cdot   \mu}.$$}}
-''Additional notes:''
+Additional notes:
-*For very large values of&nbsp;  $I$&nbsp;, the binomial distribution can be approximated by the&nbsp;  [[Theory_of_Stochastic_Signals/Poisson_Distribution|Poisson distribution]]&nbsp; described in the next section.
+*For very large values of&nbsp;  $I$,&nbsp; the binomial distribution can be approximated by the&nbsp;  [[Theory_of_Stochastic_Signals/Poisson_Distribution|Poisson distribution]]&nbsp; described in the next section.
-*If at the same time the product&nbsp;  $I · p \gg 1$,&nbsp;, then according to&nbsp;  [https://en.wikipedia.org/wiki/De_Moivre%E2%80%93Laplace_theorem de Moivre–Laplace's (central limit) theorem]&nbsp; , the Poisson distribution&nbsp;  (and hence the binomial distribution))&nbsp; transitions to a discrete&nbsp; [[Theory_of_Stochastic_Signals/Gaussian_Distributed_Random_Variables|Gaussian distribution]]&nbsp;.
+*If at the same time the product&nbsp;  $I · p \gg 1$,&nbsp; then according to&nbsp;  [https://en.wikipedia.org/wiki/De_Moivre%E2%80%93Laplace_theorem de Moivre–Laplace's (central limit) theorem],&nbsp; the Poisson distribution&nbsp;  (and hence the binomial distribution)&nbsp; transitions to a discrete&nbsp; [[Theory_of_Stochastic_Signals/Gaussian_Distributed_Random_Variables|Gaussian distribution]].
-[[File:P_ID203__Sto_T_2_3_S2_neu.png |frame| Probabilites of the binomial distribution]]
 {{GraueBox|TEXT=
 $\text{Example 2:}$&nbsp;
-The graph shows the probabilities of the binomial distribution are for&nbsp; $I =6$&nbsp;  and&nbsp; $p =0.4$.&nbsp; Thus&nbsp; $M = I+1=7$&nbsp; probabilities are different from zero.
+The graph shows the probabilities of the binomial distribution are for&nbsp; $I =6$&nbsp;  and&nbsp; $p =0.4$.&nbsp;
+[[File:P_ID203__Sto_T_2_3_S2_neu.png |frame|Binomial distribution probabilites]]
+*Thus&nbsp; $M = I+1=7$&nbsp; probabilities are different from zero.
-In contrast, for&nbsp; $I = 6$&nbsp; and&nbsp; $p = 0.5$, the binomial probabilities are as follows:
+*In contrast,&nbsp; for&nbsp; $I = 6$&nbsp; and&nbsp; $p = 0.5$,&nbsp; the probabilities of the binomial distribution are as follows:
 :$$\begin{align*}{\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}0)  & =  {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}6)\hspace{-0.05cm} =\hspace{-0.05cm} 1/64\hspace{-0.05cm} = \hspace{-0.05cm}0.015625 ,\\ {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}1)  & =  {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}5) \hspace{-0.05cm}= \hspace{-0.05cm}6/64 \hspace{-0.05cm}=\hspace{-0.05cm} 0.09375,\\ {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}2)  & =  {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}4)\hspace{-0.05cm} = \hspace{-0.05cm}15/64 \hspace{-0.05cm}= \hspace{-0.05cm}0.234375 ,\\ {\rm Pr}(z\hspace{-0.05cm} =\hspace{-0.05cm}3)  & =  20/64 \hspace{-0.05cm}= \hspace{-0.05cm} 0.3125 .\end{align*}$$
-These are symmetrical with respect to the abscissa value&nbsp;  $\mu = I/2 = 3$.}}
+*These are symmetrical with respect to the abscissa value&nbsp;  $\mu = I/2 = 3$.}}
-Another example of the application of the binomial distribution is the&nbsp; '''calculation of the block error probability in digital transmission'''.
+{{GraueBox|TEXT=
+$\text{Example 3:}$&nbsp; Another example of the application of the binomial distribution is the&nbsp; '''Calculation of the block error probability in Digital Signal Transmission'''.
-{{GraueBox|TEXT=
+If one transmits blocks each of&nbsp; $I =10$&nbsp; binary symbols over a channel
-$\text{Example 3:}$&nbsp;
+*with probability&nbsp; $p = 0.01$&nbsp; that one symbol is corrupted &nbsp; &rArr; &nbsp; random variable&nbsp; $e_i = 1$,&nbsp; and
-If one transmits blocks of&nbsp; $I =10$&nbsp; binary symbols each over a channel which is
+*correspondingly with probability&nbsp; $1 - p = 0.99$&nbsp; for a uncorrupted  symbol &nbsp; &rArr; &nbsp; random variable&nbsp; $e_i = 0$,
-*with probability&nbsp; $p = 0.01$&nbsp; one symbol is corrupted &nbsp; &rArr; &nbsp; random variable&nbsp; $e_i = 1$,&nbsp; and
-*correspondingly with probability&nbsp; $1 - p = 0.99$&nbsp; transmits the symbol uncorrupted &nbsp; &rArr; &nbsp; random variable&nbsp; $e_i = 0$,
-then the new random variable&nbsp; $f$&nbsp;  ("block error") is:
+then the new random variable&nbsp; $f$ &nbsp; &rArr; &nbsp; "number of block error"&nbsp;  is:
 :$$f=\sum_{i=1}^{I}e_i.$$
-This random variable&nbsp; $f$&nbsp; can now take all integer values between&nbsp; $0$&nbsp; (no symbol corrupted)&nbsp; and&nbsp; $I$&nbsp; (all symbols incorrect)&nbsp;.&nbsp; We denote the probabilities for&nbsp; $\mu$&nbsp; corruptions by&nbsp; $p_μ$.
+This random variable&nbsp; $f$&nbsp; can take all integer values between&nbsp; $0$&nbsp; (no symbol is corrupted)&nbsp; and&nbsp; $I$&nbsp; (all symbols symbol are corrupted)&nbsp;.&nbsp;
-*The case where all&nbsp; $I$&nbsp; symbols are correctly transmitted occurs with probability&nbsp; $p_0 = 0.99^{10} ≈ 0.9044$&nbsp;. This also follows from the binomial formula for&nbsp; $μ = 0$&nbsp; considering definition&nbsp; $10\, \text{ over }\, 0 = 1$.
+*We denote the probabilities for&nbsp; $\mu$&nbsp; corruptions by&nbsp; $p_μ$.
+*The case where all&nbsp; $I$&nbsp; symbols are correctly transmitted occurs with probability&nbsp; $p_0 = 0.99^{10} ≈ 0.9044$&nbsp;.
+*This also follows from the binomial formula for&nbsp; $μ = 0$&nbsp; considering the definition&nbsp; $10\, \text{ over }\, 0 = 1$.
 *A single symbol error&nbsp; $(f = 1)$&nbsp; occurs with the following probability:
 :$$p_1 = \rm 10\cdot 0.01\cdot 0.99^9\approx 0.0914.$$
-:The first factor considers that there are exactly&nbsp; $10\, \text{ over }\, 1 = 10$&nbsp; possibilities for the position of a single error.&nbsp; The other two factors take into account that one symbol must be corrupted and nine must be transmitted correctly if&nbsp; $f =1$&nbsp; is to hold.
+::The first factor considers that there are exactly&nbsp; $10\, \text{ over }\, 1 = 10$&nbsp; possibilities for the position of a single error.&nbsp;
-*For&nbsp; $f =2$&nbsp; there are clearly more combinations, namely&nbsp; $10\, \text{ over }\, 2 = 45$,&nbsp; and we get
+::The other two factors take into account that one symbol must be corrupted and nine must be transmitted correctly if&nbsp; $f =1$&nbsp; is to hold.
+*For&nbsp; $f =2$&nbsp; there are clearly more combinations,&nbsp; namely&nbsp; $10\, \text{ over }\, 2 = 45$, &nbsp; and we get
 :$$p_2 = \rm 45\cdot 0.01^2\cdot 0.99^8\approx 0.0041.$$
-If a block code can correct up to two errors, the residual error probability is
+If a block code can correct up to two errors,&nbsp; the block error probability is
-:$$p_{\rm R} = \it p_{\rm 3} \rm +\hspace{0.1cm}\text{ ...} \hspace{0.1cm} \rm + \it p_{\rm 10}\approx \rm 10^{-4},$$
+:$$p_{\rm block} = \it p_{\rm 3} \rm +\hspace{0.1cm}\text{ ...} \hspace{0.1cm} \rm + \it p_{\rm 10}\approx \rm 10^{-4},$$
-oder
+or
-:$$p_{\rm R} = \rm 1-\it p_{\rm 0}-\it p_{\rm 1}-p_{\rm 2}\approx \rm 10^{-4}.$$
+:$$p_{\rm block} = \rm 1-\it p_{\rm 0}-\it p_{\rm 1}-p_{\rm 2}\approx \rm 10^{-4}.$$
-*One can see that the second possibility of calculation via the complement for large values of&nbsp; $I$&nbsp; leads faster to the goal.
+*One can see that for large values of&nbsp; $I$&nbsp; the second possibility of calculation via the complement leads faster to the goal.
-*However, one could also consider as an approximation that for these numerical values&nbsp; $p_{\rm R} ≈ p_3$&nbsp; holds. }}
+*However,&nbsp;  one could also consider that for these numerical values&nbsp; $p_{\rm block} ≈ p_3$&nbsp; holds  as an approximation. }}
-Use the interactive applet&nbsp; [[Applets:Binomial-_und_Poissonverteilung_(Applet)|Binomial and Poisson distribution]]&nbsp; to find the binomial probabilities for any&nbsp; $I$&nbsp; and&nbsp; $p$&nbsp;.
+Use the interactive HTML5/JS applet&nbsp; [[Applets:Binomial-_und_Poissonverteilung_(Applet)|Binomial and Poisson distribution]]&nbsp; to find the binomial probabilities for any&nbsp; $I$&nbsp; and&nbsp; $p$&nbsp;.