# OVERVIEW OF THE THIRD MAIN CHAPTER #
We consider here continuous random variables, i.e., random variables which can assume infinitely many different values, at least in certain ranges of real numbers.
- Their applications in information and communication technology are manifold.
- They are used, among other things, for the simulation of noise signals and for the description of fading effects.
We restrict ourselves at first to the statistical description of the amplitude distribution. In detail, the following are treated:
- The relationship between »probability density function« $\rm (PDF)$ and »cumulative distribution function« $\rm (CDF)$;
- the calculation of »expected values and moments«;
- some »special cases«:
- uniformly distributed random variables,
- Gaussian distributed random variables,
- exponentially distributed random variables,
- Laplace distributed random variables,
- Rayleigh distributed random variables,
- Rice distributed random variables,
- Cauchy distributed random variables;
- the »generation of continuous random variables« on a computer.
Internal statistical dependencies of the underlying processes are not considered here. For these, we refer to the subsequent main chapters 4 and 5.
# Properties of continuous random variables #
In the second chapter it was shown that the amplitude distribution of a discrete random variable is completely determined by its $M$ occurrence probabilities, where the number of levels $M$ usually has a finite value.
Now we consider continuous-valued random variables, by which we mean random variables whose possible numerical values are uncountable ⇒ $M \to \infty$.
Furthermore, the following shall hold:
- In the following we denote continuous random variables (mostly) by $x$, in contrast to the discrete random variables, which, as before, are denoted by $z$.
- No statement is made here about a possible time discretization, i.e., continuous random variables can be discrete in time.
- Further, we assume for this chapter that there are no statistical dependencies between the individual samples $x_ν$, or at least leave them out of consideration.
The following image shows a section of a stochastic noise signal $x(t)$ whose instantaneous value can be taken as a continuous random variable $x$.
From the probability density function (PDF) shown on the right, it can be seen that instantaneous values around the mean $m_1$ occur most frequently for this example signal. Since there are no statistical dependencies between the samples $x_ν$, such a signal is also referred to as white noise.
# Definition of the probability density function #
For a continuous random variable, the probabilities that it takes on quite specific values are identically zero. Therefore, to describe a continuous random variable, we must always refer to the probability density function, abbreviated PDF.
The value of the probability density function $f_{x}(x)$ at location $x_\mu$ is equal to the probability that the instantaneous value of the random variable $x$ lies in an (infinitesimally small) interval of width $\Delta x$ around $x_\mu$, divided by $\Delta x$: $$f_x(x=x_\mu) = \lim_{\Delta x \hspace{0.05cm}\to \hspace{0.05cm} 0}\frac{{\rm Pr} \{x_\mu - \Delta x/2 \le x \le x_\mu + \Delta x/2\}}{\Delta x}.$$
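The limit transition in this definition can be made tangible with a small simulation. The following Python sketch is only an illustration under our own assumptions (a standard Gaussian random source, $10^6$ samples, the chosen interval widths): it estimates the PDF value at $x_\mu = 0$ as a relative frequency divided by $\Delta x$, for shrinking $\Delta x$.

```python
import numpy as np

# Minimal sketch: approximate f_x(x_mu) = Pr{x_mu - dx/2 <= x <= x_mu + dx/2} / dx
# for shrinking dx.  A standard Gaussian source is assumed for illustration.
rng = np.random.default_rng(seed=1)
x = rng.standard_normal(10**6)                 # N = 10^6 samples of x
x_mu = 0.0                                     # location of the PDF estimate

for dx in (1.0, 0.1, 0.01):
    pr = np.mean(np.abs(x - x_mu) <= dx / 2)   # relative frequency ≈ Pr{...}
    print(f"dx = {dx:4.2f}:  Pr/dx = {pr / dx:.4f}")

# The quotient approaches the true value f_x(0) = 1/sqrt(2*pi) ≈ 0.3989.
```

For $\Delta x = 1$ the quotient is still visibly too small, because the PDF is curved within the interval; only in the limit $\Delta x \to 0$ does the quotient converge to the PDF value.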
This extremely important descriptive variable has the following properties:
- Although the time course in the last subchapter shows that the most frequent signal components lie at $x = m_1$ and the probability density function has its largest value there, the probability ${\rm Pr}(x = m_1)$ that the instantaneous value is exactly equal to the mean $m_1$ is nevertheless identically zero.
- The probability that the random variable lies in the range between $x_{\rm u}$ and $x_{\rm o}$ is given by the integral over the PDF (see the numerical check after this list):
- $${\rm Pr}(x_{\rm u} \le x \le x_{\rm o}) = \int_{x_{\rm u}}^{x_{\rm o}} f_{x}(x) \,{\rm d}x.$$
- As an important normalization property, this yields for the area under the PDF, with the boundary transitions $x_{\rm u} \to -\infty$ and $x_{\rm o} \to +\infty$:
- $$\int_{-\infty}^{+\infty} f_{x}(x) \,{\rm d}x = \rm 1.$$
- The corresponding equation for discrete-valued, $M$-level random variables states that the sum over the $M$ occurrence probabilities equals $1$.
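Both integral properties can be checked numerically. The following Python sketch again assumes a standard Gaussian PDF as the example density (this choice and the integration grid are our assumptions, not part of the text) and evaluates the total area as well as ${\rm Pr}(-1 \le x \le +1)$ by trapezoidal integration.

```python
import numpy as np

# Minimal sketch, assuming a standard Gaussian as the example PDF:
# verify the normalization of the total area to 1 and compute
# Pr(x_u <= x <= x_o) as an integral over the PDF.
def f_x(x):
    return np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)

x = np.linspace(-8.0, 8.0, 100001)      # grid covering the effective support

print(np.trapz(f_x(x), x))              # total area -> 1.000000

x_u, x_o = -1.0, 1.0                    # lower and upper integration limits
mask = (x >= x_u) & (x <= x_o)
print(np.trapz(f_x(x[mask]), x[mask]))  # Pr(-1 <= x <= +1) -> approx. 0.6827
```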
Note on nomenclature: In the literature, one usually distinguishes between the random variable $X$ and its realizations $x ∈ X$. With this more precise nomenclature, the above definition equation reads
$$f_{X}(X=x) = \lim_{{\rm \Delta} x \hspace{0.05cm}\to \hspace{0.05cm} 0}\frac{{\rm Pr} \{ x-{\rm \Delta} x/2 \le X \le x +{\rm \Delta} x/ 2\}}{{\rm \Delta} x}.$$
We have largely dispensed with this more precise nomenclature in our learning tutorial so as not to use up two letters for one variable: in our case, lowercase letters (like $x$) often denote signals and uppercase letters (like $X$) the associated spectra. Nevertheless, today (2017) we must honestly admit that the decision made in 2001 was not entirely fortunate.
# PDF definition for discrete random variables #
For reasons of a uniform representation of all random variables (both discrete-valued and continuous-valued), it is convenient to define the probability density function also for discrete random variables. Applying the definition equation of the last subchapter to discrete random variables, the PDF takes on infinitely large values at some points $x_\mu$ because of the non-vanishing probabilities there and the limit transition $\Delta x \to 0$. The PDF thus results in a sum of Dirac delta functions (more precisely: distributions): $$f_{x}(x)=\sum_{\mu=1}^{M}p_\mu\cdot \delta( x-x_\mu).$$
The weights of these Dirac delta functions are equal to the probabilities $p_\mu = {\rm Pr}(x = x_\mu)$.
Here is another note to help classify the different descriptive quantities for discrete and continuous random variables. Probability and probability density function are related to each other in a similar way as, in the book Signal Representation,
- a discrete spectral component of a harmonic oscillation ⇒ line spectrum, and
- a continuous spectrum of an energy-limited (pulse-shaped) signal.
Below is a section of a rectangular signal with three possible values, where the signal value $0 \ \rm V$ occurs twice as often as each of the outer signal values ($\pm 1 \ \rm V$).
Thus, the corresponding PDF is: $$f_{x}(x) = 0.25 \cdot \delta(x - 1\,{\rm V}) + 0.5 \cdot \delta(x) + 0.25 \cdot \delta(x + 1\,{\rm V}).$$
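The Dirac weights can be reproduced by simulation. The following minimal Python sketch (the random generator and the sample size are our assumptions) draws samples of this three-level signal and outputs the relative frequencies, which approach $0.25$, $0.5$ and $0.25$.

```python
import numpy as np

# Minimal sketch: sample the three-level signal from the example above
# (values -1 V, 0 V, +1 V with probabilities 0.25 / 0.5 / 0.25) and check
# that the relative frequencies match the Dirac weights of the PDF.
rng = np.random.default_rng(seed=1)
samples = rng.choice([-1.0, 0.0, 1.0], size=10**6, p=[0.25, 0.5, 0.25])

for value in (-1.0, 0.0, 1.0):
    print(f"Pr(x = {value:+.0f} V) ≈ {np.mean(samples == value):.4f}")
```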
# Numerical determination of the PDF #
The graph shows a scheme for the numerical determination of the probability density function.
Assuming that the random variable $x$ at hand takes values outside the range from $x_{\rm min} = -4.02$ to $x_{\rm max} = +4.02$ only with negligible probability, one proceeds as follows:
- Divide the range of values of $x$ into $I$ intervals of equal width $\Delta x$ and define a field $\text{WDF}[0 : I-1]$ ($\text{WDF}$ is the German abbreviation for PDF). In the sketch above, $I = 201$ and accordingly $\Delta x = 0.04$ are chosen.
- The random variable $x$ is now generated $N$ times in succession, each time checking to which interval $i_{\rm akt}$ the current random value $x_{\rm akt}$ belongs: $i_{\rm akt} = ({\rm int})\big((x_{\rm akt} + x_{\rm max})/\Delta x\big)$.
- The corresponding field element $\text{WDF}[i_{\rm akt}]$ is then incremented by $1$. After $N$ iterations, $\text{WDF}[i_{\rm akt}]$ contains the number of random values belonging to the interval $i_{\rm akt}$.
- The actual PDF values are obtained if, at the end, all field elements $\text{WDF}[i]$ with $0 ≤ i ≤ I-1$ are divided by $N \cdot \Delta x$ (see the code sketch after the following example).
From the green arrows drawn in the graph above, one can see:
- The value $x_{\rm akt} = 0.07$ leads to the result $i_{\rm akt} = {\rm (int)}\,\big((0.07 + 4.02)/0.04\big) = {\rm (int)}\,102.25$. Here $\rm (int)$ denotes an integer conversion after float division ⇒ $i_{\rm akt} = 102$.
- The same interval $i_{\rm akt} = 102$ results for $0.06 \le x_{\rm akt} < 0.10$, so for example also for $x_{\rm akt} = 0.09$.
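The complete scheme fits into a few lines of Python. In the following sketch the field name $\text{WDF}$ and the parameters $I = 201$, $x_{\rm max} = 4.02$ and $\Delta x = 0.04$ follow the text; the Gaussian random source and the seed are our assumptions, since the text leaves the random generator open.

```python
import numpy as np

# Minimal sketch of the scheme described above.  The Gaussian source is an
# assumption; any other continuous random generator works the same way.
I, x_max = 201, 4.02
dx = 2 * x_max / I                       # interval width, here 0.04
WDF = np.zeros(I)                        # field WDF[0 : I-1], one counter each

N = 10**5
rng = np.random.default_rng(seed=1)
for x_akt in rng.standard_normal(N):     # call the random variable N times
    i_akt = int((x_akt + x_max) / dx)    # interval index of the current value
    if 0 <= i_akt < I:                   # values outside the range are negligible
        WDF[i_akt] += 1                  # increment the matching field element

WDF /= N * dx                            # normalize the counts to PDF values

# e.g. the estimate around x = 0 (interval i = 100) vs. the true PDF value:
print(WDF[100], 1 / np.sqrt(2 * np.pi))
```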
# Exercises for the chapter #
Exercise 3.1: cos²-PDF and PDF with Dirac Functions