Difference between revisions of "Theory of Stochastic Signals/Probability Density Function"
Line 144: | Line 144: | ||
==Exercises for the chapter== | ==Exercises for the chapter== | ||
− | [[Aufgaben:Exercise_3.1:_cos²-PDF_and_PDF_with_Dirac_Functions|Exercise 3.1: | + | [[Aufgaben:Exercise_3.1:_cos²-PDF_and_PDF_with_Dirac_Functions|Exercise 3.1: Cosine-square PDF and PDF with Dirac Functions]] |
[[Aufgaben:Exercise_3.1Z:_Triangular_PDF|Exercise 3.1Z: Triangular PDF]] | [[Aufgaben:Exercise_3.1Z:_Triangular_PDF|Exercise 3.1Z: Triangular PDF]] |
Revision as of 15:24, 3 February 2022
Contents
# OVERVIEW OF THE THIRD MAIN CHAPTER #
We consider here continuous random variables, i.e., random variables which can assume infinitely many different values, at least in certain ranges of real numbers.
- Their applications in information and communication technology are manifold.
- They are used, among other things, for the simulation of noise signals and for the description of fading effects.
We restrict ourselves at first to the statistical description of the amplitude distribution. In detail, the following are treated:
- The relationship between »probability density function« (PDF) and »cumulative distribution function« (CDF);
- the calculation of »expected values and moments«;
- some »special cases« of continuous-value distributions:
- uniform distributed random variables,
- Gaussian distributed random variables,
- exponential distributed random variables,
- Laplace distributed random variables,
- Rayleigh distributed random variables,
- Rice distributed random variables,
- Cauchy distributed random variables;
- the »generation of continuous random variables« on a computer.
Inner statistical dependencies of the underlying processes are not considered here. For this, we refer to the following main chapters 4 and 5.
Properties of continuous random variables
In the second chapter it was shown that the amplitude distribution of a discrete random variable is completely determined by its M occurrence probabilities, where the level number M usually has a finite value.
Definition: By a »value-continuous random variable« is meant a random variable whose possible numerical values are uncountable ⇒ M→∞. In the following, we will often use the short form "continuous random variable".
Further it shall hold:
- In the following we denote continuous random variables (mostly) with x in contrast to the discrete random variables, which are denoted with z as before.
- No statement is made here about a possible time discretization, i.e., continuous random variables can be discrete in time.
- Further, we assume for this chapter that there are no statistical bindings between the individual samples x_ν, or at least leave them out of consideration.
\text{Example 1:} The graphic shows a section of a stochastic noise signal x(t) whose instantaneous value can be taken as a continuous random variable x.
- From the »probability density function» \rm (PDF) shown on the right, it can be seen that instantaneous values around the mean m_1 occur most frequently for this example signal.
- Since there are no statistical dependencies between the samples x_ν, such a signal is also referred to as »white noise«.
Definition of the probability density function
For a continuous random variable, the probabilities that it takes on quite specific values are zero. Therefore, to describe a continuous random variable, we must always refer to the "probability density function" \rm (PDF).
\text{Definition:} The value of the »probability density function« f_{x}(x) at location x_\mu is equal to the probability that the instantaneous value of the random variable x lies in an (infinitesimally small) interval of width Δx around x_\mu, divided by Δx:
- f_x(x=x_\mu) = \lim_{\rm \Delta \it x \hspace{0.05cm}\to \hspace{0.05cm}\rm 0}\frac{\rm Pr \{\it x_\mu-\rm \Delta \it x/\rm 2 \le \it x \le x_\mu \rm +\rm \Delta \it x/\rm 2\} }{\rm \Delta \it x}.
This extremely important descriptive variable has the following properties:
- Although from the time course in \text{Example 1} it can be seen that the most frequent signal components lie at x = m_1 and the PDF has its largest value here, for a continuous random variable the probability {\rm Pr}(x = m_1), that the instantaneous value is exactly equal to the mean m_1, is identically zero.
- For the probability that the random variable lies in the range between x_{\rm u} and x_{\rm o}:
- {\rm Pr}(x_{\rm u} \le x \le x_{\rm o})= \int_{x_{\rm u} }^{x_{\rm o} }f_{x}(x) \,{\rm d}x.
- As an important normalization property, this yields for the area under the PDF with the boundary transitions x_{\rm u} → \hspace{0.05cm} - \hspace{0.05cm} ∞ and x_{\rm o} → +∞:
- \int_{-\infty}^{+\infty} f_{x}(x) \,{\rm d}x = \rm 1.
- The corresponding equation for discrete-value, M-level random variables states that the sum over the M occurrence probabilities gives the value 1.
\text{Note on nomenclature:} In the literature, a distinction is often made between the random variable X and its realizations x ∈ X.
Thus, the above definition equation is
- f_{X}(X=x) = \lim_{ {\rm \Delta} x \hspace{0.05cm}\to \hspace{0.05cm} 0}\frac{ {\rm Pr} \{ x - {\rm \Delta} x/2 \le X \le x +{\rm \Delta} x/ 2\} }{ {\rm \Delta} x}.
We have largely dispensed with this more precise nomenclature in our learning tutorial \rm LNTwww so as not to use up two letters for one quantity.
- Lowercase letters (as x) often denote signals and uppercase letters (as X) the associated spectra in our case.
- Nevertheless, today (2017) we have to honestly admit that the 2001 decision was not entirely happy.
PDF definition for discrete random variables
For reasons of a uniform representation of all random variables (both discrete-value and continuous-value), it is convenient to define the probability density function also for discrete random variables.
\text{Definition:} Applying the definition equation of the last section to discrete random variables, the PDF takes infinitely large values at some points x_\mu due to the nonvanishingly small probability value and the limit transition  Δx → 0.
Thus, the PDF results in a sum of Dirac delta functions ⇒ "distributions":
- f_{x}(x)=\sum_{\mu=1}^{M}p_\mu\cdot {\rm \delta}( x-x_\mu).
The weights of these Dirac delta functions are equal to the probabilities p_\mu = {\rm Pr}(x = x_\mu).
Here is another note to help classify the different descriptive quantities for discrete and continuous random variables: Probability and probability density function are related in a similar way as in the book Signal Representation
- a discrete spectral component of a harmonic oscillation ⇒ line spectrum, and
- a continuous spectrum of an energy-limited (pulse-shaped) signal.
\text{Example 2:} Below is a section
- of a rectangular signal with three possible values,
- where the signal value 0 \ \rm V  occurs twice as often as the outer signal values (\pm 1 \ \rm V).
Thus, the corresponding PDF (values from top to bottom) is:
- f_{x}(x) = 0.25 \cdot \delta(x - {\rm 1 V})+ 0.5\cdot \delta(x) + 0.25\cdot \delta (x + 1\rm V).
For a more in-depth look at the topic covered here, we recommend the following (German language) learning video:
- Wahrscheinlichkeit und WDF ⇒ "Probability and probability density function"
Numerical determination of the PDF
You can see here a scheme for the numerical determination of the probability density function:
Assuming that the random variable x at hand has negligible values outside the range from x_{\rm min} = -4.02 to x_{\rm max} = +4.02, proceed as follows:
- Divide the range of x-values into I intervals of equal width Δx and define a field \text{PDF}[0 : I-1]. In the sketch I = 201 and accordingly Δx = 0.04 is chosen.
- The random variable x is now called N times in succession, each time checking to which interval i_{\rm act} the current random variable x_{\rm act} belongs:
i_{\rm act} = ({\rm int})((x + x_{\rm max})/Δx). - The corresponding field element PDF( i_{\rm act}) is then incremented by 1.
- After N iterations, \text{PDF}[i_{\rm act}] then contains the number of random numbers belonging to the interval i_{\rm act}.
- The actual PDF values are obtained if, at the end, all field elements \text{PDF}[i] with 0 ≤ i ≤ I-1 are still divided by N \cdot Δx.
\text{Example 3:} From the drawn green arrows in the graph above, one can see:
- The value x_{\rm act} = 0.07 leads to the result i_{\rm act} = (int) ((0.07 + 4.02)/0.04) = (int) 102.25.
- Here "(int)" means an integer conversion after float division ⇒ i_{\rm act} = 102.
- The same interval i_{\rm act} = 102 results for 0.06 < x_{\rm act} < 0.10, so for example also for x_{\rm act} = 0.09.
Exercises for the chapter
Exercise 3.1: Cosine-square PDF and PDF with Dirac Functions