Difference between revisions of "Aufgaben:Exercise 3.10Z: Maximum Likelihood Decoding of Convolutional Codes"

From LNTwww
 
(16 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{quiz-Header|Buchseite=Kanalcodierung/Decodierung von Faltungscodes}}
+
{{quiz-Header|Buchseite=Channel_Coding/Decoding_of_Convolutional_Codes}}
  
[[File:P_ID2678__KC_Z_3_10.png|right|frame|Gesamtsystemmodell <br>Coder &ndash; Kanal &ndash; Viterbi]]
+
[[File:EN_KC_Z_3_10.png|right|frame|Overall system model,&nbsp; given for this exercise]]
Der Viterbi&ndash;Algorithmus stellt die bekannteste Realisierungsform für die Maximum&ndash;Likelihood&ndash;Decodierung eines Faltungscodes dar. Wir gehen hier von folgendem Modell aus:
+
The Viterbi algorithm represents the best known realization form for the maximum likelihood decoding of a convolutional code.&nbsp; We assume here the following model:
* Die Informationssequenz&nbsp; $\underline{u}$&nbsp; wird durch einen Faltungscode in die Codesequenz&nbsp; $\underline{x}$&nbsp; umgesetzt. Es gelte&nbsp; $u_i &#8712; \{0, \, 1\}$. Dagegen werden die Codesymbole bipolar dargestellt &nbsp; &#8658; &nbsp; $x_i &#8712; \{&ndash;1, \, +1\}$.
+
* The information sequence&nbsp; $\underline{u}$&nbsp; is converted into the code sequence&nbsp; $\underline{x}$&nbsp; by a convolutional code.  
* Der Kanal sei durch das&nbsp; [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#Binary_Symmetric_Channel_.E2.80.93_BSC| BSC&ndash;Modell]]&nbsp; gegeben &nbsp; &#8658; &nbsp; $y_i &#8712; \{&ndash;1, \, +1\}$&nbsp; oder es wird der&nbsp; [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#AWGN.E2.80.93Kanal_bei_bin.C3.A4rem_Eingang| AWGN&ndash;Kanal]]&nbsp; vorausgesetzt &nbsp; &#8658; &nbsp; reellwertige Empfangswerte&nbsp; $y_i$.
+
 
* Bei gegebener Empfangssequenz&nbsp; $\underline{y}$&nbsp; entscheidet sich der Viterbi&ndash;Algorithmus für die Codesequenz&nbsp; $\underline{z}$&nbsp; gemäß
+
*It is valid&nbsp; $u_i &#8712; \{0, \, 1\}$.&nbsp; In contrast,&nbsp; the code symbols are represented bipolar &nbsp; &#8658; &nbsp; $x_i &#8712; \{&ndash;1, \, +1\}$.
 +
 
 +
* Let the channel be given by the models&nbsp; [[Channel_Coding/Channel_Models_and_Decision_Structures#Binary_Symmetric_Channel_.E2.80.93_BSC|$\text{BSC}$]]&nbsp; &nbsp; &#8658; &nbsp; received values&nbsp; $y_i &#8712; \{&ndash;1, \, +1\}$&nbsp; or&nbsp; [[Channel_Coding/Channel_Models_and_Decision_Structures#AWGN_channel_at_binary_input| $\text{AWGN}$]] &nbsp; &#8658; $y_i$&nbsp; real-valued.
 +
 
 +
* Given a received sequence&nbsp; $\underline{y}$&nbsp; the Viterbi algorithm decides for the sequence&nbsp; $\underline{z}$&nbsp; according to
 
:$$\underline{z} = {\rm arg} \max_{\underline{x}_{\hspace{0.03cm}i} \hspace{0.03cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} {\rm Pr}( \underline{x}_{\hspace{0.03cm}i} |\hspace{0.05cm} \underline{y} ) \hspace{0.05cm}.$$
 
:$$\underline{z} = {\rm arg} \max_{\underline{x}_{\hspace{0.03cm}i} \hspace{0.03cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} {\rm Pr}( \underline{x}_{\hspace{0.03cm}i} |\hspace{0.05cm} \underline{y} ) \hspace{0.05cm}.$$
  
*Dies entspricht dem&nbsp; [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#Maximum-a-posteriori.E2.80.93_und_Maximum-Likelihood.E2.80.93Kriterium| Maximum&ndash;a&ndash;posteriori]]&nbsp; (MAP)&ndash;Kriterium. Sind alle Informationssequenzen&nbsp; $\underline{u}$&nbsp; gleichwahrscheinlich, so geht dieses in das etwas einfachere&nbsp; [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#Maximum-a-posteriori.E2.80.93_und_Maximum-Likelihood.E2.80.93Kriterium| Maximum&ndash;Likelihood&ndash;Kriterium]]&nbsp; über:
+
*This corresponds to the&nbsp; [[Channel_Coding/Channel_Models_and_Decision_Structures#Criteria_.C2.BBMaximum-a-posteriori.C2.AB_and_.C2.BBMaximum-Likelihood.C2.AB| "Maximum a posteriori"]]&nbsp; $\rm (MAP)$&nbsp; criterion.&nbsp; If all information sequences&nbsp; $\underline{u}$&nbsp; are equally likely,&nbsp; this transitions to the somewhat simpler&nbsp; [[Channel_Coding/Channel_Models_and_Decision_Structures#Criteria_.C2.BBMaximum-a-posteriori.C2.AB_and_.C2.BBMaximum-Likelihood.C2.AB| "Maximum likelihood criterion"]]&nbsp; $\rm (ML)$:  
 
:$$\underline{z} = {\rm arg} \max_{\underline{x}_{\hspace{0.03cm}i} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} {\rm Pr}( \underline{y}  \hspace{0.05cm}|\hspace{0.05cm} \underline{x}_{\hspace{0.03cm}i} ) \hspace{0.05cm}.$$
 
:$$\underline{z} = {\rm arg} \max_{\underline{x}_{\hspace{0.03cm}i} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} {\rm Pr}( \underline{y}  \hspace{0.05cm}|\hspace{0.05cm} \underline{x}_{\hspace{0.03cm}i} ) \hspace{0.05cm}.$$
  
*Als weiteres Ergebnis gibt der Viterbi&ndash;Algorithmus zusätzlich die Sequenz&nbsp; $\underline{v}$&nbsp; als Schätzung für die Informationssequenz&nbsp; $\underline{u}$&nbsp; aus.
+
*As a further result,&nbsp; the Viterbi algorithm additionally outputs the sequence&nbsp; $\underline{v}$&nbsp; as an estimate for the information sequence&nbsp; $\underline{u}$.
  
  
In dieser Aufgabe soll der Zusammenhang zwischen der&nbsp; [[Kanalcodierung/Zielsetzung_der_Kanalcodierung#Einige_wichtige_Definitionen_zur_Blockcodierung| Hamming&ndash;Distanz]]&nbsp; $d_{\rm H}(\underline{x}, \, \underline{y})$&nbsp; sowie der&nbsp; [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#Maximum-Likelihood.E2.80.93Entscheidung_beim_AWGN.E2.80.93Kanal| Euklidischen Distanz]]
+
In this exercise,&nbsp; you should determinethe relationship between the&nbsp; [[Channel_Coding/Objective_of_Channel_Coding#Important_definitions_for_block_coding| $\text{Hamming distance}$]]&nbsp; $d_{\rm H}(\underline{x}, \, \underline{y})$&nbsp; and the&nbsp; [[Channel_Coding/Channel_Models_and_Decision_Structures#Maximum-likelihood_decision_at_the_AWGN_channel|$\text{Euclidean distance}$]]
 
:$$d_{\rm E}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =
 
:$$d_{\rm E}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =
\sqrt{\sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2}\hspace{0.05cm}$$
+
\sqrt{\sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2}\hspace{0.05cm}.$$
  
ermittelt werden. Anschließend ist das obige ML&ndash;Kriterium  zu formulieren mit
+
Then,&nbsp; the above maximum likelihood criterion is to be formulated with
* der Hamming&ndash;Distanz&nbsp; $d_{\rm H}(\underline{x}, \, \underline{y})$,
+
* the Hamming distance&nbsp; $d_{\rm H}(\underline{x}, \, \underline{y})$,
* der Euklidischen Distanz&nbsp; $d_{\rm E}(\underline{x}, \, \underline{y})$, und
 
* dem&nbsp; [[Kanalcodierung/Decodierung_von_Faltungscodes#Zusammenhang_zwischen_Hamming.E2.80.93Distanz_und_Korrelation| Korrelationswert]]&nbsp; $&#9001; x \cdot y &#9002;$.
 
  
 +
* the Euclidean distance&nbsp; $d_{\rm E}(\underline{x}, \, \underline{y})$,&nbsp; and
  
 +
* the&nbsp; [[Channel_Coding/Decoding_of_Convolutional_Codes#Relationship_between_Hamming_distance_and_correlation|$\text{correlation value}$]]&nbsp; $&#9001; x \cdot y &#9002;$.
  
  
Line 30: Line 34:
  
  
''Hinweise:''
 
* Die Aufgabe bezieht sich auf das Kapitel&nbsp; [[Kanalcodierung/Decodierung_von_Faltungscodes| Decodierung von Faltungscodes]].
 
* Bezug genommen wird insbesondere auf  die  Seite&nbsp; [[Kanalcodierung/Decodierung_von_Faltungscodes#Viterbi.E2.80.93Algorithmus.2C_basierend_auf_Korrelation_und_Metriken|Viterbi&ndash;Algorithmus &ndash; basierend auf Korrelation und Metriken]].
 
* Zur Vereinfachung wird auf &bdquo;Tilde&rdquo; und &bdquo;Apostroph&rdquo; verzichtet.
 
* Weitere Informationen zu diesem Thema finden Sie auf folgenden Seiten dieses Buches:
 
** [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#Maximum-a-posteriori.E2.80.93_und_Maximum-Likelihood.E2.80.93Kriterium| MAP&ndash; und ML&ndash;Kriterium]],
 
** [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#Maximum-Likelihood.E2.80.93Entscheidung_beim_BSC.E2.80.93Kanal| ML&ndash;Entscheidung beim BSC&ndash;Kanal]],
 
** [[Kanalcodierung/Kanalmodelle_und_Entscheiderstrukturen#Maximum-Likelihood.E2.80.93Entscheidung_beim_AWGN.E2.80.93Kanal| ML&ndash;Entscheidung beim AWGN&ndash;Kanal]],
 
** [[Kanalcodierung/Decodierung_linearer_Blockcodes#Blockschaltbild_und_Voraussetzungen| Decodierung linearer Blockcodes]].
 
  
  
 +
<u>Hints:</u>
 +
* This exercise refers to the chapter&nbsp; [[Channel_Coding/Decoding_of_Convolutional_Codes| "Decoding of Convolutional Codes"]].
 +
 +
* Reference is made in particular to the section&nbsp; [[Channel_Coding/Decoding_of_Convolutional_Codes#Viterbi_algorithm_based_on_correlation_and_metrics|"Viterbi algorithm &ndash; based on correlation and metrics"]].
  
 +
* For simplicity,&nbsp; "tilde"&nbsp; and&nbsp; "apostrophe"&nbsp; are omitted.
  
===Fragebogen===
+
* For more information on this topic,&nbsp; see the following sections in this book:
 +
:* [[Channel_Coding/Channel_Models_and_Decision_Structures#Criteria_.C2.BBMaximum-a-posteriori.C2.AB_and_.C2.BBMaximum-Likelihood.C2.AB| "MAP and ML criterion"]],
 +
 
 +
:* [[Channel_Coding/Channel_Models_and_Decision_Structures#Maximum-likelihood_decision_at_the_BSC_channel| "ML decision at the BSC channel"]],
 +
 
 +
:* [[Channel_Coding/Channel_Models_and_Decision_Structures#Maximum-likelihood_decision_at_the_AWGN_channel| "ML decision at the AWGN channel"]],
 +
 
 +
:* [[Channel_Coding/Decoding_of_Linear_Block_Codes#Block_diagram_and_requirements| "Decoding linear block codes"]].
 +
 
 +
 
 +
 
 +
===Questions===
 
<quiz display=simple>
 
<quiz display=simple>
{Wie hängen $d_{\rm H}(\underline{x}, \, \underline{y})$ und $d_{\rm E}(\underline{x}, \, \underline{y})$ beim BSC&ndash;Modell zusammen?
+
{How are &nbsp; $d_{\rm H}(\underline{x}, \, \underline{y})$ &nbsp; and &nbsp; $d_{\rm E}(\underline{x}, \, \underline{y})$ &nbsp; related in the BSC model?
 
|type="()"}
 
|type="()"}
- Es gilt $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}(\underline{x}, \, \underline{y})$.
+
- &nbsp; $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}(\underline{x}, \, \underline{y})$&nbsp; is valid.
- Es gilt $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}^2(\underline{x}, \, \underline{y})$.
+
- &nbsp; $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}^2(\underline{x}, \, \underline{y})$&nbsp; is valid.
+ Es gilt $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}^2(\underline{x}, \, \underline{y})/4$.
+
+ &nbsp; $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}^2(\underline{x}, \, \underline{y})/4$&nbsp; is valid.
  
{Welche der Gleichungen beschreiben die ML&ndash;Decodierung beim BSC&ndash;Modell? Die Minimierung/Maximierung bezieht sich jeweils auf alle $\underline{x} &#8712;\mathcal{ C}$.
+
{Which of the equations describe the maximum likelihood decoding in the BSC model?&nbsp; The minimization/maximization refers alwaysto all&nbsp; $\underline{x} &#8712;\mathcal{ C}$.
 
|type="[]"}
 
|type="[]"}
 
+ $\underline{z} = \arg \min {d_{\rm H}(\underline{x}, \, \underline{y})}$,
 
+ $\underline{z} = \arg \min {d_{\rm H}(\underline{x}, \, \underline{y})}$,
Line 57: Line 68:
 
+ $\underline{z} = \arg \min {d_{\rm E}^2(\underline{x}, \, \underline{y})}$,
 
+ $\underline{z} = \arg \min {d_{\rm E}^2(\underline{x}, \, \underline{y})}$,
  
{Welche Gleichung beschreibt die ML&ndash;Entscheidung beim BSC&ndash;Modell?
+
{Which equation describes the maximum likelihood decision in the BSC model?
 
|type="()"}
 
|type="()"}
 
- $\underline{z} = \arg \min &#9001; \underline{x} \cdot \underline{y} &#9002;$,
 
- $\underline{z} = \arg \min &#9001; \underline{x} \cdot \underline{y} &#9002;$,
 
+ $\underline{z} = \arg \max &#9001; \underline{x} \cdot \underline{y} &#9002;$.
 
+ $\underline{z} = \arg \max &#9001; \underline{x} \cdot \underline{y} &#9002;$.
  
{Welche Gleichungen gelten für die ML&ndash;Entscheidung beim AWGN?
+
{What equations apply to the maximum likelihood decision in the AWGN model?
 
|type="[]"}
 
|type="[]"}
 
- $\underline{z} = \arg \min {d_{\rm H}(\underline{x}, \, \underline{y})}$,
 
- $\underline{z} = \arg \min {d_{\rm H}(\underline{x}, \, \underline{y})}$,
Line 69: Line 80:
 
</quiz>
 
</quiz>
  
===Musterlösung===
+
===Solution===
 
{{ML-Kopf}}
 
{{ML-Kopf}}
'''(1)'''&nbsp; Richtig ist der <u>Lösungsvorschlag 3</u>:
+
'''(1)'''&nbsp; Correct is the&nbsp; <u>proposed solution 3</u>:
*Die zwei Binärfolgen seien $\underline{x}$ und $\underline{y}$ mit $x_i &#8712; \{-1, \, +1\}, \ y_i &#8712; \{-1, \, +1\}$. Die Folgenlänge sei jeweils $L$.
+
*Let the two binary sequences be &nbsp; $\underline{x}$ &nbsp; and &nbsp; $\underline{y}$ &nbsp; with &nbsp; $x_i &#8712; \{-1, \, +1\}, \ y_i &#8712; \{-1, \, +1\}$.&nbsp; Let the sequence length be&nbsp; $L$&nbsp; in each case.
*Die Hamming&ndash;Distanz $d_{\rm H}(\underline{x}, \, \underline{y})$ gibt die Anzahl der Bit an, in denen sich $\underline{x}$ und $\underline{y}$ unterscheiden, für die also $x_i \, - y_i = &plusmn;2$ &nbsp; &#8658; &nbsp; $ (x_i \, - y_i)^2 = 4$ gilt.  
+
 
*Gleiche Symbole $(x_i = y_i)$ tragen zur Hamming&ndash;Distanz nicht bei und ergeben $(x_i \, &ndash; y_i)^2 = 0$. Entsprechend dem <u>Lösungsvorschlag 3</u> kann daher geschrieben werden:
+
*The Hamming distance &nbsp; $d_{\rm H}(\underline{x}, \, \underline{y})$ &nbsp; gives the number of bits in which&nbsp; $\underline{x}$&nbsp; and&nbsp; $\underline{y}$&nbsp; differ,&nbsp; for which thus&nbsp; $x_i \, - y_i = &plusmn;2$ &nbsp; &#8658; &nbsp; $ (x_i \, - y_i)^2 = 4$&nbsp; holds.
 +
 +
*Equal symbols&nbsp; $(x_i = y_i)$&nbsp; do not contribute to the Hamming distance and give&nbsp; $(x_i \, &ndash; y_i)^2 = 0$.&nbsp; According to the&nbsp; <u>solution 3</u>,&nbsp; we can therefore write:
 
:$$ d_{\rm H}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =
 
:$$ d_{\rm H}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =
 
\frac{1}{4} \cdot \sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2= \frac{1}{4} \cdot d_{\rm E}^2(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y})\hspace{0.05cm}.$$
 
\frac{1}{4} \cdot \sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2= \frac{1}{4} \cdot d_{\rm E}^2(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y})\hspace{0.05cm}.$$
  
  
'''(2)'''&nbsp; <u>Alle Lösungsvorschläge</u> sind richtig:
+
'''(2)'''&nbsp; <u>All proposed solutions</u>&nbsp; are correct:
*Beim BSC&ndash;Modell ist es allgemein üblich, zum gegebenen Empfangsvektor $\underline{y}$ das Codewort $\underline{x}$ mit der kleinsten Hamming&ndash;Distanz $d_{\rm H}(\underline{x}, \, \underline{y})$ auszuwählen:
+
*In the BSC model,&nbsp; it is common practice to select the code word&nbsp; $\underline{x}$&nbsp; with the smallest Hamming distance&nbsp; $d_{\rm H}(\underline{x}, \, \underline{y})$&nbsp; for the given received vector&nbsp; $\underline{y}$:
 
:$$\underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm}
 
:$$\underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm}
 
d_{\rm H}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y})\hspace{0.05cm}.$$
 
d_{\rm H}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y})\hspace{0.05cm}.$$
*Entsprechend der Teilaufgabe (1) gilt aber auch:
+
*But according to the subtask&nbsp; '''(1)'''&nbsp; also applies:
 
:$$\underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm}
 
:$$\underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm}
 
d_{\rm E}^{\hspace{0.15cm}2}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y})/4
 
d_{\rm E}^{\hspace{0.15cm}2}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y})/4
Line 92: Line 105:
 
\hspace{0.05cm}.$$
 
\hspace{0.05cm}.$$
  
*Der Faktor $1/4$ spielt für die Minimierung keine Rolle. Da $d_{\rm E}(\underline{x}, \, \underline{y}) &#8805; 0$ ist, ist es auch egal, ob die Minimierung hinsichtlich $d_{\rm E}(\underline{x}, \, \underline{y})$ oder $d_{\rm E}^2(\underline{x}, \, \underline{y})$ vorgenommen wird.  
+
*The factor&nbsp; $1/4$&nbsp; does not matter for the minimization.&nbsp; Since&nbsp; $d_{\rm E}(\underline{x}, \, \underline{y}) &#8805; 0$,&nbsp; it does not matter whether the minimization is done with respect to &nbsp; $d_{\rm E}(\underline{x}, \, \underline{y})$ &nbsp; or &nbsp; $d_{\rm E}^2(\underline{x}, \, \underline{y})$.  
 +
 
  
  
'''(3)'''&nbsp; Richtig ist der <u>Lösungsvorschlag 2</u>:
+
'''(3)'''&nbsp; Correct is the&nbsp; <u>proposed solution 2</u>:
*Das Quadrat der Euklidischen Distanz kann wie folgt ausgedrückt werden:
+
*The square of the Euclidean distance can be expressed as follows:
 
:$$d_{\rm E}^{\hspace{0.15cm}2}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =
 
:$$d_{\rm E}^{\hspace{0.15cm}2}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =
 
\sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2 =  
 
\sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2 =  
Line 103: Line 117:
 
\hspace{0.05cm}.$$
 
\hspace{0.05cm}.$$
  
*Die beiden ersten Summanden sind jeweils gleich $L$ und müssen für die Minimierung nicht berücksichtigt werden.  
+
*The first two summands are each equal to&nbsp; $L$&nbsp; and need not be considered for minimization.
*Für den letzten Ausdruck in dieser Gleichung kann $&ndash;2 \cdot &#9001; \underline{x}, \, \underline{y} &#9002;$ geschrieben werden.  
+
*Aufgrund des negativen Vorzeichens wird aus der Minimierung eine Maximierung &nbsp; &#8658; &nbsp; <u>Antwort 2</u>.
+
*For the last expression in this equation,&nbsp; $&ndash;2 \cdot &#9001; \underline{x}, \, \underline{y} &#9002;$&nbsp; can be written.
 +
 +
*Due to the negative sign,&nbsp; minimization becomes maximization &nbsp; &#8658; &nbsp; <u>answer 2</u>.
  
  
'''(4)'''&nbsp; Richtig sind die <u>Lösungsvorschläge 2 und 3</u>.:
+
 
*Für den AWGN&ndash;Kanal kann im Gegensatz zum BSC keine Hamming&ndash;Distanz angegeben werden.  
+
'''(4)'''&nbsp; Correct are the&nbsp; <u>proposed solutions 2 and 3</u>:
*Ausgehend von der Gleichung
+
*For the AWGN channel,&nbsp; unlike the BSC,&nbsp; no Hamming distance can be specified.
 +
 +
*Based on the equation
 
:$$d_{\rm E}^{\hspace{0.15cm}2}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =  
 
:$$d_{\rm E}^{\hspace{0.15cm}2}(\underline{x}  \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) =  
 
\hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} x_i^{\hspace{0.15cm}2} \hspace{0.1cm}+ \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} y_i^{\hspace{0.15cm}2}
 
\hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} x_i^{\hspace{0.15cm}2} \hspace{0.1cm}+ \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} y_i^{\hspace{0.15cm}2}
 
\hspace{0.1cm}-2 \cdot \sum_{i=1}^{L} \hspace{0.1cm} x_i \cdot y_i$$
 
\hspace{0.1cm}-2 \cdot \sum_{i=1}^{L} \hspace{0.1cm} x_i \cdot y_i$$
  
:gelten für den ersten und letzten Summanden die gleichen Aussagen wie für das BSC&ndash;Modell &ndash; siehe Teilaufgabe (3).  
+
:the same statements apply for the first and last summands as for the BSC model &ndash; see subtask&nbsp; '''(3)'''.  
*Für den mittleren Summanden gilt mit $y_i = x_i + n_i$ und $x_i &#8712; \{&ndash;1, \, +1\}$:  
+
 
 +
*For the middle summand,&nbsp; $y_i = x_i + n_i$&nbsp; and&nbsp; $x_i &#8712; \{&ndash;1, \, +1\}$&nbsp; hold:  
 
:$$\sum_{i=1}^{L} \hspace{0.1cm} y_i^{\hspace{0.15cm}2} =  
 
:$$\sum_{i=1}^{L} \hspace{0.1cm} y_i^{\hspace{0.15cm}2} =  
 
\hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} x_i^{\hspace{0.15cm}2} \hspace{0.1cm}+ \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} n_i^{\hspace{0.15cm}2}
 
\hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} x_i^{\hspace{0.15cm}2} \hspace{0.1cm}+ \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} n_i^{\hspace{0.15cm}2}
 
\hspace{0.1cm}+2 \cdot \sum_{i=1}^{L} \hspace{0.1cm} x_i \cdot n_i \hspace{0.05cm}.$$
 
\hspace{0.1cm}+2 \cdot \sum_{i=1}^{L} \hspace{0.1cm} x_i \cdot n_i \hspace{0.05cm}.$$
  
*Der erste Summand ergibt wieder $L$, der zweite ist proportional zur Rauschleistung und der letzte Term verschwindet, da $\underline{x}$ und $\underline{n}$ unkorreliert sind.  
+
*The first summand gives again&nbsp; $L$,&nbsp; the second is proportional to the noise power,&nbsp; and the last term vanishes since&nbsp; $\underline{x}$&nbsp; and&nbsp; $\underline{n}$&nbsp; are uncorrelated.
*Für die Minimerung von $d_{\rm E}(\underline{x}, \, \underline{y})$ muss also die Summe über $y_i^2$ nicht berücksichtigt werden, da kein Bezug zu den Codesequenzen $\underline{x}$ besteht.
+
 +
*So for minimizing&nbsp; $d_{\rm E}(\underline{x}, \, \underline{y})$,&nbsp; the sum over&nbsp; $y_i^2$&nbsp; need not be considered since there is no relation to the code sequences&nbsp; $\underline{x}$.
 
{{ML-Fuß}}
 
{{ML-Fuß}}
  
  
  
[[Category:Aufgaben zu  Kanalcodierung|^3.4 Decodierung von Faltungscodes^]]
+
[[Category:Channel Coding: Exercises|^3.4 Decoding of Convolutional Codes^]]

Latest revision as of 15:57, 21 November 2022

Overall system model,  given for this exercise

The Viterbi algorithm represents the best known realization form for the maximum likelihood decoding of a convolutional code.  We assume here the following model:

  • The information sequence  $\underline{u}$  is converted into the code sequence  $\underline{x}$  by a convolutional code.
  • It is valid  $u_i ∈ \{0, \, 1\}$.  In contrast,  the code symbols are represented bipolar   ⇒   $x_i ∈ \{–1, \, +1\}$.
  • Let the channel be given by the models  $\text{BSC}$    ⇒   received values  $y_i ∈ \{–1, \, +1\}$  or  $\text{AWGN}$   ⇒ $y_i$  real-valued.
  • Given a received sequence  $\underline{y}$  the Viterbi algorithm decides for the sequence  $\underline{z}$  according to
$$\underline{z} = {\rm arg} \max_{\underline{x}_{\hspace{0.03cm}i} \hspace{0.03cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} {\rm Pr}( \underline{x}_{\hspace{0.03cm}i} |\hspace{0.05cm} \underline{y} ) \hspace{0.05cm}.$$
$$\underline{z} = {\rm arg} \max_{\underline{x}_{\hspace{0.03cm}i} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} {\rm Pr}( \underline{y} \hspace{0.05cm}|\hspace{0.05cm} \underline{x}_{\hspace{0.03cm}i} ) \hspace{0.05cm}.$$
  • As a further result,  the Viterbi algorithm additionally outputs the sequence  $\underline{v}$  as an estimate for the information sequence  $\underline{u}$.


In this exercise,  you should determinethe relationship between the  $\text{Hamming distance}$  $d_{\rm H}(\underline{x}, \, \underline{y})$  and the  $\text{Euclidean distance}$

$$d_{\rm E}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) = \sqrt{\sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2}\hspace{0.05cm}.$$

Then,  the above maximum likelihood criterion is to be formulated with

  • the Hamming distance  $d_{\rm H}(\underline{x}, \, \underline{y})$,
  • the Euclidean distance  $d_{\rm E}(\underline{x}, \, \underline{y})$,  and





Hints:

  • For simplicity,  "tilde"  and  "apostrophe"  are omitted.
  • For more information on this topic,  see the following sections in this book:


Questions

1

How are   $d_{\rm H}(\underline{x}, \, \underline{y})$   and   $d_{\rm E}(\underline{x}, \, \underline{y})$   related in the BSC model?

  $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}(\underline{x}, \, \underline{y})$  is valid.
  $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}^2(\underline{x}, \, \underline{y})$  is valid.
  $d_{\rm H}(\underline{x}, \, \underline{y}) = d_{\rm E}^2(\underline{x}, \, \underline{y})/4$  is valid.

2

Which of the equations describe the maximum likelihood decoding in the BSC model?  The minimization/maximization refers alwaysto all  $\underline{x} ∈\mathcal{ C}$.

$\underline{z} = \arg \min {d_{\rm H}(\underline{x}, \, \underline{y})}$,
$\underline{z} = \arg \min {d_{\rm E}(\underline{x}, \, \underline{y})}$,
$\underline{z} = \arg \min {d_{\rm E}^2(\underline{x}, \, \underline{y})}$,

3

Which equation describes the maximum likelihood decision in the BSC model?

$\underline{z} = \arg \min 〈 \underline{x} \cdot \underline{y} 〉$,
$\underline{z} = \arg \max 〈 \underline{x} \cdot \underline{y} 〉$.

4

What equations apply to the maximum likelihood decision in the AWGN model?

$\underline{z} = \arg \min {d_{\rm H}(\underline{x}, \, \underline{y})}$,
$\underline{z} = \arg \min {d_{\rm E}(\underline{x}, \, \underline{y})}$,
$\underline{z} = \arg \max 〈 \underline{x} \cdot \underline{y} 〉$.


Solution

(1)  Correct is the  proposed solution 3:

  • Let the two binary sequences be   $\underline{x}$   and   $\underline{y}$   with   $x_i ∈ \{-1, \, +1\}, \ y_i ∈ \{-1, \, +1\}$.  Let the sequence length be  $L$  in each case.
  • The Hamming distance   $d_{\rm H}(\underline{x}, \, \underline{y})$   gives the number of bits in which  $\underline{x}$  and  $\underline{y}$  differ,  for which thus  $x_i \, - y_i = ±2$   ⇒   $ (x_i \, - y_i)^2 = 4$  holds.
  • Equal symbols  $(x_i = y_i)$  do not contribute to the Hamming distance and give  $(x_i \, – y_i)^2 = 0$.  According to the  solution 3,  we can therefore write:
$$ d_{\rm H}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) = \frac{1}{4} \cdot \sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2= \frac{1}{4} \cdot d_{\rm E}^2(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y})\hspace{0.05cm}.$$


(2)  All proposed solutions  are correct:

  • In the BSC model,  it is common practice to select the code word  $\underline{x}$  with the smallest Hamming distance  $d_{\rm H}(\underline{x}, \, \underline{y})$  for the given received vector  $\underline{y}$:
$$\underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} d_{\rm H}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y})\hspace{0.05cm}.$$
  • But according to the subtask  (1)  also applies:
$$\underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} d_{\rm E}^{\hspace{0.15cm}2}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y})/4 \hspace{0.2cm}\Rightarrow \hspace{0.2cm} \underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} d_{\rm E}^{\hspace{0.15cm}2}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) \hspace{0.2cm}\Rightarrow \hspace{0.2cm} \underline{z} = {\rm arg} \min_{\underline{x} \hspace{0.05cm} \in \hspace{0.05cm} \mathcal{C}} \hspace{0.1cm} d_{\rm E}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) \hspace{0.05cm}.$$
  • The factor  $1/4$  does not matter for the minimization.  Since  $d_{\rm E}(\underline{x}, \, \underline{y}) ≥ 0$,  it does not matter whether the minimization is done with respect to   $d_{\rm E}(\underline{x}, \, \underline{y})$   or   $d_{\rm E}^2(\underline{x}, \, \underline{y})$.


(3)  Correct is the  proposed solution 2:

  • The square of the Euclidean distance can be expressed as follows:
$$d_{\rm E}^{\hspace{0.15cm}2}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) = \sum_{i=1}^{L} \hspace{0.2cm}(x_i - y_i)^2 = \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} x_i^{\hspace{0.15cm}2} \hspace{0.1cm}+ \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} y_i^{\hspace{0.15cm}2} \hspace{0.1cm}-2 \cdot \sum_{i=1}^{L} \hspace{0.1cm} x_i \cdot y_i \hspace{0.05cm}.$$
  • The first two summands are each equal to  $L$  and need not be considered for minimization.
  • For the last expression in this equation,  $–2 \cdot 〈 \underline{x}, \, \underline{y} 〉$  can be written.
  • Due to the negative sign,  minimization becomes maximization   ⇒   answer 2.


(4)  Correct are the  proposed solutions 2 and 3:

  • For the AWGN channel,  unlike the BSC,  no Hamming distance can be specified.
  • Based on the equation
$$d_{\rm E}^{\hspace{0.15cm}2}(\underline{x} \hspace{0.05cm}, \hspace{0.1cm}\underline{y}) = \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} x_i^{\hspace{0.15cm}2} \hspace{0.1cm}+ \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} y_i^{\hspace{0.15cm}2} \hspace{0.1cm}-2 \cdot \sum_{i=1}^{L} \hspace{0.1cm} x_i \cdot y_i$$
the same statements apply for the first and last summands as for the BSC model – see subtask  (3).
  • For the middle summand,  $y_i = x_i + n_i$  and  $x_i ∈ \{–1, \, +1\}$  hold:
$$\sum_{i=1}^{L} \hspace{0.1cm} y_i^{\hspace{0.15cm}2} = \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} x_i^{\hspace{0.15cm}2} \hspace{0.1cm}+ \hspace{0.1cm}\sum_{i=1}^{L} \hspace{0.1cm} n_i^{\hspace{0.15cm}2} \hspace{0.1cm}+2 \cdot \sum_{i=1}^{L} \hspace{0.1cm} x_i \cdot n_i \hspace{0.05cm}.$$
  • The first summand gives again  $L$,  the second is proportional to the noise power,  and the last term vanishes since  $\underline{x}$  and  $\underline{n}$  are uncorrelated.
  • So for minimizing  $d_{\rm E}(\underline{x}, \, \underline{y})$,  the sum over  $y_i^2$  need not be considered since there is no relation to the code sequences  $\underline{x}$.