Linear code

From Wikipedia, the free encyclopedia

Class of error-correcting code

In coding theory, a linear code is an error-correcting code for which any linear combination of codewords is also a codeword. Linear codes are traditionally partitioned into block codes and convolutional codes, although turbo codes can be seen as a hybrid of these two types.^[1] Linear codes allow for more efficient encoding and decoding algorithms than other codes (cf. syndrome decoding).^[citation needed]

Linear codes are used in forward error correction and are applied in methods for transmitting symbols (e.g., bits) on a communications channel so that, if errors occur in the communication, some errors can be corrected or detected by the recipient of a message block. The codewords in a linear block code are blocks of symbols that are encoded using more symbols than the original value to be sent.^[2] A linear code of length n transmits blocks containing n symbols. For example, the [7,4,3] Hamming code is a linear binary code which represents 4-bit messages using 7-bit codewords. Two distinct codewords differ in at least three bits. As a consequence, up to two errors per codeword can be detected while a single error can be corrected.^[3] This code contains 2⁴ = 16 codewords.

Definition and parameters

A linear code of length n and dimension k is a linear subspace C with dimension k of the vector space $\mathbb{F}_{q}^{n}$, where $\mathbb{F}_{q}$ is the finite field with q elements. Such a code is called a q-ary code. If q = 2 or q = 3, the code is described as a binary code or a ternary code, respectively. The vectors in C are called codewords. The size of a code is the number of codewords and equals q^k.

The weight of a codeword is the number of its elements that are nonzero, and the distance between two codewords is the Hamming distance between them, that is, the number of elements in which they differ. The distance d of the linear code is the minimum weight of its nonzero codewords, or equivalently, the minimum distance between distinct codewords. A linear code of length n, dimension k, and distance d is called an [n,k,d] code (or, more precisely, an $[n,k,d]_{q}$ code).
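As a minimal sketch of these definitions, the following Python snippet enumerates a small code and computes its size and distance. The ternary generators below are an illustrative choice (not taken from the text above), and the helper names are hypothetical; the arithmetic assumes q is prime so that integers modulo q form the field.

```python
from itertools import product

def hamming_weight(word):
    """Number of nonzero coordinates of a word."""
    return sum(1 for x in word if x != 0)

def hamming_distance(u, v):
    """Number of coordinates in which u and v differ."""
    return sum(1 for a, b in zip(u, v) if a != b)

def span(generators, q):
    """All F_q-linear combinations of the generator rows (q assumed prime)."""
    n = len(generators[0])
    return {
        tuple(sum(a * g[i] for a, g in zip(coeffs, generators)) % q for i in range(n))
        for coeffs in product(range(q), repeat=len(generators))
    }

# Illustrative choice: a ternary code with n = 4 and k = 2.
q, generators = 3, [(1, 0, 1, 1), (0, 1, 1, 2)]
code = span(generators, q)

size = len(code)  # equals q**k when the generators are linearly independent
d = min(hamming_weight(c) for c in code if any(c))
print(size, d)  # prints: 9 3
```

Under these assumptions the example is a $[4,2,3]_{3}$ code with 3² = 9 codewords.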

We give $\mathbb{F}_{q}^{n}$ the standard basis because each coordinate represents a "bit" that is transmitted across a "noisy channel" with some small probability of transmission error (a binary symmetric channel). If some other basis is used, this model no longer applies and the Hamming metric does not measure the number of errors in transmission, as we want it to.

Generator and check matrices

As a linear subspace of $\mathbb{F}_{q}^{n}$, the entire code C (which may be very large) may be represented as the span of a set of $k$ codewords (known as a basis in linear algebra). These basis codewords are often collated in the rows of a matrix G known as a generating matrix for the code C. When G has the block matrix form $\boldsymbol{G}=[I_{k}\mid P]$, where $I_{k}$ denotes the $k\times k$ identity matrix and P is some $k\times (n-k)$ matrix, then we say G is in standard form.

A matrix H representing a linear function $\phi :\mathbb{F}_{q}^{n}\to \mathbb{F}_{q}^{n-k}$ whose kernel is C is called a check matrix of C (or sometimes a parity check matrix). Equivalently, H is a matrix whose null space is C. If C is a code with a generating matrix G in standard form, $\boldsymbol{G}=[I_{k}\mid P]$, then $\boldsymbol{H}=[-P^{T}\mid I_{n-k}]$ is a check matrix for C. The code generated by H is called the dual code of C. It can be verified that G is a $k\times n$ matrix, while H is an $(n-k)\times n$ matrix.
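The passage from a standard-form generating matrix to a check matrix can be sketched in a few lines of Python. The function names and the ternary example matrix are assumptions made for illustration; the arithmetic is done modulo a prime q.

```python
def check_matrix_from_standard_form(G, q):
    """Given G = [I_k | P] over F_q (q assumed prime), return H = [-P^T | I_{n-k}]."""
    k, n = len(G), len(G[0])
    r = n - k
    # Verify that the left k x k block really is the identity.
    for i in range(k):
        for j in range(k):
            if G[i][j] % q != (1 if i == j else 0):
                raise ValueError("G is not in standard form")
    P = [row[k:] for row in G]
    H = []
    for i in range(r):
        left = [(-P[j][i]) % q for j in range(k)]        # row i of -P^T
        right = [1 if i == j else 0 for j in range(r)]   # row i of I_{n-k}
        H.append(left + right)
    return H

def times_transpose(A, B, q):
    """Compute A * B^T modulo q."""
    return [[sum(a * b for a, b in zip(ra, rb)) % q for rb in B] for ra in A]

# Illustrative ternary example with k = 2 and n = 4.
G = [[1, 0, 1, 1],
     [0, 1, 1, 2]]
H = check_matrix_from_standard_form(G, 3)
print(H)                         # [[2, 2, 1, 0], [2, 1, 0, 1]]
print(times_transpose(G, H, 3))  # [[0, 0], [0, 0]]
```

The all-zero product confirms that every row of G, and hence every codeword, lies in the null space of H.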

Linearity guarantees that the minimum Hamming distance d between a codeword c₀ and any of the other codewords c ≠ c₀ is independent of c₀. This follows from the property that the difference c − c₀ of two codewords in C is also a codeword (i.e., an element of the subspace C), and the property that d(c, c₀) = d(c − c₀, 0). These properties imply that

$$\min_{c\in C,\ c\neq c_{0}}d(c,c_{0})=\min_{c\in C,\ c\neq c_{0}}d(c-c_{0},0)=\min_{c\in C,\ c\neq 0}d(c,0)=d.$$

In other words, to find the minimum distance between the codewords of a linear code, one only needs to look at the non-zero codewords. The non-zero codeword with the smallest weight has the minimum distance to the zero codeword, and hence determines the minimum distance of the code.
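This independence from c₀ can be checked by brute force on a small example. The sketch below uses a hypothetical binary code of length 5 and dimension 2, chosen only for illustration, and compares the minimum nonzero weight with the distance from each codeword to its nearest neighbour.

```python
from itertools import product

def span_gf2(generators):
    """Sorted list of all F_2-linear combinations of the generator rows."""
    n = len(generators[0])
    return sorted({
        tuple(sum(a * g[i] for a, g in zip(coeffs, generators)) % 2 for i in range(n))
        for coeffs in product((0, 1), repeat=len(generators))
    })

def distance(u, v):
    """Hamming distance between two words of equal length."""
    return sum(a != b for a, b in zip(u, v))

# Hypothetical example code, not taken from the text.
code = span_gf2([(1, 0, 1, 1, 0), (0, 1, 0, 1, 1)])

# Minimum weight over the nonzero codewords.
min_weight = min(sum(c) for c in code if any(c))

# For every codeword c0, the distance to its closest other codeword.
nearest = {c0: min(distance(c, c0) for c in code if c != c0) for c0 in code}

print(min_weight, set(nearest.values()))  # prints: 3 {3}
```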

The distance d of a linear code C also equals the minimum number of linearly dependent columns of the check matrix H.

Proof: Every codeword $\boldsymbol{c}\in C$ satisfies $\boldsymbol{H}\cdot \boldsymbol{c}^{T}=\boldsymbol{0}$, which is equivalent to $\sum_{i=1}^{n}(c_{i}\cdot \boldsymbol{H_{i}})=\boldsymbol{0}$, where $\boldsymbol{H_{i}}$ is the $i^{th}$ column of $\boldsymbol{H}$. Dropping the terms with $c_{i}=0$ shows that, for a nonzero codeword, the columns $\boldsymbol{H_{i}}$ with $c_{i}\neq 0$ are linearly dependent. Therefore, $d$ is at least the minimum number of linearly dependent columns. Conversely, consider a minimum set of linearly dependent columns $\{\boldsymbol{H_{j}}\mid j\in S\}$, where $S$ is the column index set. By linear dependence there are coefficients $c_{j}$, not all zero, with $\sum_{j\in S}(c_{j}\cdot \boldsymbol{H_{j}})=\boldsymbol{0}$. Now consider the vector $\boldsymbol{c'}$ such that $c_{j}'=c_{j}$ if $j\in S$ and $c_{j}'=0$ if $j\notin S$. Note that $\boldsymbol{c'}\in C$ because $\boldsymbol{H}\cdot \boldsymbol{c'}^{T}=\boldsymbol{0}$, and that $\boldsymbol{c'}\neq \boldsymbol{0}$. Therefore, we have $d\leq wt(\boldsymbol{c'})\leq |S|$, which is the minimum number of linearly dependent columns in $\boldsymbol{H}$. The claimed property is therefore proven.
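For a binary code the statement can be tested directly: over $\mathbb{F}_{2}$ a smallest linearly dependent set of columns is a smallest nonempty set of columns that sums to zero. The brute-force sketch below relies on that observation; the function name is hypothetical, and the matrix is the check matrix of the [7,4,3] Hamming code discussed in this article.

```python
from itertools import combinations

def min_dependent_columns_gf2(H):
    """Size of a smallest linearly dependent set of columns of H over F_2.

    Over F_2 every nontrivial relation has coefficients equal to 1, so it is
    enough to look for the smallest nonempty set of columns summing to zero.
    Returns None if all columns together are linearly independent.
    """
    n_rows, n_cols = len(H), len(H[0])
    columns = [tuple(row[j] for row in H) for j in range(n_cols)]
    for size in range(1, n_cols + 1):
        for subset in combinations(columns, size):
            if all(sum(col[i] for col in subset) % 2 == 0 for i in range(n_rows)):
                return size
    return None

# Check matrix of the [7,4,3] binary Hamming code.
H = [[1, 0, 1, 1, 1, 0, 0],
     [1, 1, 1, 0, 0, 1, 0],
     [0, 1, 1, 1, 0, 0, 1]]
print(min_dependent_columns_gf2(H))  # prints: 3
```

No column is zero and no two columns are equal, so the first dependency appears among three columns, matching d = 3.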

Example: Hamming codes

Main article: Hamming code

As the first class of linear codes developed for error correction, Hamming codes have been widely used in digital communication systems. For any positive integer $r\geq 2$, there exists a $[2^{r}-1,2^{r}-r-1,3]_{2}$ Hamming code. Since $d=3$, this Hamming code can correct a 1-bit error.

Example: The linear block code with the following generator matrix and parity check matrix is a $[7,4,3]_{2}$ Hamming code.

$$\boldsymbol{G}=\begin{pmatrix}1&0&0&0&1&1&0\\0&1&0&0&0&1&1\\0&0&1&0&1&1&1\\0&0&0&1&1&0&1\end{pmatrix},$$

$$\boldsymbol{H}=\begin{pmatrix}1&0&1&1&1&0&0\\1&1&1&0&0&1&0\\0&1&1&1&0&0&1\end{pmatrix}$$
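A short Python sketch shows how the two matrices above can be used to encode a 4-bit message and to correct a single flipped bit. The function names are hypothetical. The correction step relies on the fact that a single error in position j produces a syndrome equal to column j of H, so it is only meaningful when at most one bit is wrong.

```python
# Generator and check matrices of the [7,4,3] Hamming code given above.
G = [[1, 0, 0, 0, 1, 1, 0],
     [0, 1, 0, 0, 0, 1, 1],
     [0, 0, 1, 0, 1, 1, 1],
     [0, 0, 0, 1, 1, 0, 1]]
H = [[1, 0, 1, 1, 1, 0, 0],
     [1, 1, 1, 0, 0, 1, 0],
     [0, 1, 1, 1, 0, 0, 1]]

def encode(message):
    """Codeword m * G over F_2 for a 4-bit message m."""
    return [sum(m * g for m, g in zip(message, col)) % 2 for col in zip(*G)]

def syndrome(word):
    """Syndrome H * w^T over F_2; it is zero exactly for codewords."""
    return [sum(h * w for h, w in zip(row, word)) % 2 for row in H]

def correct_single_error(word):
    """Flip the one position whose column of H matches the syndrome."""
    s = syndrome(word)
    if not any(s):
        return list(word)            # already a codeword
    columns = [list(col) for col in zip(*H)]
    position = columns.index(s)      # every nonzero syndrome is a column of H
    corrected = list(word)
    corrected[position] ^= 1
    return corrected

codeword = encode([1, 0, 1, 1])
received = list(codeword)
received[2] ^= 1                     # introduce one bit error
print(codeword)                        # [1, 0, 1, 1, 1, 0, 0]
print(correct_single_error(received))  # [1, 0, 1, 1, 1, 0, 0]
```

Because G is in standard form, the first four bits of the corrected word are the original message.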

Example: Hadamard codes

Main article: Hadamard code

The Hadamard code is a $[2^{r},r,2^{r-1}]_{2}$ linear code and is capable of correcting many errors. Its generator matrix can be constructed column by column: the $i^{th}$ column consists of the bits of the binary representation of the integer $i$, as shown in the following example. The Hadamard code has minimum distance $2^{r-1}$ and can therefore correct $2^{r-2}-1$ errors.

Example: The linear block code with the following generator matrix is an $[8,3,4]_{2}$ Hadamard code: $\boldsymbol{G}_{\mathrm{Had}}=\begin{pmatrix}0&0&0&0&1&1&1&1\\0&0&1&1&0&0&1&1\\0&1&0&1&0&1&0&1\end{pmatrix}$.

The Hadamard code is a special case of the Reed–Muller code. If we take the first column (the all-zero column) out of $\boldsymbol{G}_{\mathrm{Had}}$, we get the $[7,3,4]_{2}$ simplex code, which is the dual code of the Hamming code.
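The column-by-column construction can be sketched as follows. The function names are hypothetical, and the bit order (most significant bit in the top row) is chosen to match the example matrix above.

```python
from itertools import product

def hadamard_generator(r):
    """r x 2^r matrix whose column i holds the bits of i, most significant bit on top."""
    return [[(i >> (r - 1 - row)) & 1 for i in range(2 ** r)] for row in range(r)]

def nonzero_weights(G):
    """Sorted list of the distinct weights of nonzero codewords generated by G over F_2."""
    weights = set()
    for coeffs in product((0, 1), repeat=len(G)):
        if any(coeffs):
            word = [sum(a * g for a, g in zip(coeffs, col)) % 2 for col in zip(*G)]
            weights.add(sum(word))
    return sorted(weights)

G_had = hadamard_generator(3)
G_simplex = [row[1:] for row in G_had]   # drop the all-zero first column

for row in G_had:
    print(row)
print(nonzero_weights(G_had), nonzero_weights(G_simplex))  # prints: [4] [4]
```

For r = 3 every nonzero codeword of both codes has weight 4, in line with the stated parameters $[8,3,4]_{2}$ and $[7,3,4]_{2}$.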

Nearest neighbor algorithm

The parameter d is closely related to the error correcting ability of the code. The following procedure, called the nearest neighbor decoding algorithm, illustrates this:

Input: A received vector $v$ in $\mathbb{F}_{q}^{n}$.

Output: A codeword $w$ in $C$ closest to $v$, if any.

Starting with $t=0$, repeat the following two steps.
- Enumerate the elements of the ball of (Hamming) radius $t$ around the received word $v$, denoted $B_{t}(v)$. For each $w$ in $B_{t}(v)$, check whether $w$ is in $C$. If so, return $w$ as the solution.
- Increment $t$. Fail only when $t>(d-1)/2$, at which point the enumeration is complete and no solution has been found.

We say that a linear code $C$ is $t$-error correcting if there is at most one codeword in $B_{t}(v)$, for each $v$ in $\mathbb{F}_{q}^{n}$.
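The procedure can be sketched in Python for the binary case. This is an illustration under stated assumptions rather than a practical decoder: the code is passed in as an explicit set of codewords, the function name and the example code are hypothetical, and at each step only the words at distance exactly t are enumerated, since smaller distances were already checked in earlier rounds.

```python
from itertools import combinations

def nearest_neighbor_decode(v, codewords, d):
    """Return a codeword within distance (d-1)/2 of v, or None on failure.

    v         -- received binary word (sequence of 0/1)
    codewords -- set of tuples, the binary code C
    d         -- minimum distance of C
    """
    n = len(v)
    t = 0
    while 2 * t <= d - 1:                  # stop once t > (d - 1) / 2
        # Words at distance exactly t from v: flip every choice of t positions.
        for positions in combinations(range(n), t):
            w = list(v)
            for p in positions:
                w[p] ^= 1
            if tuple(w) in codewords:
                return tuple(w)
        t += 1
    return None

# Hypothetical [5,2,3] binary code used only for illustration.
code = {(0, 0, 0, 0, 0), (1, 0, 1, 1, 0), (0, 1, 0, 1, 1), (1, 1, 1, 0, 1)}

print(nearest_neighbor_decode((1, 0, 0, 1, 0), code, 3))  # (1, 0, 1, 1, 0)
print(nearest_neighbor_decode((1, 1, 0, 0, 0), code, 3))  # None
```

The second word is at distance at least 2 from every codeword, which exceeds (d − 1)/2 = 1, so the procedure reports failure. The cost grows with the size of the balls, which is why this brute-force search is mainly of conceptual interest.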

Popular notation

Main article: Block code § Popular notation

Codes in general are often denoted by the letter C, and a code of length n and of rank k (i.e., having k code words in its basis and k rows in its generating matrix) is generally referred to as an (n, k) code. Linear block codes are frequently denoted as [n, k, d] codes, where d refers to the code's minimum Hamming distance between any two code words.

(The [n, k, d] notation should not be confused with the (n, M, d) notation used to denote a non-linear code of length n, size M (i.e., having M code words), and minimum Hamming distance d.)

Singleton bound

Lemma (Singleton bound): Every linear [n,k,d] code C satisfies $k+d\leq n+1$ .

A code C whose parameters satisfy k + d = n + 1 is called maximum distance separable or MDS. Such codes, when they exist, are in some sense best possible.
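As a quick numerical illustration, the bound can be evaluated for a few familiar parameter sets. The helper names are hypothetical. The repetition and single parity check parameters are standard textbook values rather than examples worked out in this article.

```python
def satisfies_singleton_bound(n, k, d):
    """True if the parameters obey k + d <= n + 1."""
    return k + d <= n + 1

def is_mds(n, k, d):
    """True if the parameters meet the Singleton bound with equality."""
    return k + d == n + 1

examples = {
    "Hamming [7,4,3]": (7, 4, 3),
    "Hadamard [8,3,4]": (8, 3, 4),
    "repetition [5,1,5]": (5, 1, 5),
    "single parity check [5,4,2]": (5, 4, 2),
}
for name, (n, k, d) in examples.items():
    print(name, satisfies_singleton_bound(n, k, d), is_mds(n, k, d))
```

All four parameter sets satisfy the bound, but only the repetition and single parity check codes attain it.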

If C₁ and C₂ are two codes of length n and if there is a permutation p in the symmetric group S_n for which $(c_{1},\ldots ,c_{n})$ is in C₁ if and only if $(c_{p(1)},\ldots ,c_{p(n)})$ is in C₂, then we say C₁ and C₂ are permutation equivalent. More generally, if there is an $n\times n$ monomial matrix $M\colon \mathbb{F}_{q}^{n}\to \mathbb{F}_{q}^{n}$ which sends C₁ isomorphically to C₂, then we say C₁ and C₂ are equivalent.

Lemma: Any linear code is permutation equivalent to a code which is in standard form.
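One way to see the lemma is through Gaussian elimination with column swaps: row operations do not change the code, and each column swap replaces it by a permutation equivalent one. The following is a minimal sketch for binary codes; the function name and the example matrix are assumptions made for illustration.

```python
def to_standard_form_gf2(G):
    """Bring a full-rank binary generator matrix to the form [I_k | P].

    Returns (G_std, perm), where perm[j] is the index of the original
    column that ends up in position j. Row operations keep the code
    unchanged; column swaps give a permutation equivalent code.
    """
    rows = [list(r) for r in G]
    k, n = len(rows), len(rows[0])
    perm = list(range(n))
    for i in range(k):
        # Look for a nonzero entry in rows i.. and columns i..
        pivot = next(((r, c) for c in range(i, n) for r in range(i, k) if rows[r][c]), None)
        if pivot is None:
            raise ValueError("rows of G are linearly dependent")
        r, c = pivot
        rows[i], rows[r] = rows[r], rows[i]           # move pivot row up
        for row in rows:                              # move pivot column to position i
            row[i], row[c] = row[c], row[i]
        perm[i], perm[c] = perm[c], perm[i]
        for r2 in range(k):                           # clear the rest of column i
            if r2 != i and rows[r2][i]:
                rows[r2] = [a ^ b for a, b in zip(rows[r2], rows[i])]
    return rows, perm

G = [[1, 1, 0, 1],
     [1, 1, 1, 0]]
G_std, perm = to_standard_form_gf2(G)
print(G_std)  # [[1, 0, 1, 1], [0, 1, 0, 1]]
print(perm)   # [0, 2, 1, 3]
```

Here the second and third coordinates are exchanged, after which the generator matrix has the form $[I_{2}\mid P]$.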

Bonisoli's theorem

A code is defined to be equidistant if and only if there exists some constant d such that the distance between any two of the code's distinct codewords is equal to d.^[4] In 1984 Arrigo Bonisoli determined the structure of linear one-weight codes over finite fields and proved that every equidistant linear code is a sequence of dual Hamming codes.^[5]

Examples

Some examples of linear codes include:

Repetition code
Parity code
Cyclic code
Hamming code
Golay code, both the binary and ternary versions
Polynomial codes, of which BCH codes are an example
Reed–Solomon codes
Reed–Muller code
Algebraic geometry code
Binary Goppa code
Low-density parity-check codes
Expander code
Multidimensional parity-check code
Toric code
Turbo code
Locally recoverable code

Generalization

Hamming spaces over non-field alphabets have also been considered, especially over finite rings, most notably Galois rings over Z₄. This gives rise to modules instead of vector spaces and ring-linear codes (identified with submodules) instead of linear codes. The typical metric used in this case is the Lee distance. There exists a Gray isometry between $\mathbb{Z}_{2}^{2m}$ (i.e. GF($2^{2m}$)) with the Hamming distance and $\mathbb{Z}_{4}^{m}$ (also denoted as GR(4,m)) with the Lee distance; its main attraction is that it exhibits some "good" codes that are not linear over $\mathbb{Z}_{2}^{2m}$ as images of ring-linear codes from $\mathbb{Z}_{4}^{m}$.^[6]^[7]^[8]

Some authors have referred to such codes over rings simply as linear codes as well.^[9]
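The Gray isometry mentioned above can be checked by brute force for small m. The sketch assumes the usual Gray map on Z₄, which sends 0, 1, 2, 3 to 00, 01, 11, 10 and is applied coordinate by coordinate; the function names are hypothetical.

```python
from itertools import product

# Usual Gray map on Z_4 (assumed here): 0 -> 00, 1 -> 01, 2 -> 11, 3 -> 10.
GRAY = {0: (0, 0), 1: (0, 1), 2: (1, 1), 3: (1, 0)}

def gray_map(word):
    """Image in Z_2^(2m) of a word in Z_4^m."""
    return tuple(bit for x in word for bit in GRAY[x % 4])

def lee_distance(u, v):
    """Sum over coordinates of the Lee weight of the difference modulo 4."""
    return sum(min((a - b) % 4, (b - a) % 4) for a, b in zip(u, v))

def hamming_distance(u, v):
    """Number of coordinates in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))

m = 2
words = list(product(range(4), repeat=m))
is_isometry = all(
    lee_distance(u, v) == hamming_distance(gray_map(u), gray_map(v))
    for u in words for v in words
)
print(is_isometry)           # True
print(gray_map((1, 2, 3)))   # (0, 1, 1, 1, 1, 0)
```

The map preserves distances but is not additive: 1 + 1 = 2 in Z₄ is sent to 11, whereas 01 + 01 = 00 in $\mathbb{Z}_{2}^{2}$. This is why images of ring-linear codes need not be linear over $\mathbb{Z}_{2}$.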

References

[1] William E. Ryan and Shu Lin (2009). Channel Codes: Classical and Modern. Cambridge University Press. p. 4. ISBN 978-0-521-84868-8.
[2] MacKay, David J.C. (2003). Information Theory, Inference, and Learning Algorithms (PDF). Cambridge University Press. p. 9. Bibcode:2003itil.book.....M. ISBN 9780521642989. "In a linear block code, the extra $N-K$ bits are linear functions of the original $K$ bits; these extra bits are called parity-check bits."
[3] Thomas M. Cover and Joy A. Thomas (1991). Elements of Information Theory. John Wiley & Sons, Inc. pp. 210–211. ISBN 978-0-471-06259-2.
[4] Etzion, Tuvi; Raviv, Netanel (2013). "Equidistant codes in the Grassmannian". arXiv:1308.6231 [math.CO].
[5] Bonisoli, A. (1984). "Every equidistant linear code is a sequence of dual Hamming codes". Ars Combinatoria. 18: 181–186.
[6] Marcus Greferath (2009). "An Introduction to Ring-Linear Coding Theory". In Massimiliano Sala; Teo Mora; Ludovic Perret; Shojiro Sakata; Carlo Traverso (eds.). Gröbner Bases, Coding, and Cryptography. Springer Science & Business Media. ISBN 978-3-540-93806-4.
[7] "Encyclopedia of Mathematics". www.encyclopediaofmath.org.
[8] J.H. van Lint (1999). Introduction to Coding Theory (3rd ed.). Springer. Chapter 8: Codes over ℤ₄. ISBN 978-3-540-64133-9.
[9] S.T. Dougherty; J.-L. Kim; P. Sole (2015). "Open Problems in Coding Theory". In Steven Dougherty; Alberto Facchini; Andre Gerard Leroy; Edmund Puczylowski; Patrick Sole (eds.). Noncommutative Rings and Their Applications. American Mathematical Soc. p. 80. ISBN 978-1-4704-1032-2.

Bibliography

J. F. Humphreys; M. Y. Prest (2004). Numbers, Groups and Codes (2nd ed.). Cambridge University Press. ISBN 978-0-511-19420-7. Chapter 5 contains a more gentle introduction (than this article) to the subject of linear codes.

External links

q-ary code generator program
Code Tables: Bounds on the parameters of various types of codes, IAKS, Fakultät für Informatik, Universität Karlsruhe (TH). Online, up-to-date table of the optimal binary codes; also includes non-binary codes.
The database of Z4 codes. Online, up-to-date database of optimal Z4 codes.