US20050157811A1

Movatterモバイル変換

Info

Publication number: US20050157811A1
Application number: US11/081,260
Authority: US
Inventors: Bjorn Bjerke; John Ketchum; Jay Walton
Original assignee: Individual
Current assignee: Qualcomm Inc
Priority date: 2001-12-03
Filing date: 2005-03-15
Publication date: 2005-07-21
Also published as: BR0214640A; CN1618222A; US20030103584A1; US7154936B2; KR20050058269A; US20070064831A1; WO2003049397A3; JP2006503446A; TWI292271B; EP2317717A3; JP2010136404A; US20080212657A1; TW200303672A; AU2002343608A8; AU2002343608A1; US7372896B2; KR100942401B1; WO2003049397A2; EP2317717A2; EP1451991A2

Abstract

Techniques to iteratively detect and decode data transmitted in a wireless (e.g., MIMO-OFDM) communication system. The iterative detection and decoding is performed by iteratively passing soft (multi-bit) “a priori” information between a detector and a decoder. The detector receives modulation symbols, performs a detection function that is complementary to the symbol mapping performed at the transmitter, and provides soft-decision symbols for transmitted coded bits. “Extrinsic information” in the soft-decision symbols is then decoded by the decoder to provide its extrinsic information, which comprises the a priori information used by the detector in the detection process. The detection and decoding may be iterated a number of times. The soft-decision symbols and the a priori information may be represented using log-likelihood ratios (LLRs). Techniques are provided to reduce the computational complexity associated with deriving the LLRs, including interference nulling to isolate each transmitted signal and “dual-maxima” approximation.

Description

Claim of Priority Under 35 U.S.C. §120
The present Application for Patent is a Divisional of Patent Application No. 10/005,104 entitled “Iterative Detection and Decoding for a MIMO-OFDM System” filed Dec. 3, 2001, pending, and assigned to the assignee hereof and hereby expressly incorporated by reference herein.

BACKGROUND

1. Field

The present invention relates generally to data communication, and more specifically to techniques for performing iterative detection and decoding for a MIMO-OFDM communication system.

2. Background

A multiple-input multiple-output (MIMO) communication system employs multiple (NT) transmit antennas and multiple (NR) receive antennas for data transmission. A MIMO channel formed by the NT transmit and NR receive antennas may be decomposed into NS independent channels, with NS≦min {NT, NR }. Each of the NS independent channels is also referred to as a spatial subchannel of the MIMO channel and corresponds to a dimension. The MIMO system can provide improved performance (e.g., increased transmission capacity) over that of a single-input single-output (SISO) communication system if the additional dimensionalities created by the multiple transmit and receive antennas are utilized.

A wideband MIMO system typically experiences frequency selective fading, i.e., different amounts of attenuation across the system bandwidth. This frequency selective fading causes inter-symbol interference (ISI), which is a phenomenon whereby each symbol in a received signal acts as distortion to subsequent symbols in the received signal. This distortion degrades performance by impacting the ability to correctly detect the received symbols. As such, ISI is a non-negligible noise component that may have a large impact on the overall signal-to-noise-and-interference ratio (SNR) for systems designed to operate at high SNR levels, such as MIMO systems. In such systems, equalization may be used at the receivers to combat ISI. However, the computational complexity required to perform equalization is typically significant or prohibitive for most applications.

Orthogonal frequency division multiplexing (OFDM) may be used to combat ISI, and achieves this without the use of computationally intensive equalization. An OFDM system effectively partitions the system bandwidth into a number of (N_F) frequency subchannels, which may be referred to as sub-bands or frequency bins. Each frequency subchannel is associated with a respective subcarrier upon which data may be modulated. The frequency subchannels of the OFDM system may experience frequency selective fading (i.e., different amounts of attenuation for different frequency subchannels), depending on the characteristics (e.g., multipath profile) of the propagation path between the transmit and receive antennas. With OFDM, the ISI due to the frequency selective fading may be combated by repeating a portion of each OFDM symbol (i.e., appending a cyclic prefix to each OFDM symbol), as is known in the art.

A MIMO system may thus advantageously employ OFDM to combat ISI. The frequency subchannels of the MIMO-OFDM system may experience different channel conditions (e.g., different fading and multipath effects) and may achieve different SNRs. Moreover, the channel conditions may vary over time. Consequently, the supported data rates may vary from frequency subchannel to frequency subchannel and from spatial subchannel to spatial subchannel, and may further vary with time. To achieve high performance, it is necessary to properly code and modulate the data at the transmitter (e.g., based on the determined channel conditions) and to properly detect and decode the received signals at the receiver.

There is therefore a need in the art for techniques to detect and decode signals that may have been (flexibly) coded and modulated based on one or more coding and modulation schemes, e.g., as determined by the channel conditions.

SUMMARY

Aspects of the invention provide techniques to iteratively detect and decode data transmitted in a wireless (e.g., MIMO-OFDM) communication system. The iterative detection and decoding exploits the error correction capabilities of the channel code to provide improved performance. This is achieved by iteratively passing soft (multi-bit) “a priori” information between a soft-input soft-output detector and a soft-input soft-output decoder.

The detector receives modulation symbols previously generated at a transmitter system based on one or more coding and modulation schemes, performs a detection function that is complementary to the symbol mapping performed at the transmitter system, and provides soft-decision symbols for transmitted coded bits. Extrinsic information in the soft-decision symbols (which comprises the a priori information for the decoder, as described below) is then decoded by the decoder based on one or more decoding schemes complementary to the one or more coding schemes used at the transmitter system. The decoder further provides its extrinsic information (which comprises the a priori information for the detector) that is then used by the detector in the detection process.

The detection and decoding may be iterated a number of times. During the iterative detection and decoding process, the reliability of the bit decisions is improved with each iteration. The iterative detection and decoding process described herein may be used to combat frequency selective fading as well as flat fading. Moreover, the iterative detection and decoding process may be flexibly used with various types of coding schemes (e.g., serial and parallel concatenated convolutional codes) and with various modulation schemes (e.g., M-PSK and M-QAM).

The a priori information passed between the detector and decoder and the soft-decision symbols may be represented using log-likelihood ratios (LLRs). Techniques are provided herein to reduce the computational complexity associated with deriving the LLRs. Such techniques include the use of interference nulling to isolate each transmitted signal by removing the other interferers and the use of a “dual-maxima” or some other approximation to compute the LLRs, which are described below.

Various aspects and embodiments of the invention are described in further detail below. The invention further provides methods, receiver units, transmitter units, receiver systems, transmitter systems, systems, and other apparatuses and elements that implement various aspects, embodiments, and features of the invention, as described in further detail below.

BRIEF DESCRIPTION OF THE DRAWINGS

The features, nature, and advantages of the present invention will become more apparent from the detailed description set forth below when taken in conjunction with the drawings in which like reference characters identify correspondingly throughout and wherein:

FIG. 1 is a block diagram of a transmitter system and a receiver system in a MIMO-OFDM system;

FIGS. 2A and 2B are block diagrams of two transmitter units that code and modulate data with (1) a single coding and modulation scheme and (2) separate coding and modulation schemes on a per-antenna basis, respectively;

FIGS. 3A and 3B are block diagrams of serial and parallel concatenated convolutional encoders, respectively;

FIG. 3C is a block diagram of a recursive convolutional encoder;

FIGS. 4A and 4B are block diagrams of two receiver units that detect and decode data previously processed with (1) a single coding and modulation scheme and (2) separate coding and modulation schemes on a per-antenna basis, respectively;

FIG. 4C is a block diagram of a receiver unit that performs successive nulling and interference cancellation to recover one transmitted signal at a time;

FIGS. 5A and 5B are block diagrams of two Turbo decoders capable of performing iterative decoding for serial and parallel concatenated convolutional codes, respectively; and

FIG. 6 is a block diagram of an interference canceller that may be used for the receiver unit inFIG. 4C.

DETAILED DESCRIPTION

The iterative detection and decoding techniques described herein may be used for various wireless communication systems. For clarity, various aspects and embodiments of the invention are described specifically for multiple-input multiple output communication system that implements orthogonal frequency division multiplexing (i.e., a MIMO-OFDM system).

As noted above, a MIMO system employs N_Ttransmit antennas and N_Rreceive antennas for data transmission, where N_R≧N_T. A MIMO channel formed by the N_Ttransmit antennas and N_Rreceive antennas may be decomposed into N_Sspatial subchannels, where N_S≦min {N_T, N_R}. An OFDM system effectively partitions the system bandwidth into N_Ffrequency subchannels. Each frequency subchannel may be defined to be sufficiently narrow so that its frequency response is considered flat or frequency non-selective. A MIMO-OFDM system may thus transmit data via a number of (N_C) “transmission channels” (where N_C=N_S·N_F), with each such transmission channel corresponding to a frequency subchannel of a spatial subchannel.

FIG. 1 is a block diagram of an embodiment of atransmitter system110 and areceiver system150 in a MIMO-OFDM system100.Transmitter system110 andreceiver system150 are capable of implementing various aspects and embodiments of the invention, as described below.

Attransmitter system110, traffic data is provided at a particular data rate from adata source112 to a transmit (TX)data processor114, which codes and interleaves the traffic data based on one or more coding schemes to provide coded data. The coding may be performed based on a single coding scheme for all transmit antennas, one coding scheme for each transmit antenna or each subset of transmit antennas, or one coding scheme for each transmission channel or each group of transmission channels. The data rate and the coding may be determined by a data rate control and a coding control, respectively, provided by acontroller130.

The coded data is then provided to amodulator116, which may also receive pilot data (e.g., data of a known pattern and processed in a known manner). The pilot data may be multiplexed with the coded traffic data (e.g., using time division multiplexing (TDM) or code division multiplexing (CDM)) in all or a subset of the frequency subchannels and in all or a subset of the spatial subchannels used to transmit the traffic data. The pilot may be used by the receiver system to perform a number of functions such as acquisition, frequency and timing synchronization, channel estimation, coherent data demodulation, and so on.

In a specific embodiment, the processing bymodulator116 includes (1) modulating the received data with one or more modulation schemes (e.g., M-PSK, M-QAM, and so on) to provide modulation symbols, (2) transforming the modulation symbols to form OFDM symbols, and (3) appending a cyclic prefix to each OFDM symbol to form a corresponding transmission symbol. Similarly, the modulation may be performed based on a single modulation scheme for all transmit antennas, one modulation scheme for each transmit antenna or each subset of transmit antennas, or one modulation scheme for each transmission channel or each group of transmission channels. The modulation is performed based on a modulation control provided bycontroller130. The modulated data (i.e., the transmission symbols) is then provided to transmitters (TMTR)122athrough122tassociated with the N_Ttransmit antennas to be used for data transmission.

Each transmitter122 converts the received modulated data into one or more analog signals and further conditions (e.g., amplifies, filters, and quadrature modulates) the analog signals to generate a modulated signal suitable for transmission over the communication channel. The modulated signals fromtransmitters122athrough122tare then transmitted viaantennas124athrough124t, respectively, to the receiver system.

Atreceiver system150, the transmitted modulated signals are received byantennas152athrough152r, and the received signal from each antenna is provided to a respective receiver (RCVR)154. Each receiver154 conditions (e.g., filters, amplifies, and downconverts) a respective received signal and digitizes the conditioned signal to provide a respective stream of data samples, which represent the transmission symbols received via the associated antenna. A demodulator (Demod)156 receives and demodulates the N_Rdata sample streams fromreceivers154athrough154rto provide N_Rcorresponding streams of received modulation symbols. For each data sample stream,demodulator156 removes the cyclic prefix included in each transmission symbol and then transforms each received OFDM symbol to provide a corresponding stream of received modulation symbols.

A detector/decoder158 initially performs the detection function that is complementary to the symbol mapping and provides soft-decision (multi-bit) symbols for the coded bits transmitted from the transmitter system. The soft-decision symbols are then decoded based on one or more decoding schemes complementary to the one or more coding schemes used at the transmitter system. In an aspect, the detection and decoding may be performed iteratively a number of times, as described in further detail below. The decoded data is then provided to adata sink160.

Controllers

130 and170 direct the operation at the transmitter and receiver systems, respectively.

Memories

132 and172 provide storage for program codes and data used by

controllers

130 and170, respectively.

Transmitter System

FIG. 2A is a block diagram of atransmitter unit200a, which is an embodiment of the transmitter portion oftransmitter system110 inFIG. 1. In this embodiment, a single coding scheme is used for all N_Ttransmit antennas and a single modulation scheme is used for all N_Ffrequency subchannels of all transmit antennas.Transmitter unit200aincludes (1) aTX data processor114athat receives and codes traffic data in accordance with a specific coding scheme to provide coded data and (2) amodulator116athat modulates the coded data in accordance with a specific modulation scheme to provide modulated data.TX data processor114aandmodulator116aare thus one embodiment ofTX data processor114 andmodulator116, respectively, inFIG. 1.

In the specific embodiment shown inFIG. 2A,TX data processor114aincludes anencoder212, achannel interleaver214, and a demultiplexer (Demux)216.Encoder212 receives and codes the traffic data (i.e., the information bits) in accordance with the selected coding scheme to provide coded bits. The coding increases the reliability of the data transmission. The selected coding scheme may include any combination of cyclic redundancy check (CRC) coding, convolutional coding, Turbo coding, block coding, and so on. Several designs forencoder212 are described below.

Channel interleaver

214 then interleaves the coded bits based on a particular interleaving scheme and provides interleaved coded bits. The interleaving provides time diversity for the coded bits, permits the data to be transmitted based on an average signal-to-noise-and-interference ratio (SNR) for the frequency and/or spatial subchannels used for the data transmission, combats fading, and further removes correlation between coded bits used to form each modulation symbol. The interleaving may further provide frequency diversity if the coded bits are transmitted over multiple frequency subchannels. The coding and channel interleaving are described in further detail below.

Demultiplexer

216 then demultiplexes the interleaved and coded data into N_Tcoded data streams for the N_Ttransmit antennas to be used for the data transmission. The N_Tcoded data streams are then provided to modulator116a.

In the specific embodiment shown inFIG. 2A, modulator116aincludes N_TOFDM modulators, with each OFDM modulator assigned to process a respective coded data stream for one transmit antenna. Each OFDM modulator includes a symbol mapping element222, an inverse fast Fourier transformer (IFFT)224, and a cyclic prefix generator226. In this embodiment, all N_T

symbol mapping elements

222athrough222timplement the same modulation scheme.

Within each OFDM modulator, symbol mapping element222 maps the received coded bits to modulation symbols for the (up to) N_Ffrequency subchannels to be used for data transmission on the transmit antenna associated with the OFDM modulator. The particular modulation scheme to be implemented by symbol mapping element222 is determined by the modulation control provided bycontroller130. For OFDM, the modulation may be achieved by grouping sets of q coded bits to form non-binary symbols and mapping each non-binary symbol to a specific point in a signal constellation corresponding to the selected modulation scheme (e.g., QPSK, M-PSK, M-QAM, or some other scheme). Each mapped signal point corresponds to an M-ary modulation symbol, where M=2^q. Symbol mapping element222 then provides a vector of (up to) N_Fmodulation symbols for each transmission symbol period, with the number of modulation symbols in each vector corresponding to the number of frequency subchannels to be used for data transmission for that transmission symbol period.

If conventional non-iterative symbol de-mapping and decoding are performed at the receiver system, then Gray mapping may be preferably used for the symbol mapping since it may provide better performance in terms of bit error rate (BER). With Gray mapping, the neighboring points in the signal constellation (in both the horizontal and vertical directions) differ by only one out of the q bit positions. Gray mapping reduces the number of bit errors for more likely error events, which correspond to a received modulation symbol being mapped to a location near the correct location, in which case only one coded bit would be received in error.

However, if iterative detection and decoding are performed as described below, it can be shown that non-Gray mapping outperforms Gray mapping. This is true due to the fact that independence between the coded bits enhances independence between the detection and decoding processes, which then provides improved performance when iterative detection and decoding are performed. Thus, each symbol mapping element222 may be designed to implement a non-Gray mapped constellation. In certain instances, improved performance may be achieved if the constellation is defined such that neighboring points differ by as many bit positions as possible (i.e., the opposite goal as for Gray mapping, or “anti-Gray” mapping).

IFFT224 then converts each modulation symbol vector into its time-domain representation (which is referred to as an OFDM symbol) using the inverse fast Fourier transform. IFFT224 may be designed to perform the inverse transform on any number of frequency subchannels (e.g., 8, 16, 32, . . . , NF, . . . ). In an embodiment, for each OFDM symbol, cyclic prefix generator226 repeats a portion of the OFDM symbol to form a corresponding transmission symbol. The cyclic prefix ensures that the transmission symbol retains its orthogonal properties in the presence of multipath delay spread, thereby improving performance against deleterious path effects such as channel dispersion caused by frequency selective fading. The transmission symbols from cyclic prefix generator226 are then provided to an associated transmitter122 and processed to generate a modulated signal, which is then transmitted from the associated antenna124.

FIG. 2B is a block diagram of atransmitter unit200b, which is another embodiment of the transmitter portion oftransmitter system110 inFIG. 1. In this embodiment, a particular coding scheme is used for each of the N_Ttransmit antennas and a particular modulation scheme is used for all N_Ffrequency subchannels of each transmit antenna (i.e., separate coding and modulation on a per-antenna basis). The specific coding and modulation schemes to be used for each transmit antenna may be selected based on the expected channel conditions (e.g., by the receiver system and sent back to the transmitter system).

Transmitter unit

200bincludes (1) aTX data processor114bthat receives and codes traffic data in accordance with separate coding schemes to provide coded data and (2) amodulator116bthat modulates the coded data in accordance with separate modulation schemes to provide modulated data.TX data processor114bandmodulator116bare another embodiment ofTX data processor114 andmodulator116, respectively, inFIG. 1.

In the specific embodiment shown inFIG. 2B,TX data processor114bincludes ademultiplexer210, N_Tencoders212athrough212t, and N_Tchannel interleavers214athrough214t(i.e., one set of encoder and channel interleaver for each transmit antenna).Demultiplexer210 demultiplexes the traffic data (i.e., the information bits) into N_Tdata streams for the N_Ttransmit antennas to be used for the data transmission. Each data stream is then provided to arespective encoder212.

Eachencoder212 receives and codes a respective data stream based on the specific coding scheme selected for the corresponding transmit antenna to provide coded bits. The coded bits from eachencoder212 are then provided to arespective channel interleaver214, which interleaves the coded bits based on a particular interleaving scheme to provide diversity.Channel interleavers214athrough214tthen provide to modulator116bN_Tinterleaved and coded data streams for the N_Ttransmit antennas.

In the specific embodiment shown inFIG. 2B,modulator116bincludes N_TOFDM modulators, with each OFDM modulator including symbol mapping element222, IFFT224, and cyclic prefix generator226. In this embodiment, the N_T

symbol mapping elements

222athrough222tmay implement different modulation schemes. Within each OFDM modulator, symbol mapping element222 maps groups of q_ncoded bits to form M_n-ary modulation symbols, where M_ncorresponds to the specific modulation scheme selected for the n-th transmit antenna (as determined by the modulation control provided by controller130) and M_n=2^qⁿ. The subsequent processing by IFFT224 and cyclic prefix generator226 is as described above.

Other designs for the transmitter unit may also be implemented and are within the scope of the invention. For example, the coding and modulation may be separately performed for each subset of transmit antennas, each transmission channel, or each group of transmission channels. The implementation ofencoders212,channel interleavers214, symbol mapping elements222, IFFTs224, and cyclic prefix generators226 is known in the art and not described in detail herein.

The coding and modulation for MIMO systems with and without OFDM are described in further detail in U.S. patent application Ser. Nos. 09/826,481 and 09/956,449, both entitled “Method and Apparatus for Utilizing Channel State Information in a Wireless Communication System,” respectively filed Mar. 23, 2001 and Sep. 18, 2001; U.S. patent application Ser. No. 09/854,235, entitled “Method and Apparatus for Processing Data in a Multiple-Input Multiple-Output (MIMO) Communication System Utilizing Channel State Information,” filed May 11, 2001; U.S. patent application Ser. No. 09/776,075, entitled “Coding Scheme for a Wireless Communication System,” filed Feb. 1, 2001; and U.S. patent application Ser. No. 09/993,087, entitled “Multiple-Access Multiple-Input Multiple-Output (MIMO) Communication System,” filed Nov. 6, 2001. These applications are all assigned to the assignee of the present application and incorporated herein by reference. Still other coding and modulation schemes may also be used, and this is within the scope of the invention.

An example OFDM system is described in U.S. patent application Ser. No. 09/532,492, entitled “High Efficiency, High Performance Communication System Employing Multi-Carrier Modulation,” filed Mar. 30, 2000, assigned to the assignee of the present invention and incorporated herein by reference. OFDM is also described by John A. C. Bingham in a paper entitled “Multicarrier Modulation for Data Transmission: An Idea Whose Time Has Come,” IEEE Communications Magazine, May 1990, which is incorporated herein by reference.

Encoding

Various types of encoder may be used to code data prior to transmission. For example, the encoder may implement any one of the following (1) a serial concatenated convolutional code (SCCC), (2) a parallel concatenated convolutional code (PCCC), (3) a simple convolutional code, (4) a concatenated code comprised of a block code and a convolutional code, and so on. Concatenated convolutional codes are also referred to as Turbo codes.

FIG. 3A is a block diagram of an embodiment of a serial concatenatedconvolutional encoder212x, which may be used for each ofencoders212 inFIGS. 2A and 2B.Encoder212xincludes an outerconvolutional encoder312a, acode interleaver314, and an innerconvolutional encoder312b, all coupled in series. Outerconvolutional encoder312acodes the information bits with a particular outer code of code rate R_o. The coded output fromencoder312ais provided tocode interleaver314, which interleaves each packet of N_Pcoded bits in accordance with a particular (e.g., pseudo-random) interleaving scheme.

Code interleaver

314 may implement any one of a number of interleaving schemes, such as the ones used for cdma2000 and W-CDMA. In one specific interleaving scheme, the N_Pcoded bits in a packet are written, by row, into a 2⁵-row by 2ⁿ-column array, where n is the smallest integer such that N_P≦2⁵⁺ⁿ. The rows are then shuffled in accordance with a bit-reversal rule. For example, row1 (“00001”) is swapped with row16 (“10000”), row3 (“00011”) is swapped with row24 (“11000”), and so on. The bits within each row are then permutated (i.e., rearranged) according to a row-specific linear congruential sequence (LCS). The LCS for row k may be defined as x_k(i+1)={x_k(i)+c_k}mod 2ⁿ, where i=0, 1, . . . 2ⁿ−1, x_k(0)=c_k, and c_kis a specific value selected for each row and is further dependent on the value for n. For the permutation in each row, the i-th bit in the row is placed in location x(i). The bits in the array are then read out by column.

The LCS code interleaving scheme is described in further detail in commonly assigned U.S. patent application Ser. No. 09/205,511, entitled “Turbo Code Interleaver Using Linear Congruential Sequences,” filed Dec. 4, 1998, and in a cdma2000 document entitled “C.S0002-A-1 Physical Layer Standard for cdma2000 Spread Spectrum Systems,” both of which are incorporated herein by reference. Other code interleavers may also be used and are within the scope of the invention. For example, a random interleaver or a symmetrical-random (S-random) interleaver may also be used instead of the LCS interleaver described above.

Innerconvolutional encoder312breceives and further codes the interleaved bits fromcode interleaver314 with a particular inner code of code rate R_i. In an embodiment,encoder312bimplements a recursive code to fully realize the benefit of the significant interleaving gain provided bycode interleaver314. The inner code does not need to be a powerful code since the key desired property is recursiveness. In fact, the inner code may simply be a rate-1 differential code. The overall code rate for serial concatenatedconvolutional encoder212xis R_sccc=R_o·R_i.

FIG. 3B is a block diagram of an embodiment of a parallel concatenatedconvolutional encoder212y, which may also be used for each ofencoders212 inFIGS. 2A and 2B.Encoder212yincludes two constituent

convolutional encoder

312cand312d, acode interleaver324, apuncturing element326, and a parallel-to-serial (P/S)converter328.Code interleaver324 interleaves the information bits in accordance with a particular (i.e., pseudo-random) interleaving scheme, and may be implemented as described above forcode interleaver314.

As shown inFIG. 3B, the information bits are provided toconvolutional encoder312cand the interleaved information bits are provided toconvolutional encoder312d. Each encoder312 codes the received bits based on a particular constituent code and provides a respective stream of parity bits.

Encoders

312cand312dmay be implemented with two recursive systematic constituent codes with code rates of R₁and R₂, respectively. The recursive codes maximize the benefits provided by the interleaving gain.

The parity bits b^yand b^zfrom

encoders

312cand312d, respectively, are provided to puncturingelement326, which punctures (i.e., deletes) zero or more of the parity bits to provide the desired number of output bits. Puncturingelement326 is an optional element that may be used to adjust the overall code rate, R_PCCC, of the parallel concatenated convolutional encoder, which is given by 1/R_PCCC=1/R₁+1/R₂−1.

The information bits (which are also referred to as the systematic bits), and the punctured parity bits from

convolutional encoders

312cand312dare provided to P/S converter328 and serialized into a coded bit stream that is provided to the next processing element.

FIG. 3C is a block diagram of an embodiment of a recursiveconvolutional encoder312x, which may be used for each ofencoders312athrough312dinFIGS. 3A and 3B.Encoder312xmay also be used for each ofencoders212 inFIGS. 2A and 2B.

In the embodiment shown inFIG. 3C,encoder312ximplements the following, transfer function for the recursive convolutional code:

G (D) = [\begin{matrix} 1 & \frac{n (D)}{d (D)} \end{matrix}],

where

- n(D)=1+D+D³, and
- d(D)=1+D²+D³.
  Encoder312xmay also be designed to implement other convolutional codes, and this is within the scope of the invention.

Encoder

312xincludes a number of series-coupled delay elements332, a number of modulo-2 adders334, and a switch336. Initially, the states of delay elements332 are set to zeros and switch336 is in the up position. Then, for each received bit in a packet,adder334aperforms modulo-2 addition of the received bit with the output bit fromadder334cand provides the result to delayelement332a.Adder334bperforms modulo-2 addition of the bits fromadder334aand delay

elements

332aand332cand provides the parity bit.Adder334cperforms modulo-2 addition of the bits from

delay elements

332band332c.

After all N₁information bits in the packet have been coded, switch336 is moved to the down position and three zero (“0”) bits are provided to encoder312x.Encoder312xthen codes the three zero bits and provides three tail systematic bits and three tail parity bits.

It can be shown analytically and via computer simulations that SCCCs provide better performance than PCCCs in additive white Gaussian noise (AWGN) channels at medium to high SNR levels, which is typically the desired operating region for MIMO systems. While the BER for PCCCs asymptotically reaches an error floor, this floor is absent or much lower for SCCCs. PCCCs outperform SCCCs in the high BER region, and may be more suitably used when the system loads approach the capacity limits of the channel at low SNRs. Both PCCCs and SCCCs may be implemented using relatively simple constituent codes (e.g., having constraint lengths of 3 to 16), such as the one shown inFIG. 3C.

Channel Interleaving

Referring back toFIGS. 2A and 2B, the coded bits from eachencoder212 are interleaved by arespective channel interleaver214 to provide temporal, frequency, and/or spatial diversity against deleterious path effects (e.g., fading and multipath). Moreover, since the coded bits are subsequently grouped together to form non-binary symbols that are then mapped to M-ary modulation symbols, the interleaving may be used to ensure that the coded bits that form each modulation symbol are not located close to each other temporally (i.e., the channel interleaving distributes the coded bits that are temporally close together in a pseudo-random manner among modulation symbols that may be transmitted over different frequency subchannels, spatial subchannels, and/or transmission symbol periods). The combination of encoding, channel interleaving and symbol mapping (especially anti-Gray mapping) may be viewed as a serial concatenated code, where the symbol mapper takes on the role of the inner code. The channel interleaver provides interleaving gain in much the same way as in an SCCC, as described earlier. This potential for performance gain is unlocked by the iterative receiver structure described below. The channel interleaving can provide improved performance for various coding and modulation schemes, such as a single common coding and modulation scheme for all transmit antennas or separate coding and modulation scheme per antenna.

Various interleaving schemes may be used for the channel interleaver. In one interleaving scheme, the coded bits for each packet are written (linearly) to rows of an array. The bits in each row may then be permutated (i.e., rearranged) based on (1) a bit-reversal rule, (2) a linear congruential sequence (such as the one described above for the code interleaver), (3) a randomly generated pattern, or (4) a permutation pattern generated in some other manner. The rows are also permutated in accordance with a particular row permutation pattern. The permutated coded bits are then retrieved from each column of the array and provided to the next processing element. Other channel interleaving schemes may also be used and this is within the scope of the invention.

In an embodiment, the channel interleaving is performed separately for each independently coded data stream. For the PCCCs, the information bits and the tail and parity bits for each packet may also be channel interleaved separately. For example, the information bits b^x, the tail and parity bits b^yfrom the firstconstituent encoder312c, and the tail and parity bits b^zfrom the secondconstituent encoder312dmay be interleaved by three separate channel interleavers, which may employ the same or different interleaving schemes. This separate channel interleaving allows for flexible puncturing of the individual parity bits.

The interleaving interval may be selected to provide the desired temporal, frequency, and/or spatial diversity, or any combination thereof. For example, the coded bits for a particular time period (e.g., 10 msec, 20 msec, and so on) and for a particular combination of transmission channels may be interleaved. The channel interleaving may be performed for each transmit antenna, or across each group of transmit antennas or across all transmit antennas to provide spatial diversity. The channel interleaving may also be performed for each frequency subchannel, or across each group of frequency subchannels or across all frequency subchannels to provide frequency diversity. The channel interleaving may also be performed across each group of one or more frequency subchannels of each group of one or more transmit antennas such that the coded bits from one data stream may be distributed over one or more frequency subchannels of one or more transmit antennas to provide a combination of temporal, frequency, and spatial diversity. The channel interleaving may also be performed across all frequency subchannels of all transmit antennas.

Receiver System

FIG. 4A is a block diagram of an embodiment of areceiver unit400a, which is an embodiment of the receiver portion ofreceiver system150 inFIG. 1. In this embodiment, a single demodulation scheme is used for all N_Ffrequency subchannels of all N_Ttransmit antennas and a single decoding scheme is used for all transmit antennas.Receiver unit400amay thus be used to receive a data transmission fromtransmitter unit200ainFIG. 2A.

The signals transmitted from the N_Ttransmit antennas are initially received by each of N_Rantennas152athrough152rand routed to a respective receiver154 (which is also referred to as a front-end unit). Each receiver154 conditions (e.g., filters, amplifies, and downconverts) a respective received signal and further digitizes the conditioned signal to provide data samples. Each receiver154 may further demodulate the data samples with a recovered pilot to provide a stream of received transmission symbols, which is provided to a demodulator156a.

In the specific embodiment shown inFIG. 4A, demodulator156aincludes N_ROFDM demodulators, with each OFDM demodulator assigned to process a respective transmission symbol stream from one receive antenna. Each OFDM demodulator includes a cyclic prefix remover412 and a fast Fourier transformer (FFT)414. Cyclic prefix remover412 removes the cyclic prefix previously appended to each OFDM symbol by the transmitter system to ensure ISI-free reception of the transmitted modulation symbols. FFT414 then transforms each received OFDM symbol to provide a vector of N_Freceived modulation symbols for the N_Ffrequency subchannels used to transmit the OFDM symbol. The N_Rmodulation symbol vectors from all N_ROFDM demodulators for each transmission symbol period are provided to a detector/decoder158a, which is one embodiment of detector/decoder158 inFIG. 1.

In the embodiment shown inFIG. 4A, detector/decoder158aincludes adetector420aand a decoder430 that perform iterative detection and decoding on the modulation symbols received from all N_Rreceive antennas to provide decoded data. The iterative detection and decoding exploits the error correction capabilities of the channel code to provide improved performance. This is achieved by iteratively passing soft “a priori” information between the soft-input soft-output (SISO)detector420aand the soft-input soft-output decoder430, as described in further detail below.

Detector

420areceives the modulation symbols fromdemodulator156aand a priori information from decoder430 and derives soft-decision (i.e., multi-bit) symbols for all N_Ffrequency subchannels of all N_Ttransmit antennas, with each such soft-decision symbol being an estimate of a coded bit transmitted by the transmitter system. As described in further detail below, the soft-decision symbols may be represented as log-likelihood ratios (LLRs), which are denoted as L(b_k) inFIG. 4A.

For each transmission symbol period,detector420aprovides up to N_Bsoft-decision symbols to N_Brespective summers422, where N_B=N_T·N_F·q and q is dependent on the specific modulation scheme used for the data transmission. Each summer422 also receives the a priori information for its coded bit b_kfrom decoder430 (which is referred to as the detector a priori information and denoted as L_a(b_k)), and subtracts this detector a priori information from the received soft-decision symbol to derive extrinsic information for the coded bit (denoted as L_e(b_k)). The extrinsic information for all (N_T·N_F·q) coded bits is then (1) converted from parallel to serial by a P/S converter424, (2) deinterleaved by achannel deinterleaver426 in a manner complementary to the channel interleaving performed at the transmitter system, and (3) provided as a priori information from the detector to the decoder (which is referred to as the decoder a priori information and denoted as L_a^D(b_k)).

Decoder430 uses the decoder a priori information in the decoding process and provides the decoded data. Decoder430 further provides “a posteriori” information (denoted as L^D(b_k)) to asummer432.Summer432 then subtracts the decoder a priori information, L_a^D(b_k), from the decoder a posteriori information, L^D(b_k), to derive extrinsic information from the decoder for the detector (denoted as L_e^D(b_k)). This detector extrinsic information is then interleaved by achannel interleaver434, converted from serial to parallel by a S/P converter436, and provided as the detector a priori information, L_a(b_k), todetector420aand summers422.

To briefly summarize, the output of the detection process may be expressed as:
L_e(b_k)=L(b_k)−L_a(b_k), Eq (1)
where L(b_k) represents the soft-decision symbol for the k-th coded bit b_k;

- L_a(b_k) represents the detector a priori information for the k-th coded bit, which is provided by the decoder; and
- L_e(b_k) represents the extrinsic information for the k-th coded bit provided by the detector to the decoder.
  The output of the decoding process may similarly be expressed as:
  L_e^D(b_k)=L^D(b_k)−L_a^D(b_k), Eq (2)
  where L^D(b_k) represents the a posteriori information for the k-th coded bit provided by the decoder;
- L_a^D(b_k) represents the decoder a priori information for the k-th coded bit provided by the detector; and
- L_e^D(b_k) represents the extrinsic information for the k-th coded bit provided by the decoder to the detector.

As shown inFIG. 4A, the decoder a priori information, L_a^D(b_k), is simply the detector extrinsic information, L_e(b_k), after the parallel-to-serial conversion and channel deinterleaving. Similarly, the detector a priori information, L_a(b_k), is simply the decoder extrinsic information, L_e^D(b_k), after the channel interleaving and serial-to-parallel conversion.

The detection and decoding process may be iterated a number of times. During the iterative detection and decoding process, the reliability of the bit decisions is improved with each iteration. The iterative detection and decoding process described herein may be used to combat frequency selective fading (e.g., by using OFDM with cyclic prefix) as well as flat fading (without any modifications). Moreover, the iterative detection and decoding process may be flexibly used with various types of coding and modulation schemes, including the serial and parallel concatenated convolutional codes as described above.

InFIG. 4A,detector420aprovides soft-decision symbols for the transmitted coded bits based on the modulation symbols received from the N_Rreceive antennas as well as the a priori information fed back from decoder430. The soft-decision symbols may be conveniently represented in the form of log-likelihood ratios (LLRs) and include channel information, extrinsic information, and a priori information. The channel information for each coded bit includes information about the channel response between the transmit and receive antennas. The extrinsic information for each coded bit comprises incremental information about that coded bit that is extracted from other coded bits in the detection process. And the a priori information for each coded bit includes information about the coded bit that is known or derived outside the detection process.

In an embodiment, only the channel information and extrinsic information are passed from the detector to the decoder where, after parallel-to-serial conversion and channel deinterleaving, they are used as a priori information in the decoding process. For simplicity, the channel information and extrinsic information are collectively referred to as simply the extrinsic information. Ideally, the decoder a priori information should be provided by an independent source. However, since such a source is not available, an independent source may be mimicked by minimizing the correlation between the decoder a priori information (i.e., the detector output) and previous decisions made by the decoder (i.e., the detector a priori information). This is achieved by subtracting the detector a priori information from the soft-decision symbols derived by the detector, using summers422 as shown inFIG. 4A.

LLR Computation by Detector

The modulation symbol received from the output of the OFDM demodulator coupled to the m-th receive antenna for the l-th frequency subchannel at time index j (i.e., transmission symbol period j) may be expressed as:

\begin{matrix} r_{m, l} (j) = \sum_{n = 1}^{N_{T}} h_{n, m, l} (j) \cdot c_{n, l} (j) + n_{m, l} (j), & Eq (3) \end{matrix}

where h_n,m,l(j) is the channel response between the n-th transmit antenna and the m-th receive antenna for the l-th frequency subchannel at time index j;

- C_n,l(j) is the modulation symbol transmitted on the l-th frequency subchannel of the n-th transmit antenna; and
- n_m,l(j) is a sample function of a zero-mean, temporally and spatially white Gaussian noise process.
  To simplify notation, the time index j is dropped in the following derivations.

Equation (3) may be expressed in matrix form, as follows:
r_l=H_lc_l+n_l, forl=0, 1, 2, . . . , N_F−1, Eq (4)
wherer_l=[r_1.lr_2.l. . . r_N_R_,l]^Tis a vector of N_Rmodulation symbols received from the N_Rreceive antennas for the l-th frequency subchannel;

- H_lis the N_R×N_Tmatrix of channel gains {h_n,m,l} for the l-th frequency subchannel, where h_n,m,ldenotes the complex channel gain between the n-th transmit antenna and the m-th receive antenna for the l-th frequency subchannel;
- c_l=[c_1.lc_2.l. . . c_N_T_,l]^Tis a vector of N_Tmodulation symbols transmitted from the N_Ttransmit antennas for the l-th frequency subchannel;
- n_l=[n_1.ln_2.l. . . n_N_R_,l]^Tis a vector of N_Rnoise samples for the N_Rreceive antennas for the l-th frequency subchannel; and
- “^T” denotes the transposition.

The modulation symbols received from all N_Ffrequency subchannels of all N_Rreceive antennas for each time index may be expressed as:
r=[r₀^Tr₁^T. . .r_N_F₋₁^T]^T. Eq (5)

The N_F·N_Rreceived modulation symbols inr correspond to the N_F·N_Ttransmitted modulation symbols, which may be expressed as:
c=[c₁^Tc₂^T. . .c_N_T^T]^T. Eq (6)

As noted above, each modulation symbol is formed by a respective group of q coded bits. The N_F·N_Rreceived modulation symbols inr thus further correspond to the N_F·N_T·q transmitted coded bits, which may be expressed as:
b=[b₁^T,b₂^T, . . .b_N_T^T]^T, Eq (7)
where the coded bits transmitted from the n-th transmit antenna may be expressed as
b_n=[b_n,0,1. . . b_n,0,qb_n,1,l. . . b_n,1,q. . . b_n,N_F_−1,l. . . b_n,N_F_−1,q]^T.

The detector computes the LLRs for each transmitted coded bit b_n,l,i, as follows:

\begin{matrix} L (b_{n, l, i}) = \ln \frac{\Pr {b_{n, l, i} = + 1 | \underset{\underline{_}}{r}}}{\Pr {b_{n, l, i} = - 1 | \underset{\underline{_}}{r}}}, \begin{matrix} for n = 1, 2, \dots, N_{T}, \\ l = 0, 1, \dots, N_{F} - 1, \\ and i = 1, 2, \dots, q . \end{matrix} & Eq (8) \end{matrix}

As shown in equation (8), the LLR for a given coded bit, L(b_n,l,i), is computed as the (natural) logarithm of the ratio of the probability of the coded bit b_n,l,ibeing a +1 given the received modulation symbolsr, Pr{b_n,l,i=+1|r}, over the probability of the coded bit b_n,l,ibeing a −1 given the received modulation symbolsr, Pr{b_n,l,i=−1|r}. The probabilities for each coded bit are derived based on the received modulation symbol containing that bit and the sequence of coded bits received forr, as derived below.

The following equalities may be expressed:

\begin{matrix} \begin{matrix} \Pr {b_{n, l, i} = \pm 1 | \underset{\underline{_}}{r}} \Pr {\underset{\underline{_}}{r}} = \sum_{\underset{\underline{_}}{b} : b_{n, l, i} = \pm 1} \Pr {\underset{\underline{_}}{r}, \underset{\underline{_}}{b}} \\ = \sum_{\underset{b_{n, l, i} = \pm 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} \Pr {\underset{\underline{_}}{r}, \underset{\underline{_}}{c}} \\ = \sum_{\underset{b_{n, l, i} = \pm 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} \Pr {\underset{\underline{_}}{r} | \underset{\underline{_}}{c}} \Pr {\underset{\underline{_}}{c}}, \end{matrix} & Eq (9) \end{matrix}

where ƒ(·) represents the symbol mapping from the coded bitsb to the modulation symbolsc. The LLRs may then be expressed as:

\begin{matrix} L (b_{n, l, i}) = \ln \frac{\sum_{\underset{b_{n, l, i} = \pm 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} \Pr {\underset{\underline{_}}{r} | \underset{\underline{_}}{c}} \Pr {\underset{\underline{_}}{c}}}{\sum_{\underset{b_{n, l, i} = - 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} \Pr {\underset{\underline{_}}{r} | \underset{\underline{_}}{c}} \Pr {\underset{\underline{_}}{c}}} . & Eq (10) \end{matrix}

In the first iteration of the iterative detection and decoding process, it is assumed that all points in the signal constellation are equally likely. Hence, the term Pr{c} can be removed from the numerator and denominator of equation (10). In subsequent iterations, however, the only assumption is that the transmitted modulation symbols are independent. Furthermore, since the coded bits that make up the modulation symbols are interleaved, it is assumed that the bit probabilities are independent. Based on these assumptions, the term Pr{c} may expressed as:

\begin{matrix} \Pr {\underset{\underline{_}}{c}} = \prod_{n = 1}^{N_{T}} \prod_{l = 0}^{N_{F} - 1} \prod_{i = 1}^{q} \Pr {b_{n, l, i}} = \prod_{p = 1}^{N_{T} N_{F} q} \Pr {b_{p}}, & Eq (11) \end{matrix}

where a change in notation of variables is made (i.e., p={n,l,i}) in the term to the right of the equality to simplify notation.

The received modulation symbols r_1,l, r_2,l, . . . , r_N_R_,N_F₋₁are conditionally independent givenc. The term Pr{r|c} may then be expressed as:

\begin{matrix} \begin{matrix} \Pr {\underset{\underline{_}}{r} | \underset{\underline{_}}{c}} = \prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \Pr {r_{m, l} | \underset{\underline{_}}{c}} \\ = \prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (- \frac{1}{2 σ^{2}} {\langle r_{m, l} - \sum_{n = 1}^{N_{T}} h_{n, m, l} \cdot c_{n, l} \rangle}^{2}), \end{matrix} & Eq (12) \end{matrix}

where σ²is the noise spectral density given by σ²=N₀/2.

Substituting equations (11) and (12) into equation (10), the LLR for the k-th coded bit may then be expressed as:

\begin{matrix} \begin{matrix} L (b_{k}) = \\ \ln \frac{\sum_{\underset{b_{k} = + 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (- \frac{1}{2 σ^{2}} {\langle \begin{matrix} r_{m, l} - \\ \sum_{n = 1}^{N_{T}} h_{n, m, l} \cdot c_{n, l} \end{matrix} \rangle}^{2}) \prod_{p = 1}^{N_{T} N_{F} q} \Pr {b_{p}}]}{\sum_{\underset{b_{k} = - 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (- \frac{1}{2 σ^{2}} {\langle \begin{matrix} r_{m, l} - \\ \sum_{n = 1}^{N_{T}} h_{n, m, l} \cdot c_{n, l} \end{matrix} \rangle}^{2}) \prod_{p = 1}^{N_{T} N_{F} q} \Pr {b_{p}}]}, \end{matrix} & Eq (13) \end{matrix}

where k={n,l,i}. Equation (13) may further be decomposed as follows:

\begin{matrix} \begin{matrix} L (b_{k}) = \ln \frac{\sum_{\underset{b_{k} = + 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (β_{m, l}) \prod_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \Pr {b_{p}}]}{\sum_{\underset{b_{k} = - 1}{\underset{\underline{_}}{c} : \underset{\underline{_}}{c} = f (\underset{\underline{_}}{b})}} [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (β_{m, l}) \prod_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \Pr {b_{p}}]} + \\ \ln \frac{\Pr {b_{k} = + 1}}{\Pr {b_{k} = - 1}} \\ = L_{e} (b_{k}) + L_{a} (b_{k}), \\ where \\ β_{m, l} = - \frac{1}{2 σ^{2}} {\langle r_{m, l} - \sum_{n = 1}^{N_{T}} h_{n, m, l} \cdot c_{n, l} \rangle}^{2} . \end{matrix} & Eq (14) \end{matrix}

As shown in equation (14), the LLR for the k-th coded bit, L(b_k), may be decomposed into two parts. The term L_a(b_k) represents the a priori information for the k-th coded bit computed by the decoder and fed back to the detector. This detector a priori information is expressed in the form of a priori LLRs, which may be expressed as:

\begin{matrix} L_{a} (b_{k}) = \ln \frac{\Pr {b_{k} = + 1}}{\Pr {b_{k} = - 1}} . & Eq (15) \end{matrix}

The term L_e(b_k) represents the extrinsic information for the k-th coded bit computed by the detector and fed forward to the decoder. The product of the a priori probabilities, ΠPr{b_p}, in equation (14) may be expressed as:

\begin{matrix} \prod_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \Pr {b_{p}} = C \cdot \exp (\prod_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p})) = C \cdot \exp (α), & Eq (16) \end{matrix}

where C is a constant and

α = \sum_{p = 1, p \neq k}^{N_{T} N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p}) .

Hence, the detector extrinsic information, L_e(b_k), may be expressed in terms of the detector a priori LLRs, as follows:

\begin{matrix} L_{e} (b_{k}) = \ln \frac{\begin{matrix} \sum_{\underset{b_{k} = + 1}{\underset{\underline{_}}{c} : c = f (\underset{\underline{_}}{b})}} \\ [\prod_{m = 1}^{N_{R}} \prod_{i = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (\sum_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p}))] \end{matrix}}{\begin{matrix} \sum_{\underset{b_{k} = - 1}{\underset{\underline{_}}{c} : c = f (\underset{\underline{_}}{b})}} \\ [\prod_{m = 1}^{N_{R}} \prod_{i = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (\sum_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p}))] \end{matrix}} . & Eq (17) \end{matrix}

Since the detector a priori information, L_a(b_k), is known by the decoder, it may be subtracted from L(b_k) by summers422 inFIG. 4A such that only the detector extrinsic information, L_e(b_k), is provided to the decoder.

It can be seen from equations (13) and (17) that the computational complexity to derive the LLRs for the coded bits grows exponentially with the number of frequency subchannels (N_F), the number of transmit antennas (N_T), and the size of the signal constellation (2^q). Several techniques may be used to reduce the computational burden to derive the coded bit LLRS. Such techniques include the use of interference nulling to isolate each transmitted signal by removing the other interferers and the use of a “dual-maxima” or some other approximation to compute the LLRS. These techniques are described in further detail below.

Without loss of generality, the signal from transmitantenna1 may be treated as the desired signal and the other signals from the remaining (N_T−1) transmit antennas may be treated as interference to the desired signal. With N_Rreceive antennas, where N_R≧N_T, the (N_T−1) interferers may be nulled (or canceled). For each of the N_Ffrequency subchannels, the vector of N_Rmodulation symbols,r_l(which are received from the N_Rreceive antennas for the l-th frequency subchannel) may be pre-multiplied by an (N_R−N_T+1)×N_Rnulling matrix,Θ_l⁽¹⁾, and the resulting vector{tilde over (r)}_l⁽¹⁾of (N_R−N_T+1) elements may be expressed as:
{tilde over (r)}_l⁽¹⁾=Θ_l⁽¹⁾r_l=Θ_l⁽¹⁾H_lc_l+Θ_l⁽¹⁾n_l={tilde over (H)}_l⁽¹⁾c_1,l+ñ_l⁽¹⁾, forl=0, 1, . . . , N_F−1. Eq (18)

As shown in equation (18), the components from transmitantennas 2, 3, . . . , N_Tare suppressed in the vector{tilde over (r)}_l⁽¹⁾and only the component c_1,lfrom desired transmitantenna 1 remains.

The nulling matrices,Θ_l⁽ⁿ⁾, may be determined based on algorithms known in the art. The derivation of the nulling matrix,Θ_l⁽¹⁾, for transmitantenna1 is briefly described as follows. First, the N_R×(N_T−1) channel response matrix,H_l⁽¹⁾, for transmitantennas 2 through NT and the NR receive antennas is determined. A set of (N_R−N_T+1) orthonormal vectors {ν₁⁽¹⁾ν₂⁽¹⁾. . . ν_N_R_−N_T₊₁⁽¹⁾}, whose members are the rows of the nulling matrix,Θ_l⁽¹⁾, is then computed such that
Θ_l⁽¹⁾H_l⁽¹⁾=0,
where0 is the all-zero matrix, and
Θ_l⁽¹⁾Θ_l^(1)*=I,
whereΘ_l^(1)*is the Hermitian ofΘ_l⁽¹⁾andI is the identity matrix (i.e., all ones along the diagonal and zeros elsewhere). Fast algorithms are available for computing the orthonormal vectors, as is known in the art. As indicated by the notation, different nulling matrices are derived for different transmit antennas and different frequency subchannels (i.e.,Θ_l⁽ⁿ⁾for n=1, 2, . . . , N_T, and l=0, 1, . . . , N_F−1).

Derivation of the nulling matrices for a MIMO system is described in further detail by Vahid Tarokh et al in a paper entitled “Combined Array Processing and Space-Time Coding,” IEEE Transactions on Information Theory, Vol. 45, No. 4, May 1999, which is incorporated herein by reference.

After nulling the interference on the desired signal due to the signals from the other (N_T−1) transmit antennas, the LLRs for the coded bits from the desired transmit antenna may then be calculated in a similar manner as described above, without regard to the components from the other (N_T−1) transmit antennas. For transmitantenna 1, the LLRs for the coded bits transmitted on all N_Ffrequency subchannels of this transmit antenna, [b_1,0,1. . . b_1,0,qb_1,1,1. . . b_1,1,q. . . b_1,N_F_−1,1. . . b_1,N_F_−1,q], may be expressed as:

\begin{matrix} L (b_{1, l, i}) = \ln \frac{\Pr {b_{1, l, i} = + 1 | {\underset{\underline{_}}{\tilde{r}}}^{(1)}}}{\Pr {b_{1, l, i} = - 1 | {\underset{\underline{_}}{\tilde{r}}}^{(1)}}}, \begin{matrix} for l = 0, 1, \dots, N_{F} - 1 \\ and i = 1, 2, \dots, q, \end{matrix} & Eq (19) \end{matrix}

where{tilde over (r)}⁽¹⁾=[{tilde over (r)}₀^(1)T{tilde over (r)}_l^(1)T. . .{tilde over (r)}_N_F₋₁^(1)T]^T.

After the interference nulling, the LLR computation is simplified since only the desired signal from one transmit antenna is considered at a time. Equation (19) may be expressed in a form similar to equation (14), as follows:

\begin{matrix} L^{(1)} (b_{k}) = \ln \frac{\begin{matrix} \sum_{\underset{b_{k} = + 1}{c_{1} : c_{1} = f (b_{1})}} \\ [\prod_{m = 1}^{N_{R}} \prod_{i = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (\sum_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p}))] \end{matrix}}{\begin{matrix} \sum_{\underset{b_{k} = - 1}{\underset{\underline{_}}{c} : c = f (\underset{\underline{_}}{b})}} \\ [\prod_{m = 1}^{N_{R}} \prod_{i = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (\sum_{\underset{p \neq k}{p = 1}}^{N_{T} N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p}))] \end{matrix}} + \ln \frac{\Pr [b_{k} = + 1}}{\Pr {b_{k} = - 1}}, & Eq (20) \end{matrix}

where k=1, 2, . . . , N_F·q and k={m, l}.

As shown in equation (20), instead of calculating (N_F·N_T·q) LLR values for all N_Ttransmit antennas, only (N_F·q) LLR values are calculated at a time for each of N_Ttransmit antennas. However, by performing the interference nulling, the complexity of the calculation in Eq (20) is no longer exponential in the number of transmit antennas N_Tsince (1) each summation is performed over only the modulation symbolsc_ntransmitted from the desired n-th transmit antenna, and (2) the term

\prod \Pr {b_{p}}

is evaluated only for the coded bits transmitted from the n-th transmit antenna.

The product of the a priori probabilities,

\prod \Pr {b_{p}},

in equation (20) may be expressed as:

\begin{matrix} \prod_{\underset{p \neq k}{p = 1}}^{N_{F} q} \Pr {b_{p}} = C \cdot \exp (\sum_{\underset{p \neq k}{p = 1}}^{N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p})) = C \cdot \exp (α_{n}) . & Eq (21) \end{matrix}

The detector extrinsic information, L_e⁽ⁿ⁾(b_k), may then be expressed in terms of the detector a priori LLRs, as follows:

\begin{matrix} L_{e}^{(n)} (b_{k}) = \ln \frac{\begin{matrix} \sum_{\underset{b_{k} = + 1}{\underline{c} : \underline{c} = f ({\underline{b}}_{n})}} \\ [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (\sum_{\underset{p \neq k}{p = 1}}^{N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p}))] \end{matrix}}{\begin{matrix} \sum_{\underset{b_{k} = - 1}{\underline{c} : \underline{c} = f ({\underline{b}}_{n})}} \\ [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (\sum_{\underset{p \neq k}{p = 1}}^{N_{F} q} \frac{1}{2} b_{p} L_{a} (b_{p}))] \end{matrix}} . & Eq (22) \end{matrix}

The detection with interference nulling described above may be repeated N_Ttimes, once for each transmit antenna. For each repetition to recover the desired signal from a particular transmit antenna, the (N_T−1) interferers of this desired signal may be nulled out by pre-multiplying the received modulation symbol vectors,r_l, with the nulling matrix,Θ_l⁽ⁿ⁾, derived for that transmit antenna and that frequency subchannel, as shown in equation (18). The LLRs for the coded bits in the desired signal may then be computed, as shown in equations (20) and (22). Thus, equation (20) or (22) may be evaluated N_Ttimes, once for each desired signal, with each evaluation providing a set of (N_F·q) LLRs for the coded bits in the desired signal.

The reduced computational complexity for deriving the LLRs for the coded bits is achieved with a corresponding decrease in diversity, since the desired signal is received with a diversity of order (N_R−N_T+1), instead of a diversity of order N_R, using equation (18).

The dual-maxima approximation may also be used to reduce the computational complexity associated with deriving the LLRs for the coded bits. As shown in equations (20) and (22), the LLR for each coded bit is computed as the logarithm of the ratio of two summations. Each summation is performed over a number of elements, with each such element being composed of products of exponential terms, exp(β_m,l) and exp(α_n). The exponentiation in the elements of each summation enhances the differences between the individual elements of the summation. Hence, one element typically dominates each summation, and the following approximation may be made:

\begin{matrix} \ln \sum_{j} \exp (a_{j}) \approx \max_{j} (a_{j}) . & Eq (23) \end{matrix}

For simplicity, the following may be defined:

\begin{matrix} \begin{matrix} L_{k} = \ln \frac{\sum_{\underline{u} : u_{k} = + 1} \exp [M (\underline{u}, \underline{y})]}{\sum_{\underline{u} : u_{k} = - 1} \exp [M (\underline{u}, \underline{y})]} \\ = \ln \sum_{\underline{u} : u_{k} = + 1} \exp [M (\underline{u}, \underline{y})] - \ln \sum_{\underline{u} : u_{k} = - 1} \exp [M (\underline{u}, \underline{y})] . \end{matrix} & Eq (24) \end{matrix}

Applying the approximation shown in equation (23) for the sum of exponents to equation (24), the following can be expressed:

\begin{matrix} L_{k} \approx \max_{\underline{u} : u_{k} = + 1} {M (\underline{u}, \underline{y})} - \max_{\underline{u} : u_{k} = - 1} {M (\underline{u}, \underline{y})} . & Eq (25) \end{matrix}

The approximation shown in equation (25) is often referred to as the dual-maxima approximation.

The dual-maxima approximation may be used to simplify the computation for the LLRs for the coded bits. Specifically, for equation (22), the logarithm of the ratio of two summations may first be decomposed as follows:

\begin{matrix} \begin{matrix} L_{e}^{(n)} (b_{k}) = \ln \sum_{\underset{b_{k} = + 1}{{\underline{c}}_{n} : {\underline{c}}_{n} = f ({\underline{b}}_{n})}} [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (α_{n})] - \\ \ln \sum_{\underset{b_{k} = - 1}{{\underline{c}}_{n} : {\underline{c}}_{n} = f ({\underline{b}}_{n})}} [\prod_{m = 1}^{N_{R}} \prod_{l = 0}^{N_{F} - 1} \exp (β_{m, l}) \exp (α_{n})] . \end{matrix} & Eq (26) \end{matrix}

Next, instead of summing over the individual elements for all possible values of the coded bits for the modulation symbolsc_nfrom the n-th transmit antenna, the dual-maxima approximation algorithm finds the maximum element in each summation (i.e., one for the numerator and another for the denominator in equation (22)) and uses these two maximum elements in the LLR calculation, as shown in equation (25).

By using approximations based on the dual-maxima approximation, the computational complexity can be made to increase linearly in the number of coded bits per modulation symbol, q, instead of exponentially. Simulation results have shown that the performance degradation due to the use of such approximations is negligible over the range of SNRs where the use of high-order modulations is justified.

Other approximations and simplifications may also be used to reduce the number of complex additions and multiplications needed to compute the LLRs for the coded bits, and this is within the scope of the invention.

Other simplifications that may be used for computing LLRs are described by Andrew J. Viterbi in a paper entitled “An Intuitive Justification and a Simplified Implementation of the MAP Decoder for Convolutional Codes,” IEEE Journal on Selected Areas in Communications, Vol. 16, No. 2, February 1998, pp. 260-264, and by Patrick Robertson et al. in a paper entitled “A Comparison of Optimal and Sub-Optimal MAP Decoding Algorithms Operating in the Log Domain,” IEEE International Conference on Communication, 1995, pp. 1009-1012, both of which are incorporated herein by reference. These various simplification techniques typically perform computations in the log-domain, where division becomes subtraction and multiplication becomes addition.

FIG. 4B is a block diagram of an embodiment of areceiver unit400b, which is another embodiment of the receiver portion ofreceiver system150 inFIG. 1. In this embodiment, different demodulation and decoding schemes may be used for the N_Ttransmit antennas.Receiver unit400bmay thus be used to receive a data transmission fromtransmitter unit200binFIG. 2B, which employs separate coding and modulation schemes on a per-antenna basis.

The signals transmitted from the N_Ttransmit antennas are initially received by each of N_Rantennas152athrough152rand routed to a respective receiver154. Each receiver154 conditions, digitizes, and processes a respective received signal to provide a respective stream of transmission symbols. The transmission symbol stream from each receiver154 is provided to a respective OFDM demodulator410 within ademodulator156b. Each OFDM demodulator410 removes the cyclic prefix appended to each OFDM symbol by the transmitter system and then transforms each received OFDM symbol to provide a vector of N_Freceived modulation symbols for the N_Ffrequency subchannels used to transmit the OFDM symbol. The N_Rmodulation symbol vectors from all N_ROFDM demodulators410 for each transmission symbol period are provided to a detector/decoder158b, which is another embodiment of detector/decoder158 inFIG. 1.

In the embodiment shown inFIG. 4B, detector/decoder158bincludes adetector420band N_Tdecoder blocks440, which collectively perform iterative detection and decoding on the modulation symbols received from all N_Rreceive antennas to provide the decoded data. Each decoder block440 is assigned to process the modulation symbols transmitted from a respective transmit antenna, which may have been coded and modulated with its own specific coding and modulation schemes.

Detector

420breceives the modulation symbols fromdemodulator156band the a priori information from the N_Tdecoders430athrough430tand provides soft-decision symbols for the N_Ttransmit antennas, with each such soft-decision symbol being an estimate of a transmitted coded bit and may be represented by the LLR, as shown in equation (22). For each transmission symbol period,detector420bprovides N_Tvectors of soft-decision symbols for the N_Ttransmit antennas to the N_Tdecoder blocks440, with each vector including (N_F·q_n) soft-decision symbols (where q_nis dependent on the specific modulation scheme used for the n-th transmit antenna). Within each decoder block440, the detector a priori information for each coded bit being processed by that decoder block is subtracted from the corresponding soft-decision symbol to derive the extrinsic information for the coded bit. The detector extrinsic information for all (N_F·q_n) coded bits is then converted from parallel to serial by P/S converter424, deinterleaved bychannel deinterleaver426, and provided as a priori information to decoder430.

Decoder430 within each decoder block440 uses the decoder a priori information in the decoding process and provides the decoded data for the transmit antenna assigned to and processed by the decoder block. Decoder430 further provides the a posteriori information for the coded bits transmitted by the assigned transmit antenna. Asummer432 then subtracts the decoder a priori information from the decoder a posteriori information to derive the decoder extrinsic information, which is then interleaved bychannel interleaver434, converted from serial to parallel by S/P converter436, and provided as a priori information todetector420band summer422.

FIG. 4C is a block diagram of an embodiment of areceiver unit400c, which is yet another embodiment of the receiver portion ofreceiver system150 inFIG. 1. In this embodiment, the detector performs successive nulling and interference cancellation to recover one transmitted signal at a time.Receiver unit400cmay be used to recover a data transmission fromtransmitter unit200binFIG. 2B (which employs separate coding and modulation schemes on a per-antenna basis).

The N_Rreceived signals are initially processed by receivers154 and further processed bydemodulator156 to provide N_Rmodulation symbol vectors,r, for each transmission symbol period, which are then provided to a detector/decoder158c. Detector/decoder158cperforms iterative detection and decoding as well as successive nulling and interference cancellation. In particular, detector/decoder158cimplements a multi-stage (or multi-layer) detection scheme that includes both nulling of interferers and post-decoding interference cancellation (i.e., successive nulling and interference cancellation).

Detector/decoder158cincludes adetector420c, N_Tdecoder blocks440, and P/S converter442.Detector420cincludes N_Tdetection stages (or layers), with each stage being assigned to process and recover the data for a particular transmit antenna. Each stage (except for the last stage) includes an interference nuller450, an LLR computer452, and an interference canceller460. The last stage only includes LLR computer452 since all other transmitted signals have been nulled by this time.

Withindetector420c, the received modulation symbol vectorsr are provided as the input vectorsr⁽¹⁾for interference nuller450a, which pre-multiplies the modulation symbol vectorr_l⁽¹⁾for each frequency subchannel with the nulling matrixΘ_l⁽¹⁾for that frequency subchannel of the first transmit antenna to provide the vector{tilde over (r)}_l⁽¹⁾having the components from the other (N_T−1) transmit antennas approximately removed. The pre-multiplication may be performed as shown in equation (18), which is:
{tilde over (r)}_l⁽¹⁾=Θ_l⁽¹⁾r_l=Θ_l⁽¹⁾H_lc_l+Θ_l⁽¹⁾n_l.

Interference nuller

450aperforms N_Fpre-multiplications to derive N_Fvectors,{tilde over (r)}⁽¹⁾=[{tilde over (r)}₀^(1)T{tilde over (r)}₁^(1)T. . .{tilde over (r)}_N_F₋₁^(1)T]^T, for the N_Ffrequency subchannels of the first transmit antenna.

The vectors{tilde over (r)}⁽¹⁾are then provided toLLR computer452a, which computes the LLRs for the coded bits transmitted from the first transmit antenna, as shown in equation (22). The LLRs for the (N_F·q₁) coded bits from the first transmit antenna are then provided to decoder block440a, which operates on the decoder a priori information to provide the detector a priori information and the decoded bits for the first transmit antenna, as described below. The detector a priori information fromdecoder block440ais provided back toLLR computer452aand used to compute the new decoder a priori information for the next iteration. The detection and decoding for the first transmit antenna may be iterated a number of times.

The decoded bits fromdecoder block440aare also provided to interference canceller460a. Assuming that the data for the first stage has been decoded correctly, the contribution of these decoded bits on the received modulation symbols (which is denoted asî⁽¹⁾) is derived and subtracted from that stage's input vectorsr⁽¹⁾to derive the input vectorsr⁽²⁾for the next stage. This interference cancellation may be expressed as:
r⁽²⁾=r⁽¹⁾−î⁽¹⁾. Eq (27)

Each subsequent stage performs the detection and decoding in a similar manner as described above for the first stage to provide the decoded bits for the assigned transmit antenna. However, the input vectors,r⁽ⁿ⁾, for each subsequent stage contain less interference than that of the previous stage. Also, since the nulling is performed by interference nuller450 using the modulation symbols from all N_Rreceive antennas, the diversity order increases by one from one stage to the next. Finally, in the last stage, only the signal contribution from the last (N_T-th) transmit antenna remains, if the interference cancellation was effectively performed in the preceding stages. Hence, no nulling is necessary and the iterative detection and decoding may be performed directly on that stage's input vectorsr^(N^T⁾.

Pre-decoding interference estimation and cancellation may also be used, and this is within the scope of the invention. In this case, a hard decision may be made on the LLR outputs from the detector. The hard decision may then be re-modulated and multiplied with the estimated channel response to obtain pre-decoding interference estimates (which are typically not as reliable as post-decoding interference estimates). The pre-decoding interference estimates may then be canceled from the received modulation symbols.

Decoders

Decoders430 inFIGS. 4A and 4B may be implemented based on various designs and may be dependent on the particular coding scheme(s) used at the transmitter system. For example, each decoder430 may be implemented as an iterative decoder (i.e., a Turbo decoder) if a Turbo code is used. The structures for the Turbo decoders for serial and parallel concatenated convolutional codes are described below.

FIG. 5A is a simplified block diagram of aTurbo decoder430xcapable of performing iterative decoding for serial concatenated convolutional codes, such as the one shown inFIG. 3A.Turbo decoder430xincludes inner and outer maximum a posteriori (MAP)

decoders

512aand512b, acode deinterleaver514, and acode interleaver516.

The coded bits (or more specifically, the a priori LLRs for the decoder, L_a^D(b_k)) are provided toinner MAP decoder512a, which derives the a posteriori information for the coded bits based on the inner convolutional code. The a posteriori information is then subtracted by the a priori information forMAP decoder512ato provide extrinsic information, e_k^s1, which is indicative of corrections/adjustments in the confidence of the values for the information bits. The extrinsic information is then deinterleaved by code deinterleaver514 and provided as a priori information toouter MAP decoder512b.MAP decoder512aalso provides the LLRs for the coded bits, which comprise the a posteriori information, L^D(b_k), that is provided tosummer432 inFIGS. 4A and 4B.

MAP decoder

512breceives the a priori information fromMAP decoder512a(after the code deinterleaving) and derives the a posteriori information for the coded bits based on the outer convolutional code. The a posteriori information is subtracted by the a priori information forMAP decoder512bto provide extrinsic information, e_k^s2, which is indicative of further corrections/adjustments in the confidence of the values for the information bits. The extrinsic information, e_k^s2, is then interleaved by code interleaver516 and provided toinner MAP decoder512a.

The decoding by inner and

outer MAP decoders

512aand512bmay be iterated a number of times (e.g., 8, 12, 16, or possibly more). With each iteration, greater confidence is gained for the detected values of the information bits. After all the decoding iterations have been completed, the final LLRs for the information bits are provided to a bit detector withinMAP decoder512band sliced to provide the decoded bits, which are hard-decision (i.e., “0” or “1”) values for the information bits.

MAP decoders

512aand512bmay be implemented with the well-known BCJR soft-input soft-output MAP algorithm or its lower complexity derivatives. Alternatively, the soft-output Viterbi (SOV) algorithm may be implemented instead of the MAP algorithms. MAP decoders and MAP algorithms are described in further detail in the aforementioned papers by Viterbi and Robertson. The MAP and SOV algorithms may also be used to decode simple convolutional codes. The complexity of these algorithms is comparable to the standard Viterbi decoding algorithm, multiplied by the number of iterations.

FIG. 5B is a simplified block diagram of aTurbo decoder430ycapable of performing iterative decoding for parallel concatenated convolutional codes, such as the one shown inFIG. 3B.Turbo decoder430yincludes a S/P converter510, two

MAP decoders

512cand512d, two

code interleavers

524aand524b, acode deinterleaver526, and a P/S converter528.

The coded bits (or more specifically, the a priori LLRs for the decoder, L_a^D(b_k)) are provided to S/P converter510, which provides the a priori LLRs for the information bits, L_a^D(b_k^x), toMAP decoder512cand code interleaver524b, the a priori LLRs for the first constituent encoder's parity bits, L_a^D(b_k^y), toMAP decoder512c, and the a priori LLRs for the second constituent encoder's parity bits, L_a^D(b_k^z), to codeinterleaver524b, where L_a^D(b_k)={L_a^D(b_k^x), L_a^D(b_k^y), L_a^D(b_k^z)}.

MAP decoder

512creceives the a priori LLRs for the information bits, L_a^D(b_k^x), the a priori LLRs for the first constituent encoder's parity bits, L_a^D(b_k^y), and extrinsic information fromMAP decoder512d, e_k^p2(after deinterleaving by code deinterleaver526).MAP decoder512cthen derives the a posteriori information for the information bits based on the first constituent convolutional code. This a posteriori information is then subtracted by the received a priori information to provide extrinsic information, e_k^p1, which is indicative of corrections/adjustments in the confidence of the values for the information bits determined from the first constituent encoder's parity bits. The extrinsic information is then interleaved by code interleaver524aand provided toMAP decoder512d.

MAP decoder

512dreceives the a priori LLRs for the information bits, L_a^D(b_k^x) (after interleaving bycode interleaver524b), the a priori LLRs for the second constituent encoder's parity bits, L_a^D(b_k^z), and the extrinsic information fromMAP decoder512c, e_k^p1(after interleaving by code interleaver524a).MAP decoder512dthen derives the a posteriori information for the information bits based on the second constituent convolutional code. This a posteriori information is then subtracted by the received extrinsic information, e_k^p1, to provide the extrinsic information, e_k², which is indicative of further corrections/adjustments in the confidence of the values for the information bits determined from the second constituent encoder's parity bits. The extrinsic information, e_k², is then deinterleaved by code deinterleaver526 and provided toMAP decoder512c.

P/S converter528 receives the first constituent encoder's parity bit LLRs fromMAP decoder512c, the second constituent encoder's parity bit LLRs fromMAP decoder512d, and the information bit LLRs fromMAP decoder512d. P/S converter528 then performs parallel-to-serial conversion of the received LLRs and provides the a posteriori information, L^D(b_k), tosummer432 inFIGS. 4A and 4B.

The decoding by

MAP decoders

512cand512dmay also be iterated a number of times (e.g., 8, 12, 16, or possibly more). After all the decoding iterations have been completed, the final LLRs for the information bits are provided to a bit detector withinMAP decoder512dand sliced to provide the decoded bits.

MAP decoders

512cand512dmay be implemented with the BCJR SISO MAP algorithm or its lower complexity derivatives or with the SOV algorithm.

In general, the number of iterations in both the decoder and the iterative detector-decoder can be fixed or variable (i.e., adaptive). In the latter case, the stop criterion may be triggered when (1) the BER converges or reaches an acceptable level, (2) the worse or average LLR reaches a particular confidence level, or (3) some other criterion is met.

Interference Cancellation

FIG. 6 is a block diagram of an embodiment of aninterference canceller460x, which may be used for each interference canceller460 inFIG. 4C. Withininterference canceller460x, the decoded bits from the decoder block440 for the same stage are re-encoded and channel interleaved by aTX data processor114xto provide re-encoded bits for the transmit antenna being processed by the stage (i.e., the assigned transmit antenna). The re-encoded bits are further symbol mapped by amodulator116xto provide remodulated symbols, which are estimates of the modulation symbols at the transmitter prior to the OFDM processing and channel distortion.TX data processor114xandmodulator116xeach performs the same processing (e.g., encoding, channel interleaving, and modulation) as that performed at the transmitter system for the data stream on the assigned transmit antenna. The remodulated symbols are then provided to achannel simulator612, which processes the symbols with the estimated channel response to provide estimates of the interference due to the decoded bits.

For each frequency subchannel,channel simulator612 multiples the remodulated symbols for the assigned n-th transmit antenna with a vectorĥ_n,lthat includes an estimate of the channel response between the n-th transmit antenna and each of the N_Rreceive antennas. The vectorĥ_n,lis one column of the estimated channel response matrixĤ_lfor the l-th frequency subchannel. The matrixĤ_lmay be determined by a channel estimator associated with the same stage and provided tochannel simulator612.

If the remodulated symbol corresponding to the n-th transmit antenna is expressed as {tilde over (c)}_n,l, then the estimated interference componentî_l⁽ⁿ⁾due to the symbol from the n-th transmit antenna may be expressed as:

\begin{matrix} {\underline{\hat{i}}}_{l}^{(n)} = [\begin{matrix} {\hat{h}}_{n, 1, l} \cdot {\tilde{c}}_{n, l} \\ {\hat{h}}_{n, 2, l} \cdot {\tilde{c}}_{n, l} \\ ⋮ \\ {\hat{h}}_{n, N_{R}, l} \cdot {\tilde{c}}_{n, l} \end{matrix}] . & Eq (28) \end{matrix}

The N_Relements in the interference vectorî_l⁽ⁿ⁾correspond to components in the input vectorr_l⁽ⁿ⁾due to the modulation symbol {tilde over (c)}_n,ltransmitted from the n-th transmit antenna. The interference vectors for all N_Ffrequency subchannels may be formed asî⁽ⁿ⁾=[î₀^(n)Tî₁^(n)T. . .î_N_F₋₁^(n)T]^T. The components in the vectorsî⁽ⁿ⁾are interference to the remaining (not yet detected) modulation symbols from the other transmit antennas which are also included in the input vectorsr⁽ⁿ⁾. The interference vectorsî⁽ⁿ⁾are then subtracted from the input vectorsr⁽ⁿ⁾by asummer614 to provide modified vectorsr⁽ⁿ⁺¹⁾having the interference components from the decoded bits removed. This cancellation can be expressed as shown above in equation (27). The modified vectorsr⁽ⁿ⁺¹⁾are provided as the input vectors to the next processing stage, as shown inFIG. 4C.

The successive cancellation receiver processing technique is described in further detail in the aforementioned U.S patent application Ser. Nos. 09/854,235 and 09/993,087, and by P. W. Wolniansky et al. in a paper entitled “V-BLAST: An Architecture for Achieving Very High Data Rates over the Rich-Scattering Wireless Channel”, Proc. ISSSE-98, Pisa, Italy, which is incorporated herein by reference.

Deriving and Reporting Channel State Information

InFIG. 1, a channel estimator withindemodulator156 may process the received OFDM symbols and derive estimates of one or more characteristics of the communication channel, such as the channel frequency response, the channel noise variance, the SNR of the received symbols, and so on. Detector/decoder158 may also derive and provide the status of each received packet and may further provide one or more other performance metrics indicative of the decoded results. These various types of information may be provided tocontroller170.

Controller

170 may determine or select a particular “rate” to be used for all transmit antennas, for each transmit antenna, for each subset of transmit antennas, for each transmission channel, or for each group of transmission channels based on the various types of information received fromdemodulator156 and detector/decoder158. The rate is indicative of a set of specific values for a set of transmission parameters. For example, the rate may indicate (or may be associated with) a specific data rate to be used for the data transmission, a specific coding scheme and/or code rate, a specific modulation scheme, and so on. Channel state information (CSI) in the form of the selected rate, the channel response estimates, and/or other information may be provided bycontroller170, processed by anencoder180, modulated by amodulator182, and conditioned and transmitted by one or more transmitters154 back totransmitter system110. Various forms of CSI are described in the aforementioned U.S. patent application Ser. No. 09/993,087.

Attransmitter system110, the one or more modulated signals fromreceiver system150 are received by antennas124, conditioned by receivers122, demodulated by ademodulator140, and decoded by adecoder142 to recover the channel state information transmitted by the receiver system. The channel state information is then provided tocontroller130 and used to control the processing of the data transmission to the receiver system. For example, the data rate of the data transmission may be determined based on the selected rate provided by the receiver system, or may be determined based on the channel response estimates provided by the receiver system. The specific coding and modulation schemes associated with the selected rate are determined and reflected in the coding and modulation control provided bycontroller130 toTX data processor114 andmodulator116.

The iterative detection and decoding techniques have been described specifically for serial and parallel concatenated convolutional codes. These techniques may also be used with other codes, such as convolutional codes, block codes, concatenated codes of different types (e.g., a convolutional code with a block code), and so on. Furthermore, the iterative detection and decoding techniques have been described specifically for a MIMO-OFDM system. These techniques may also be used for a MIMO system that does not implement OFDM, an OFDM system that does not utilize MIMO, or some other wireless communication systems (e.g., a wireless LAN system).

The iterative detection and decoding techniques may be implemented in various units in a wireless communication system, such as in a terminal, a base station, an access point, and so on.

The iterative detection and decoding techniques described herein may be implemented by various means. For example, these techniques may be implemented in hardware, software, or a combination thereof. For a hardware implementation, the elements used to perform the iterative detection and decoding (e.g., detector420 and decoder(s)430) may be implemented within one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, micro-controllers, microprocessors, other electronic units designed to perform the functions described herein, or a combination thereof.

For a software implementation, the iterative detection and decoding may be performed with modules (e.g., procedures, functions, and so on) that perform the computations and functions described herein. The software codes may be stored in a memory unit (e.g.,memory172 inFIG. 1) and executed by a processor (e.g., controller170). The memory unit may be implemented within the processor or external to the processor, in which case it can be communicatively coupled to the processor via various means as is known in the art.

The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.