Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Shannon–Hartley theorem

From Wikipedia, the free encyclopedia
(Redirected fromShannon–Hartley law)
Theorem that tells the maximum rate at which information can be transmitted
Not to be confused withNyquist–Shannon sampling theorem.
Information theory

Ininformation theory, theShannon–Hartley theorem tells the maximum rate at which information can be transmitted over a communications channel of a specifiedbandwidth in the presence ofnoise. It is an application of thenoisy-channel coding theorem to the archetypal case of acontinuous-timeanalogcommunications channel subject toGaussian noise. The theorem establishes Shannon'schannel capacity for such a communication link, a bound on the maximum amount of error-freeinformation per time unit that can be transmitted with a specifiedbandwidth in the presence of the noise interference, assuming that the signal power is bounded, and that the Gaussian noise process is characterized by a known power or power spectral density. The law is named afterClaude Shannon andRalph Hartley.

Statement of the theorem

[edit]

The Shannon–Hartley theorem states thechannel capacityC{\displaystyle C}, meaning the theoreticaltightest upper bound on theinformation rate of data that can be communicated at an arbitrarily lowerror rate using an average received signal powerS{\displaystyle S} through an analog communication channel subject toadditive white Gaussian noise (AWGN) of powerN{\displaystyle N}:

C=Blog2(1+SN){\displaystyle C=B\log _{2}\left(1+{\frac {S}{N}}\right)}

where

Historical development

[edit]

During the late 1920s,Harry Nyquist andRalph Hartley developed a handful of fundamental ideas related to the transmission of information, particularly in the context of thetelegraph as a communications system. At the time, these concepts were powerful breakthroughs individually, but they were not part of a comprehensive theory. In the 1940s,Claude Shannon developed the concept of channel capacity, based in part on the ideas of Nyquist and Hartley, and then formulated a complete theory of information and its transmission.

Nyquist rate

[edit]
Main article:Nyquist ISI criterion

In 1927, Nyquist determined that the number of independent pulses that could be put through a telegraph channel per unit time is limited to twice the one-sidedbandwidth of the channel. In symbolic notation,

fp2B{\displaystyle f_{p}\leq 2B}

wherefp{\displaystyle f_{p}} is the pulse frequency (in pulses per second) andB{\displaystyle B} is the one-sided bandwidth[clarification needed] (in hertz). The quantity2B{\displaystyle 2B} later came to be called theNyquist rate, and transmitting at the limiting pulse rate of2B{\displaystyle 2B} pulses per second assignalling at the Nyquist rate. Nyquist published his results in 1928 as part of his paper "Certain topics in Telegraph Transmission Theory".[1]

Hartley's law

[edit]

During 1928, Hartley formulated a way to quantify information and itsline rate (also known asdata signalling rateR bits per second).[2] This method, later known as Hartley's law, became an important precursor for Shannon's more sophisticated notion of channel capacity.

Hartley argued that the maximum number of distinguishable pulse levels that can be transmitted and received reliably over a communications channel is limited by the dynamic range of the signal amplitude and the precision with which the receiver can distinguish amplitude levels. Specifically, if the amplitude of the transmitted signal is restricted to the range of [−A ... +A] volts, and the precision of the receiver is ±ΔV volts, then the maximum number of distinct pulsesM is given by

M=1+AΔV{\displaystyle M=1+{A \over \Delta V}}.

By taking information per pulse in bit/pulse to be the base-2-logarithm of the number of distinct messagesM that could be sent, Hartley[3] constructed a measure of the line rateR as:

R=fplog2(M),{\displaystyle R=f_{p}\log _{2}(M),}

wherefp{\displaystyle f_{p}} is the pulse rate, also known as the symbol rate, in symbols/second orbaud.

Hartley then combined the above quantification with Nyquist's observation that the number of independent pulses that could be put through a channel of one-sided bandwidthB{\displaystyle B}hertz was2B{\displaystyle 2B} pulses per second, to arrive at his quantitative measure for achievable line rate.

Hartley's law is sometimes quoted as just a proportionality between theanalog bandwidth,B{\displaystyle B}, in Hertz and what today is called thedigital bandwidth,R{\displaystyle R}, in bit/s.[4]Other times it is quoted in this more quantitative form, as an achievable line rate ofR{\displaystyle R} bits per second:[5]

R2Blog2(M).{\displaystyle R\leq 2B\log _{2}(M).}

Hartley did not work out exactly how the numberM should depend on the noise statistics of the channel, or how the communication could be made reliable even when individual symbol pulses could not be reliably distinguished toM levels; with Gaussian noise statistics, system designers had to choose a very conservative value ofM{\displaystyle M} to achieve a low error rate.

The concept of an error-free capacity awaited Claude Shannon, who built on Hartley's observations about a logarithmic measure of information and Nyquist's observations about the effect of bandwidth limitations.

Hartley's rate result can be viewed as the capacity of an errorlessM-ary channel of2B{\displaystyle 2B} symbols per second. Some authors refer to it as a capacity. But such an errorless channel is an idealization, and if M is chosen small enough to make the noisy channel nearly errorless, the result is necessarily less than the Shannon capacity of the noisy channel of bandwidthB{\displaystyle B}, which is the Hartley–Shannon result that followed later.

Noisy channel coding theorem and capacity

[edit]
Main article:Noisy-channel coding theorem

Claude Shannon's development ofinformation theory during World War II provided the next big step in understanding how much information could be reliably communicated through noisy channels. Building on Hartley's foundation, Shannon'snoisy channel coding theorem (1948) describes the maximum possible efficiency oferror-correcting methods versus levels of noise interference and data corruption.[6][7] The proof of the theorem shows that a randomly constructed error-correcting code is essentially as good as the best possible code; the theorem is proved through the statistics of such random codes.

Shannon's theorem shows how to compute achannel capacity from a statistical description of a channel, and establishes that given a noisy channel with capacityC{\displaystyle C} and information transmitted at a line rateR{\displaystyle R}, then if

R<C{\displaystyle R<C}

there exists a coding technique which allows the probability of error at the receiver to be made arbitrarily small. This means that theoretically, it is possible to transmit information nearly without error up to nearly a limit ofC{\displaystyle C} bits per second.

The converse is also important. If

R>C{\displaystyle R>C}

the probability of error at the receiver increases without bound as the rate is increased, so no useful information can be transmitted beyond the channel capacity. The theorem does not address the rare situation in which rate and capacity are equal.

The Shannon–Hartley theorem establishes what that channel capacity is for a finite-bandwidthcontinuous-time channel subject to Gaussian noise. It connects Hartley's result with Shannon's channel capacity theorem in a form that is equivalent to specifying theM in Hartley's line rate formula in terms of a signal-to-noise ratio, but achieving reliability through error-correction coding rather than through reliably distinguishable pulse levels.

If there were such a thing as a noise-free analog channel, one could transmit unlimited amounts of error-free data over it per unit of time (Note that an infinite-bandwidth analog channel could not transmit unlimited amounts of error-free data absent infinite signal power). Real channels, however, are subject to limitations imposed by both finite bandwidth and nonzero noise.

Bandwidth and noise affect the rate at which information can be transmitted over an analog channel. Bandwidth limitations alone do not impose a cap on the maximum information rate because it is still possible for the signal to take on an indefinitely large number of different voltage levels on each symbol pulse, with each slightly different level being assigned a different meaning or bit sequence. Taking into account both noise and bandwidth limitations, however, there is a limit to the amount of information that can be transferred by a signal of a bounded power, even when sophisticated multi-level encoding techniques are used.

In the channel considered by the Shannon–Hartley theorem, noise and signal are combined by addition. That is, the receiver measures a signal that is equal to the sum of the signal encoding the desired information and a continuous random variable that represents the noise. This addition creates uncertainty as to the original signal's value. If the receiver has some information about the random process that generates the noise, one can in principle recover the information in the original signal by considering all possible states of the noise process. In the case of the Shannon–Hartley theorem, the noise is assumed to be generated by a Gaussian process with a known variance. Since the variance of a Gaussian process is equivalent to its power, it is conventional to call this variance the noise power.

Such a channel is called the Additive White Gaussian Noise channel, because Gaussian noise is added to the signal; "white" means equal amounts of noise at all frequencies within the channel bandwidth. Such noise can arise both from random sources of energy and also from coding and measurement error at the sender and receiver respectively. Since sums of independent Gaussian random variables are themselves Gaussian random variables, this conveniently simplifies analysis, if one assumes that such error sources are also Gaussian and independent.

Implications of the theorem

[edit]

Comparison of Shannon's capacity to Hartley's law

[edit]

Comparing the channel capacity to the information rate from Hartley's law, we can find the effective number of distinguishable levelsM:[8]

2Blog2(M)=Blog2(1+SN){\displaystyle 2B\log _{2}(M)=B\log _{2}\left(1+{\frac {S}{N}}\right)}
M=1+SN.{\displaystyle M={\sqrt {1+{\frac {S}{N}}}}.}

The square root effectively converts the power ratio back to a voltage ratio, so the number of levels is approximately proportional to the ratio of signalRMS amplitude to noise standard deviation.

This similarity in form between Shannon's capacity and Hartley's law should not be interpreted to mean thatM{\displaystyle M} pulse levels can be literally sent without any confusion. More levels are needed to allow for redundant coding and error correction, but the net data rate that can be approached with coding is equivalent to using thatM{\displaystyle M} in Hartley's law.

Frequency-dependent (colored noise) case

[edit]

In the simple version above, the signal and noise are fully uncorrelated, in which caseS+N{\displaystyle S+N} is the total power of the received signal and noise together. A generalization of the above equation for the case where the additive noise is not white (or that theS/N{\displaystyle S/N} is not constant with frequency over the bandwidth) is obtained by treating the channel as many narrow, independent Gaussian channels in parallel:

C=0Blog2(1+S(f)N(f))df{\displaystyle C=\int _{0}^{B}\log _{2}\left(1+{\frac {S(f)}{N(f)}}\right)df}

where

Note: the theorem only applies to Gaussianstationary process noise. This formula's way of introducing frequency-dependent noise cannot describe all continuous-time noise processes. For example, consider a noise process consisting of adding a random wave whose amplitude is 1 or −1 at any point in time, and a channel that adds such a wave to the source signal. Such a wave's frequency components are highly dependent. Though such a noise may have a high power, it is fairly easy to transmit a continuous signal with much less power than one would need if the underlying noise was a sum of independent noises in each frequency band.

Approximations

[edit]
AWGN channel capacity with the power-limited regime and bandwidth-limited regime indicated. Here,SN0=1{\displaystyle {\frac {S}{N_{0}}}=1};B andC can be scaled proportionally for other values.

For large or small and constant signal-to-noise ratios, the capacity formula can be approximated:

Bandwidth-limited case

[edit]

When the SNR is large (S/N ≫ 1), the logarithm is approximated by

log2(1+SN)log2SN=ln10ln2log10SN3.32log10SN,{\displaystyle \log _{2}\left(1+{\frac {S}{N}}\right)\approx \log _{2}{\frac {S}{N}}={\frac {\ln 10}{\ln 2}}\cdot \log _{10}{\frac {S}{N}}\approx 3.32\cdot \log _{10}{\frac {S}{N}},}

in which case the capacity is logarithmic in power and approximately linear in bandwidth (not quite linear, sinceN increases with bandwidth, imparting a logarithmic effect). This is called thebandwidth-limited regime.

C0.332BSNR (in dB){\displaystyle C\approx 0.332\cdot B\cdot \mathrm {SNR\ (in\ dB)} }

where

SNR (in dB)=10log10SN.{\displaystyle \mathrm {SNR\ (in\ dB)} =10\log _{10}{S \over N}.}

Power-limited case

[edit]

Similarly, when the SNR is small (ifS/N1{\displaystyle S/N\ll 1}), applying the approximation to the logarithm:

log2(1+SN)=1ln2ln(1+SN)1ln2SN1.44SN;{\displaystyle \log _{2}\left(1+{\frac {S}{N}}\right)={\frac {1}{\ln 2}}\cdot \ln \left(1+{\frac {S}{N}}\right)\approx {\frac {1}{\ln 2}}\cdot {\frac {S}{N}}\approx 1.44\cdot {S \over N};}

then the capacity is linear in power. This is called thepower-limited regime.

C1.44BSN.{\displaystyle C\approx 1.44\cdot B\cdot {S \over N}.}

In this low-SNR approximation, capacity is independent of bandwidth if the noise is white, ofspectral densityN0{\displaystyle N_{0}} watts per hertz, in which case the total noise power isN=BN0{\displaystyle N=B\cdot N_{0}}.

C1.44SN0{\displaystyle C\approx 1.44\cdot {S \over N_{0}}}

Examples

[edit]
  1. At a SNR of 0 dB (Signal power = Noise power) the Capacity in bits/s is equal to the bandwidth in hertz.
  2. If the SNR is 20 dB, and the bandwidth available is 4 kHz, which is appropriate for telephone communications, then C = 4000 log2(1 + 100) = 4000 log2 (101) = 26.63 kbit/s. Note that the value of S/N = 100 is equivalent to the SNR of 20 dB.
  3. If the requirement is to transmit at 50 kbit/s, and a bandwidth of 10 kHz is used, then the minimum S/N required is given by 50000 = 10000 log2(1+S/N) so C/B = 5 then S/N = 25 − 1 = 31, corresponding to an SNR of 14.91 dB (10 x log10(31)).
  4. What is the channel capacity for a signal having a 1 MHz bandwidth, received with a SNR of −30 dB ? That means a signal deeply buried in noise. −30 dB means a S/N = 10−3. It leads to a maximal rate of information of 106 log2 (1 + 10−3) = 1443 bit/s. These values are typical of the received ranging signals of the GPS, where the navigation message is sent at 50 bit/s (below the channel capacity for the given S/N), and whose bandwidth is spread to around 1 MHz by a pseudo-noise multiplication before transmission.
  5. As stated above, channel capacity is proportional to the bandwidth of the channel and to the logarithm of SNR. This means channel capacity can be increased linearly either by increasing the channel's bandwidth given a fixed SNR requirement or, with fixed bandwidth, by usinghigher-order modulations that need a very high SNR to operate. As the modulation rate increases, thespectral efficiency improves, but at the cost of the SNR requirement. Thus, there is an exponential rise in the SNR requirement if one adopts a16QAM or 64QAM; however, the spectral efficiency improves.

See also

[edit]

Notes

[edit]
  1. ^Nyquist, Harry (April 1928)."Certain topics in telegraph transmission theory"(PDF).Trans. AIEE.47 (2):617–44.Bibcode:1928TAIEE..47..617N.doi:10.1109/T-AIEE.1928.5055024. Also 2002 Reprintdoi:10.1109/5.989873
  2. ^Hartley, R. V. L. (July 1928)."Transmission of Information"(PDF).Bell System Technical Journal.7 (3):535–563.doi:10.1002/j.1538-7305.1928.tb01236.x.
  3. ^Bell, D. A. (1962).Information Theory; and its Engineering Applications (3rd ed.). New York: Pitman.ISBN 9780273417576.{{cite book}}:ISBN / Date incompatibility (help)
  4. ^Gokhale, Anu A. (2004).Introduction to Telecommunications (2nd ed.). Thomson Delmar Learning.ISBN 1-4018-5648-9.
  5. ^Dunlop, John; Smith, D. Geoffrey (1998).Telecommunications Engineering. CRC Press.ISBN 0-7487-4044-9.
  6. ^Shannon, C. E. (1998) [1949].The Mathematical Theory of Communication(PDF). Urbana, IL: University of Illinois Press.
  7. ^Shannon, C. E. (January 1949)."Communication in the presence of noise"(PDF).Proceedings of the Institute of Radio Engineers.37 (1):10–21.doi:10.1109/JRPROC.1949.232969.S2CID 52873253. Archived fromthe original(PDF) on 8 February 2010.
  8. ^Pierce, John Robinson (1980).An Introduction to Information Theory: symbols, signals & noise. Courier.ISBN 0-486-24061-4.

References

[edit]

External links

[edit]
Retrieved from "https://en.wikipedia.org/w/index.php?title=Shannon–Hartley_theorem&oldid=1321322942"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp