Movatterモバイル変換

[0]ホーム

Jump to content

Entropy (information theory)

Edit links

From Wikipedia, the free encyclopedia

Expected amount of information needed to specify the output of a stochastic data source

For other uses, seeEntropy (disambiguation).

This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Entropy" information theory – news ·newspapers ·books ·scholar ·JSTOR(February 2019) (Learn how and when to remove this message)

Information theory

Entropy Differential entropy Conditional entropy Joint entropy Mutual information Directed information Conditional mutual information Relative entropy Entropy rate Limiting density of discrete points
Asymptotic equipartition property Rate–distortion theory
Shannon's source coding theorem Channel capacity Noisy-channel coding theorem Shannon–Hartley theorem
v t e

Ininformation theory, theentropy of arandom variable quantifies the average level of uncertainty or information associated with the variable's potential states or possible outcomes. This measures the expected amount of information needed to describe the state of the variable, considering the distribution of probabilities across all potential states. Given a discrete random variable $X {\displaystyle X}$ , which may be any member $x {\displaystyle x}$ within the set ${\mathcal {X}}$ and is distributed according to $p\colon {\mathcal {X}}\to [0,1]$ , the entropy is $\mathrm {H} (X):=-\sum _{x\in {\mathcal {X}}}p(x)\log p(x),$ where $\Sigma$ denotes the sum over the variable's possible values.^{[Note 1]} The choice of base for $\log$ , thelogarithm, varies for different applications. Base 2 gives the unit ofbits (or "shannons"), while basee gives "natural units"nat, and base 10 gives units of "dits", "bans", or "hartleys". An equivalent definition of entropy is theexpected value of theself-information of a variable.^[1]

Two bits of entropy: In the case of two fair coin tosses, the information entropy in bits is the base-2 logarithm of the number of possible outcomes‍— with two coins there are four possible outcomes, and two bits of entropy. Generally, information entropy is the average amount of information conveyed by an event, when considering all possible outcomes.

The concept of information entropy was introduced byClaude Shannon in his 1948 paper "A Mathematical Theory of Communication",^[2]^[3] and is also referred to asShannon entropy. Shannon's theory defines adata communication system composed of three elements: a source of data, acommunication channel, and a receiver. The "fundamental problem of communication" – as expressed by Shannon – is for the receiver to be able to identify what data was generated by the source, based on the signal it receives through the channel.^[2]^[3] Shannon considered various ways to encode, compress, and transmit messages from a data source, and proved in hissource coding theorem that the entropy represents an absolute mathematical limit on how well data from the source can belosslessly compressed onto a perfectly noiseless channel. Shannon strengthened this result considerably for noisy channels in hisnoisy-channel coding theorem.

Entropy in information theory is directly analogous to theentropy instatistical thermodynamics. The analogy results when the values of the random variable designate energies of microstates, so Gibbs's formula for the entropy is formally identical to Shannon's formula. Entropy has relevance to other areas of mathematics such ascombinatorics andmachine learning. The definition can be derived from a set ofaxioms establishing that entropy should be a measure of how informative the average outcome of a variable is. For a continuous random variable,differential entropy is analogous to entropy. The definition $\mathbb {E} [-\log p(X)]$ generalizes the above.

Authority control databases
International	GND FAST
National	United States France BnF data Japan Czech Republic Spain Israel
Other	Yale LUX

All figures in entropically compressedexabytes
Type of Information	1986	2007
Storage	2.6	295
Broadcast	432	1900
Telecommunications	0.281	65

Movatterモバイル変換

Introduction

Example

Definition

Measure theory

Example

Characterization

Alternative characterization

Discussion

Alternative characterization via additivity and subadditivity

Discussion

Further properties

Aspects

Relationship to thermodynamic entropy

Data compression

Entropy as a measure of diversity

Entropy of a sequence

Limitations of entropy in cryptography

Data as a Markov process

Efficiency (normalized entropy)

Entropy for continuous random variables

Differential entropy

Limiting density of discrete points

Relative entropy

Use in number theory

Use in combinatorics

Loomis–Whitney inequality

Approximation to binomial coefficient

Use in machine learning

See also

Notes

References

Further reading

Textbooks on information theory

External links