Line spectral pairs

Line spectral pairs (LSP) or line spectral frequencies (LSF) are used to represent linear prediction coefficients (LPC) for transmission over a channel.[1] LSPs have several properties (e.g. smaller sensitivity to quantization noise) that make them superior to direct quantization of LPCs. For this reason, LSPs are very useful in speech coding.

LSP representation was developed by Fumitada Itakura,[2] at Nippon Telegraph and Telephone (NTT) in 1975.[3] From 1975 to 1981, he studied problems in speech analysis and synthesis based on the LSP method.[4] In 1980, his team developed an LSP-based speech synthesizer chip. LSP is an important technology for speech synthesis and coding, and in the 1990s was adopted by almost all international speech coding standards as an essential component, contributing to the enhancement of digital speech communication over mobile channels and the internet worldwide.[3] LSPs are used in the code-excited linear prediction (CELP) algorithm, developed by Bishnu S. Atal and Manfred R. Schroeder in 1985.

Mathematical foundation

The LP polynomial $A(z) = 1 - \sum_{k=1}^{p} a_k z^{-k}$ can be expressed as $A(z) = 0.5[P(z) + Q(z)]$, where:

  • $P(z) = A(z) + z^{-(p+1)} A(z^{-1})$
  • $Q(z) = A(z) - z^{-(p+1)} A(z^{-1})$

By construction, P is a palindromic polynomial and Q an antipalindromic polynomial; physically, P(z) corresponds to the vocal tract with the glottis closed and Q(z) with the glottis open.[5] It can be shown that:

  • The roots of P and Q lie on the unit circle in the complex plane.
  • The roots of P alternate with those of Q as we travel around the circle.
  • As the coefficients of P and Q are real, the roots occur in conjugate pairs.
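
As a rough illustration of the construction, the sketch below builds P and Q from a set of LP coefficients and checks their symmetry. It assumes the usual definitions $P(z) = A(z) + z^{-(p+1)} A(z^{-1})$ and $Q(z) = A(z) - z^{-(p+1)} A(z^{-1})$. The names `lsp_polynomials` and `is_palindromic` and the fourth-order coefficients are hypothetical, chosen only so that A(z) is minimum phase.

```python
def lsp_polynomials(a):
    """Return (P, Q) as coefficient lists in powers of z^-1.

    `a` holds a_1..a_p for A(z) = 1 - sum_k a_k z^-k.
    """
    c = [1.0] + [-ak for ak in a] + [0.0]  # A(z), padded to degree p + 1
    r = c[::-1]                            # z^-(p+1) * A(1/z)
    P = [x + y for x, y in zip(c, r)]
    Q = [x - y for x, y in zip(c, r)]
    return P, Q


def is_palindromic(coeffs, sign=1.0, tol=1e-12):
    """sign=+1 tests c[k] == c[n-k]; sign=-1 tests c[k] == -c[n-k]."""
    return all(abs(x - sign * y) <= tol for x, y in zip(coeffs, coeffs[::-1]))


# Hypothetical 4th-order predictor (minimum phase by construction).
a = [0.4, -0.68, 0.121, -0.3136]
P, Q = lsp_polynomials(a)

print(is_palindromic(P))        # P reads the same in both directions
print(is_palindromic(Q, -1.0))  # Q changes sign when reversed

# Averaging P and Q gives back A(z); the padded top coefficient is zero.
A_back = [0.5 * (x + y) for x, y in zip(P, Q)]
print(A_back)
```

Storing the coefficients in powers of $z^{-1}$ makes the reflection $z^{-(p+1)} A(z^{-1})$ a plain list reversal, which is why no explicit polynomial arithmetic is needed here.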

The Line Spectral Pair representation of the LP polynomial consists simply of the locations of the roots of P and Q (i.e. the angles $\omega$ such that, for $z = e^{i\omega}$, $P(z) = 0$ or $Q(z) = 0$). As the roots occur in conjugate pairs, only half of them (conventionally those between 0 and $\pi$) need be transmitted. Excluding the fixed roots at $z = \pm 1$, the total number of coefficients for both P and Q is therefore equal to p, the number of original LP coefficients (not counting $a_0 = 1$).
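
A small worked case may make the counting concrete. The second-order polynomial below is hypothetical (its coefficients are chosen for illustration, not taken from any source), and the usual definitions $P(z) = A(z) + z^{-(p+1)} A(z^{-1})$ and $Q(z) = A(z) - z^{-(p+1)} A(z^{-1})$ are assumed:

```latex
\begin{aligned}
A(z) &= 1 - 0.9\,z^{-1} + 0.64\,z^{-2}, \qquad p = 2 \\
P(z) &= 1 - 0.26\,z^{-1} - 0.26\,z^{-2} + z^{-3}
      = (1 + z^{-1})\,(1 - 1.26\,z^{-1} + z^{-2}) \\
Q(z) &= 1 - 1.54\,z^{-1} + 1.54\,z^{-2} - z^{-3}
      = (1 - z^{-1})\,(1 - 0.54\,z^{-1} + z^{-2})
\end{aligned}
```

Each quadratic factor has the form $1 - 2\cos\omega\, z^{-1} + z^{-2}$, so the two line spectral frequencies are $\omega_1 = \arccos 0.63 \approx 0.889$ (from P) and $\omega_2 = \arccos 0.27 \approx 1.297$ (from Q). The roots at $z = -1$ and $z = +1$ are always present for even p and carry no information, which leaves exactly $p = 2$ values to transmit.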

A common algorithm for finding these roots[6] is to evaluate the polynomial at a sequence of closely spaced points around the unit circle, observing when the result changes sign; when it does, a root must lie between the points tested. Because the roots of P are interspersed with those of Q, a single pass is sufficient to find the roots of both polynomials.
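
The search described above can be sketched as follows. This is a minimal illustration rather than the routine used by any particular codec: it assumes the usual definitions $P(z) = A(z) + z^{-(p+1)} A(z^{-1})$ and $Q(z) = A(z) - z^{-(p+1)} A(z^{-1})$, a minimum-phase A(z), and a grid finer than the smallest gap between roots. The helper names (`real_response`, `find_lsfs`), the grid size and the bisection refinement are all choices made for this example.

```python
import math


def lsp_polynomials(a):
    """P and Q coefficient lists (powers of z^-1) for A(z) = 1 - sum a_k z^-k."""
    c = [1.0] + [-ak for ak in a] + [0.0]
    r = c[::-1]
    return [x + y for x, y in zip(c, r)], [x - y for x, y in zip(c, r)]


def real_response(coeffs, omega, antisymmetric):
    """Real function of omega with the same zeros as the polynomial on |z| = 1.

    Multiplying P(e^{i omega}) by e^{i omega (p+1)/2} leaves a pure cosine sum;
    for Q the same factor leaves i times a pure sine sum.
    """
    m = (len(coeffs) - 1) / 2.0
    trig = math.sin if antisymmetric else math.cos
    return sum(ck * trig((m - k) * omega) for k, ck in enumerate(coeffs))


def find_lsfs(a, grid=1024, iters=60):
    """Single pass over (0, pi), switching between P and Q after each root."""
    P, Q = lsp_polynomials(a)
    funcs = [lambda w: real_response(P, w, False),
             lambda w: real_response(Q, w, True)]
    step = math.pi / grid
    which = 0                       # Q has a fixed root at omega = 0, so P comes first
    f = funcs[which]
    w_prev, f_prev = step, f(step)
    lsfs = []
    for j in range(2, grid):
        w = j * step
        f_cur = f(w)
        if f_prev * f_cur < 0.0:    # sign change: a root lies in (w_prev, w)
            lo, hi, f_lo = w_prev, w, f_prev
            for _ in range(iters):  # refine the bracket by bisection
                mid = 0.5 * (lo + hi)
                f_mid = f(mid)
                if f_lo * f_mid <= 0.0:
                    hi = mid
                else:
                    lo, f_lo = mid, f_mid
            lsfs.append(0.5 * (lo + hi))
            which ^= 1              # interlacing: next root belongs to the other polynomial
            f = funcs[which]
            f_cur = f(w)
        w_prev, f_prev = w, f_cur
    return lsfs


a = [0.4, -0.68, 0.121, -0.3136]    # hypothetical minimum-phase example, p = 4
lsfs = find_lsfs(a)
print([round(w, 4) for w in lsfs])
```

Because only one polynomial is evaluated at each grid point, the cost is roughly that of scanning a single polynomial, which is the practical benefit of the interlacing property.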

To convert back to LPCs, P(z) and Q(z) are rebuilt from their roots and $A(z) = 0.5[P(z) + Q(z)]$ is evaluated by "clocking" an impulse through it N times (where N is the order of the filter), which yields the coefficients of the original filter A(z).
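
A minimal sketch of the reverse conversion is given below. Instead of clocking an impulse through the filter, it multiplies out the factors of P and Q directly; the two approaches give the same numbers, since the impulse response of an FIR filter is its coefficient sequence. The sketch assumes an even order p, LSFs sorted in increasing order with the first one belonging to P, and fixed roots at $z = -1$ (for P) and $z = +1$ (for Q). The names `poly_mul` and `lsf_to_lpc` are hypothetical, and the input is the pair of LSFs of the illustrative polynomial $A(z) = 1 - 0.9 z^{-1} + 0.64 z^{-2}$, namely $\arccos 0.63$ and $\arccos 0.27$.

```python
import math


def poly_mul(u, v):
    """Multiply two polynomials given as coefficient lists in powers of z^-1."""
    out = [0.0] * (len(u) + len(v) - 1)
    for i, x in enumerate(u):
        for j, y in enumerate(v):
            out[i + j] += x * y
    return out


def lsf_to_lpc(lsfs):
    """Rebuild a_1..a_p (for A(z) = 1 - sum a_k z^-k) from sorted LSFs."""
    p = len(lsfs)
    if p % 2:
        raise ValueError("this sketch handles even orders only")
    P = [1.0, 1.0]    # fixed root of P at z = -1
    Q = [1.0, -1.0]   # fixed root of Q at z = +1
    for i, w in enumerate(lsfs):
        # Conjugate root pair at angles +w and -w on the unit circle.
        section = [1.0, -2.0 * math.cos(w), 1.0]
        if i % 2 == 0:
            P = poly_mul(P, section)   # 1st, 3rd, ... LSFs belong to P
        else:
            Q = poly_mul(Q, section)   # 2nd, 4th, ... LSFs belong to Q
    A = [0.5 * (x + y) for x, y in zip(P, Q)]
    return [-ck for ck in A[1:p + 1]]  # drop a_0 = 1 and the zero top term


a_rebuilt = lsf_to_lpc([math.acos(0.63), math.acos(0.27)])
print(a_rebuilt)   # approximately [0.9, -0.64]
```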

Properties

Line spectral pairs have several interesting and useful properties. When the roots of P(z) and Q(z) are interleaved, stability of the filter is ensured if and only if the roots are monotonically increasing. Moreover, the closer two roots are, the more resonant the filter is at the corresponding frequency. Because LSPs are not overly sensitive to quantization noise and stability is easily ensured, LSPs are widely used for quantizing LPC filters. Line spectral frequencies can also be interpolated.
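
The ordering test and the interpolation mentioned above are both simple to express in code. The sketch below is illustrative only: the function names, the optional `min_gap` parameter and the two frames of LSF values are hypothetical. It relies on the fact that a weighted average (with weights between 0 and 1) of two strictly increasing sequences is itself strictly increasing, so interpolating between two valid LSF vectors cannot break the ordering.

```python
import math


def is_valid_lsf(lsfs, min_gap=0.0):
    """True if 0 < w_1 < w_2 < ... < w_p < pi (each gap larger than min_gap)."""
    bounds = [0.0] + list(lsfs) + [math.pi]
    return all(hi - lo > min_gap for lo, hi in zip(bounds, bounds[1:]))


def interpolate_lsf(x, y, t):
    """Linear interpolation between two LSF vectors, with 0 <= t <= 1."""
    return [(1.0 - t) * u + t * v for u, v in zip(x, y)]


# Hypothetical LSF vectors (radians) for two consecutive frames.
frame_a = [0.30, 0.75, 1.40, 2.20]
frame_b = [0.45, 0.90, 1.10, 2.60]

halfway = interpolate_lsf(frame_a, frame_b, 0.5)
print(halfway, is_valid_lsf(halfway))

# A quantization error that swaps two neighbours is caught by the same test.
crossed = [0.30, 1.45, 1.40, 2.20]
print(is_valid_lsf(crossed))
```

In practice a small positive `min_gap` might be preferred over zero, since two nearly coincident roots produce a very sharp resonance.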

References

  1. Sahidullah, Md.; Chakroborty, Sandipan; Saha, Goutam (Jan 2010). "On the use of perceptual Line Spectral pairs Frequencies and higher-order residual moments for Speaker Identification". International Journal of Biometrics. 2 (4): 358–378. doi:10.1504/ijbm.2010.035450.
  2. Zheng, F.; Song, Z.; Li, L.; Yu, W. (1998). "The Distance Measure for Line Spectrum Pairs Applied to Speech Recognition" (PDF). Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP'98) (3): 1123–6.
  3. "List of IEEE Milestones". IEEE. Retrieved 15 July 2019.
  4. "Fumitada Itakura Oral History". IEEE Global History Network. 20 May 2009. Retrieved 2009-07-21.
  5. Tony Robinson: Speech Analysis. http://svr-www.eng.cam.ac.uk/~ajr/SpeechAnalysis/node51.html#SECTION000713000000000000000
  6. e.g. lsf.c in http://www.ietf.org/rfc/rfc3951.txt