JP2000268510A

Movatterモバイル変換

Info

Publication number: JP2000268510A
Application number: JP11075171A
Authority: JP
Inventors: Takehiko Kuran; 武彦九蘭
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1999-03-19
Filing date: 1999-03-19
Publication date: 2000-09-29

Abstract

PROBLEM TO BE SOLVED: To obtain an encoder capable of drastically enhancing the encoding efficiency even when a GHA(general harmonic analysis) is used. SOLUTION: A frame length setting circuit 14 outputs signals for successively specifying plural kinds of frame lengths which are preliminarily prepared to a framing circuit 11. A judgement circuit 13 calculates the number of lines of frequency extraction components in respective frame lengths in the section of the number of the same samples which are taken out one by one from a frequency component extracting circuit 12. A decision circuit 15 receives frequency extraction components of respective frame lengths and the number of lines of frequency extraction components in respective frame lengths and decides the frequency extraction component of a frame length from which the minimum number of lines of the frequency extraction component can be obtained as an optimum frame length by comparing the number of lines of frequency extraction components corresponding to respective frame lengths and outputs the frequency extraction component of the optimum frame length to an information volume reducing circuit 16 to perform a compression encoding for reducing information volume.

Description

Translated fromJapanese

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は符号化装置に係り、
特にオーディオ信号を符号化する符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an encoding device,
In particular, the present invention relates to an encoding device that encodes an audio signal.

【０００２】[0002]

【従来の技術】従来のオーディオ信号における符号化方
式は、時間周波数変換方式を基本としている。これは、
時間領域の入力信号を周波数領域に変換してから符号化
を行う方式で、周波数領域での信号の偏りや人間の聴覚
心理を利用した、効率的な符号化方式である。しかし、
この時間周波数変換方式による符号化方式で用いる高速
フーリエ変換（ＦＦＴ）や変形離散コサイン変換（ＭＤ
ＣＴ）等の周波数分析方法は、周期的で調和的な信号を
分析する理論であり、信号を観測した観測区間の外では
観測区間内の波形が周期的に繰り返すことを仮定してい
る。2. Description of the Related Art A conventional encoding method for audio signals is based on a time-frequency conversion method. this is,
This is a method in which an input signal in the time domain is converted into a frequency domain and then coding is performed. This is an efficient coding method using the bias of the signal in the frequency domain and psychoacoustics of human beings. But,
The fast Fourier transform (FFT) and the modified discrete cosine transform (MD) used in the encoding method by the time-frequency transform method
A frequency analysis method such as CT) is a theory that analyzes a periodic and harmonic signal, and assumes that a waveform in an observation section periodically repeats outside an observation section where the signal is observed.

【０００３】しかしながら、上記の周波数分析方法で
は、実際には観測区間によって多数の異なった周波数成
分が抽出されることから、周波数領域信号には既に誤差
が含まれ、符号化効率が低下し、また、周波数分解能が
観測区間の長さに反比例することから、過渡的な信号の
分析を行う場合でも、観測区間長を余り短くすることが
できず、周波数分解能が不足し符号化効率が低下し、更
に、観測区間外の波形の予測が困難であることから、予
測残差の符号化による符号化効率の向上が期待できず、
また更に、定常的でない信号の周波数分析が困難である
等の欠点がある。However, in the above-described frequency analysis method, since many different frequency components are actually extracted depending on the observation section, the frequency domain signal already contains an error, and the coding efficiency is reduced. However, since the frequency resolution is inversely proportional to the length of the observation section, even when analyzing a transient signal, the observation section length cannot be shortened too much, the frequency resolution becomes insufficient and the coding efficiency decreases, Furthermore, since it is difficult to predict the waveform outside the observation interval, it is not possible to expect an improvement in coding efficiency by coding the prediction residual,
Further, there is a drawback that it is difficult to analyze the frequency of an unsteady signal.

【０００４】一方、調和的でない信号に拡張したフーリ
エ解析の理論も知られている。この理論は、一般調和解
析（Generalized Harmonic Analysis：ＧＨＡ）と呼ば
れ、観測区間内で原波形から残差エネルギーが最小とな
る最も優勢な正弦波を抽出し、残差成分に同様の処理を
繰り返すという解析方法であり、定常的でない僅かな周
波数変動に対しても正確な周波数成分の抽出が可能であ
り、観測区間長と周波数の分解能は互いに独立して自由
な設定が可能であり、観測区間を越えて信号の予測が可
能であるなどの特徴がある。On the other hand, the theory of Fourier analysis extended to non-harmonic signals is also known. This theory is called Generalized Harmonic Analysis (GHA), which extracts the most dominant sine wave with the minimum residual energy from the original waveform in the observation section and repeats the same process for the residual component It is an analysis method that can accurately extract frequency components even for small frequency fluctuations that are not stationary, and the observation section length and frequency resolution can be set freely independently of each other. And the ability to predict signals beyond the threshold.

【０００５】[0005]

【発明が解決しようとする課題】そこで、上記のＧＨＡ
はＦＦＴやＭＤＣＴなどの調和解析よりも精度の高い周
波数分析が可能であるため、従来の符号化装置において
周波数分析手段として用いられているＦＦＴなどに代え
てＧＨＡを用いることが考えられる。この場合、周波数
分析をした成分のすべてを伝送するならば不連続波形は
生じないが、低ビットレート化のために大量の成分を捨
てるような場合は不連続波形が生じる可能性が高い。す
なわち、音楽信号のような非定常信号を分析した場合、
隣接したフレームＡ、Ｂを構成する周波数成分は大きく
異なり、従って例えば１００本の周波数成分のうち、最
も優勢な周波数成分を１０本だけ残したような場合に、
フレームＡ、Ｂで全く同じ周波数成分が残っていれば不
連続波形は生じないが、一般的にはそうとはいえず、本
数が少ない分だけ、フレームＡ、Ｂの残っている１０本
の周波数成分の差異が小さい場合でも、不連続波形が生
じる可能性が高い。Therefore, the above-mentioned GHA
Can perform frequency analysis with higher accuracy than harmonic analysis such as FFT and MDCT. Therefore, it is conceivable to use GHA instead of FFT or the like used as frequency analysis means in a conventional encoding device. In this case, a discontinuous waveform does not occur if all of the frequency-analyzed components are transmitted, but a discontinuous waveform is highly likely to occur when a large amount of components are discarded to reduce the bit rate. That is, when analyzing a non-stationary signal such as a music signal,
The frequency components constituting the adjacent frames A and B are greatly different. Therefore, for example, when only the 10 most dominant frequency components are left out of 100 frequency components,
If the same frequency components remain in frames A and B, no discontinuous waveform will be generated. However, in general, this is not the case, and the number of remaining 10 frequencies in frames A and B is reduced by the smaller number. Even when the difference between the components is small, there is a high possibility that a discontinuous waveform will occur.

【０００６】このように、符号化方式における周波数分
析手段としてＧＨＡを採用した場合でも、特に符号化し
たビットストリームのビットレートが低い場合には、観
測区間の継ぎ目における不連続波形が復号時に生じるこ
とがあるため、符号化部において、上記の不連続波形が
復号時に生じないようにするには観測区間を符号化時に
重複させる必要があり、符号化効率の大幅な向上は期待
できないという問題がある。As described above, even when GHA is employed as the frequency analysis means in the encoding method, a discontinuous waveform at the joint of the observation section occurs at the time of decoding, particularly when the bit rate of the encoded bit stream is low. Therefore, in the encoding unit, in order to prevent the discontinuous waveform from occurring at the time of decoding, it is necessary to overlap observation sections at the time of encoding, and there is a problem that a significant improvement in encoding efficiency cannot be expected. .

【０００７】本発明は以上の点に鑑みなされたもので、
ＧＨＡを使用しても符号化効率を大幅に向上し得る符号
化装置を提供することを目的とする。[0007] The present invention has been made in view of the above points,
It is an object of the present invention to provide an encoding device that can greatly improve encoding efficiency even when GHA is used.

【０００８】[0008]

【課題を解決するための手段】本発明は上記の目的を達
成するため、外部から設定された複数のサンプルからな
るフレーム長の単位で、入力ディジタルオーディオ信号
の周波数成分を一般調和解析により周波数分析する周波
数分析手段と、周波数分析手段において設定可能なフレ
ーム長を予め複数種類用意し、それら複数種類のフレー
ム長を周波数分析手段に対して順次設定して、この周波
数分析手段から得られた各フレーム長の周波数抽出成分
の本数を算出するフレーム長設定手段と、フレーム長設
定手段を通して得られた各フレーム長の周波数抽出成分
の本数のうち、同じサンプル区間で最小の周波数抽出成
分の本数が得られるフレーム長を最適フレーム長として
決定して、その最適フレーム長における周波数抽出成分
を出力する決定手段と、決定手段から取り出された複数
の周波数抽出成分の情報量を削減した符号化を行い、生
成された符号化データに少なくともフレーム長を示すヘ
ッダ情報を付加してビットストリームとして伝送路へ送
出する情報量削減手段とを有する構成としたものであ
る。SUMMARY OF THE INVENTION In order to achieve the above object, the present invention provides a frequency analysis of a frequency component of an input digital audio signal by a general harmonic analysis in units of a frame length composed of a plurality of externally set samples. A plurality of types of frame lengths that can be set in the frequency analysis unit and the frequency analysis unit to be set are prepared in advance, and the plurality of types of frame lengths are sequentially set for the frequency analysis unit. Frame length setting means for calculating the number of long frequency extraction components, and among the number of frequency extraction components of each frame length obtained through the frame length setting means, the minimum number of frequency extraction components can be obtained in the same sample section. Decision factor for determining the frame length as the optimum frame length and outputting the frequency extraction component at the optimum frame length Performs encoding with a reduced amount of information of a plurality of frequency extraction components extracted from the determining means, adds header information indicating at least a frame length to the generated encoded data, and sends out the bit stream as a bit stream. This configuration has an information amount reducing unit.

【０００９】この発明では、周波数分析手段による分析
終了の条件は、残差が十分に小さいことであり、周波数
抽出成分の本数ではなく、フレーム長と周波数抽出成分
の本数との間には相関関係はないということに着目し、
複数種類のフレーム長を周波数分析手段に対して順次設
定して、この周波数分析手段から得られた各フレーム長
の周波数抽出成分の本数のうち、同じサンプル区間で最
小の周波数抽出成分の本数が得られるフレーム長を最適
フレーム長として決定して、その最適フレーム長におけ
る周波数抽出成分に対して情報量を削減した符号化を行
うようにしたため、最小の周波数抽出成分に対して情報
量を削減した符号化を行うことができる。In the present invention, the condition for terminating the analysis by the frequency analysis means is that the residual is sufficiently small, and the correlation between the frame length and the number of frequency extraction components, not the number of frequency extraction components, is obtained. Noting that there is no
A plurality of types of frame lengths are sequentially set for the frequency analysis means, and among the number of frequency extraction components of each frame length obtained from the frequency analysis means, the minimum number of frequency extraction components in the same sample section is obtained. Is determined as the optimum frame length, and encoding is performed with the information amount reduced for the frequency extraction component at the optimum frame length, so that the code amount is reduced for the minimum frequency extraction component. Can be performed.

【００１０】また、本発明は、フレーム長設定手段を、
複数種類用意したフレーム長のうち、最も長いフレーム
長の整数倍の区間長毎に、フレーム長の決定のための処
理を行うことを特徴とする。更に、本発明は、情報量削
減手段を、入力された周波数抽出成分のうち、設定パワ
ーレベル以上のパワーレベルの周波数抽出成分に対し
て、圧縮符号化を行って符号化データを生成することを
特徴とする。The present invention also provides a frame length setting means,
A process for determining a frame length is performed for each section length that is an integral multiple of the longest frame length among a plurality of types of prepared frame lengths. Furthermore, the present invention provides an information amount reducing unit that performs compression encoding on a frequency extraction component having a power level equal to or higher than a set power level among input frequency extraction components to generate encoded data. Features.

【００１１】[0011]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面と共に説明する。図１は本発明になる符号化装置
の一実施の形態のブロック図を示す。同図において、符
号化装置は一般調和解析回路１０、周波数成分の抽出効
率算出と繰り返し判定を行う判定回路１３、フレーム長
を設定するフレーム長設定回路１４、最適フレーム長と
抽出成分の決定を行う決定回路１５、情報量削減回路１
６及びヘッダ情報付加回路１７より構成されている。一
般調和解析回路１０は、入力ディジタルオーディオ信号
をフレーム単位に分割するフレーム化回路１１と、フレ
ーム化回路１１の出力オーディオ信号からＧＨＡにより
複数の周波数成分を抽出する周波数成分抽出回路１２と
より構成されている。Next, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of an encoding apparatus according to the present invention. In FIG. 1, the coding apparatus includes a general harmonic analysis circuit 10, a determination circuit 13 for calculating frequency component extraction efficiency and performing repetition determination, a frame length setting circuit 14 for setting a frame length, and determining an optimum frame length and an extracted component. Decision circuit 15, information amount reduction circuit 1
6 and a header information adding circuit 17. The general harmonic analysis circuit 10 includes a framing circuit 11 for dividing an input digital audio signal into frame units, and a frequency component extraction circuit 12 for extracting a plurality of frequency components from the output audio signal of the framing circuit 11 by GHA. ing.

【００１２】次に、この実施の形態の動作について図２
及び図３のフローチャートを併せ参照して説明する。例
えば、サンプリング周波数４８ｋＨｚのディジタルオー
ディオ信号は、図１のフレーム化回路１１に供給され、
ここで複数のサンプルからなる観測区間（フレーム）に
分割される。このとき、フレームは必ずしも重複させる
必要はない。また、フレーム長は後述するように、フレ
ーム長設定回路１４よりのフレーム長設定信号により可
変設定される。Next, the operation of this embodiment will be described with reference to FIG.
And the flowchart of FIG. For example, a digital audio signal having a sampling frequency of 48 kHz is supplied to the framing circuit 11 of FIG.
Here, it is divided into observation sections (frames) composed of a plurality of samples. At this time, the frames do not necessarily have to overlap. The frame length is variably set by a frame length setting signal from a frame length setting circuit 14 as described later.

【００１３】フレーム化回路１１によりフレームに分割
されて取り出されたディジタルオーディオ信号は、周波
数成分抽出回路１２に供給され、ここでフレーム単位で
フレーム内の信号の周波数成分が、図２及び図３に示す
フローチャートの手順に従って一般調和解析（ＧＨＡ）
により抽出される。The digital audio signal divided into frames and extracted by the framing circuit 11 is supplied to a frequency component extracting circuit 12, where the frequency components of the signals within the frame are shown in FIGS. 2 and 3. General Harmonic Analysis (GHA) according to the procedure of the flowchart shown
Is extracted by

【００１４】すなわち、まず、１フレームのディジタル
オーディオ信号が入力されると（図２のステップ１０
１）、その１フレームの信号から、残差信号が最も小さ
くなるような周波数成分を１本だけ抽出する（図２のス
テップ１０２）。続いて、抽出した周波数成分を除いた
原信号、すなわち残差は十分に小さいかどうか判定し
（図２のステップ１０３）、残差が所定のしきい値より
大きいときは上記のステップ１０２の処理を残差に対し
て再び行い、残差が十分に小さくなった時点で分析を終
了する。そして、得られた分析結果が図１の情報量削減
回路１４へ出力される（図２のステップ１０４）。That is, first, when a digital audio signal of one frame is inputted (step 10 in FIG. 2).
1) Extract only one frequency component that minimizes the residual signal from the signal of one frame (step 102 in FIG. 2). Subsequently, it is determined whether or not the original signal from which the extracted frequency components have been removed, that is, the residual is sufficiently small (step 103 in FIG. 2). If the residual is larger than a predetermined threshold value, the processing in step 102 is performed. Is performed again on the residual, and the analysis is terminated when the residual becomes sufficiently small. Then, the obtained analysis result is output to the information amount reducing circuit 14 in FIG. 1 (Step 104 in FIG. 2).

【００１５】図３は上記のステップ１０２の詳細処理を
示すフローチャートで、上記のステップ１０３も併せて
図示している。ステップ１０２の周波数成分分析処理に
ついて、図３のフローチャートと共に更に詳細に説明す
るに、入力された１フレームの信号から、ある周波数ｆ
の信号を１つ選択し（図３のステップ２０１）、短時間
の観測区間［０，Ｌ］で観測された連続信号ｘ₀（ｔ）
から一意的に決まるフーリエ係数Ｓ（ｆ）及びＣ（ｆ）
を次式により計算する（図３のステップ２０２）。FIG. 3 is a flowchart showing the detailed processing of the above-mentioned step 102, and also shows the above-mentioned step 103. The frequency component analysis process of step 102 will be described in more detail with reference to the flowchart of FIG.
Is selected (step 201 in FIG. 3), and the continuous signal x₀ (t) observed in the short-time observation section [0, L].
Fourier coefficients S (f) and C (f) uniquely determined from
Is calculated by the following equation (Step 202 in FIG. 3).

【００１６】[0016]

【数１】ただし、（１）式及び（２）式中、ｆは任意の周波数、
Ｔは周期で１／ｆ、ｎは整数、ｎＴ≦Ｌである。(Equation 1) Here, in the equations (1) and (2), f is an arbitrary frequency,
T is 1 / f in cycle, n is an integer, and nT ≦ L.

【００１７】続いて、上記のフーリエ係数Ｓ（ｆ）及び
Ｃ（ｆ）を用いて、次式により観測区間における残差信
号ε（ｔ，ｆ）と、そのエネルギーＥ（ｆ）を算出する
（図３のステップ２０３）。Subsequently, using the above Fourier coefficients S (f) and C (f), the residual signal ε (t, f) in the observation section and its energy E (f) are calculated by the following equation ( Step 203 in FIG. 3).

【００１８】[0018]

【数２】続いて、上記の周波数ｆが最後の周波数成分かどうか判
定し（図３のステップ２０４）、最後の周波数成分でな
ければ、ステップ２０１〜２０３の処理を再び行う。こ
のようにして、１フレームのすべての周波数成分につい
て、上記の（１）式〜（４）式の計算を行い、そのうち
（４）式で表されるエネルギーＥ（ｆ）を最小とする周
波数ｆ₁とフーリエ係数Ｓ₁（ｆ₁）、Ｃ₁（ｆ₁）を求め
る。このときの周波数ｆ₁が最優勢スペクトルであり、
これが選択される（図３のステップ２０５）。(Equation 2) Subsequently, it is determined whether or not the frequency f is the last frequency component (step 204 in FIG. 3). If the frequency f is not the last frequency component, the processing of steps 201 to 203 is performed again. In this manner, the calculations of the above equations (1) to (4) are performed for all the frequency components of one frame, and the frequency f that minimizes the energy E (f) expressed by the equation (4) is obtained.₁ and Fourier coefficients S₁ (f₁ ) and C₁ (f₁ ) are obtained. The frequency f_{1 at} this time is the most dominant spectrum,
This is selected (step 205 in FIG. 3).

【００１９】次に、次式ｘ₁（ｔ）＝ｘ₀（ｔ）−Ｓ₁（ｆ₁）ｓｉｎ（２πｆ₁ｔ） −Ｃ₁（ｆ₁）ｃｏｓ（２πｆ₁ｔ）（５）により、原信号ｘ₀（ｔ）からステップ２０５で求めた
周波数ｆ₁の最優勢スペクトルによる最優勢成分を削除
し、残差ｘ₁（ｔ）を求める（図３のステップ２０
６）。Next, the following equation_{_{x 1 (t) = x 0}} (t) -S 1 (f 1) sin (2πf 1 t) -C 1 (f 1) cos (2πf 1 t) (5), The residual x₁ (t) is determined from the original signal x₀ (t) by deleting the most dominant component of the frequency f₁ determined in step 205 by the dominant spectrum (step 20 in FIG. 3).
6).

【００２０】続いて、この残差ｘ₁（ｔ）が十分に小さ
いかどうか判定する（図３のステップ１０３）。ここ
で、このステップ１０３での判定は、予め設定した所定
のしきい値と残差ｘ₁（ｔ）を大小比較した比較結果に
基づいて行ってもよいが、ここでは残差ｘ₁（ｔ）が十
分に小さくなるであろうと期待される予め設定したＮ
回、残差ｘ₁（ｔ）を原信号と見なしてステップ２０１
〜２０６の処理を繰り返したかどうかで判定する。Subsequently, it is determined whether or not the residual x₁ (t) is sufficiently small (step 103 in FIG. 3). Here, the determination in step 103 may be made based on a comparison result of magnitude comparison between a predetermined threshold value set in advance and the residual x₁ (t), but here, the residual x₁ (t) ) Is expected to be sufficiently small
Times, the residual x₁ (t) is regarded as the original signal and step 201
The determination is made based on whether or not the processing of steps S206 to S206 has been repeated.

【００２１】これにより、原信号ｘ₀（ｔ）は、観測区
間［０，Ｌ］において次式のｘ（ｔ）で表される。Thus, the original signal x₀ (t) is represented by the following equation x (t) in the observation section [0, L].

【００２２】[0022]

【数３】（６）式のように、ｆ₁，ｆ₂，．．．，ｆ_Nの、計Ｎ本
の正弦波（周波数抽出成分）の重ね合わせで、高精度に
推定できる。これらの正弦波は調和的（基本波の整数倍
の周波数）とは限らない。なお、（６）式で表される信
号ｘ（ｔ）のパワーレベルＰ（ｆ_k）は、Ｓ²_k＋Ｃ²_k
で表される。(Equation 3) As shown in equation (6), f₁ , f₂ ,. . . , Of f_N, a superposition of sinusoidal total N present (frequency extraction component), it can be estimated with high accuracy. These sine waves are not necessarily harmonic (frequency that is an integral multiple of the fundamental wave). Note that the power level P (f_k ) of the signal x (t) represented by the equation (6) is S²_k + C²_k
It is represented by

【００２３】ここで、周波数成分抽出における分析終了
の条件は、図２及び図３のステップ１０３に示したよう
に、残差が十分に小さいことであり、周波数抽出成分の
本数ではない。「残差が十分に小さいこと」とは、１サ
ンプル当たりの振幅の二乗誤差値が、十分小さな値で、
フレーム長に関係なく一定の値ｄｔ（＞０）より小であ
るということである。つまり、フレーム長と周波数抽出
成分の本数との間には相関関係はない。しかし、符号化
効率向上のためには、周波数抽出成分の本数は少ないほ
どよい。Here, the condition for terminating the analysis in the frequency component extraction is that the residual is sufficiently small as shown in step 103 in FIGS. 2 and 3, and is not the number of frequency extraction components. "The residual is sufficiently small" means that the square error value of the amplitude per sample is a sufficiently small value,
That is, it is smaller than a certain value dt (> 0) regardless of the frame length. That is, there is no correlation between the frame length and the number of frequency extraction components. However, in order to improve the coding efficiency, the smaller the number of frequency extraction components, the better.

【００２４】そこで、判定回路１３はフレーム化回路１
１における分割単位であるフレーム長を予め複数種類用
意し、それらを順次指定する信号をフレーム長設定回路
１４を通してフレーム化回路１１へ出力する。そして、
判定回路１３は、周波数成分抽出回路１２から逐次取り
出された、同じサンプル数の区間における各フレーム長
での周波数抽出成分の本数を算出する。決定回路１５
は、判定回路１３より取り出された、フレーム長の周波
数抽出成分と、各フレーム長での周波数抽出成分の本数
とを受け、各フレーム長に対応する周波数抽出成分の本
数を比較して、最小の周波数抽出成分の本数が得られる
フレーム長の周波数抽出成分を最適フレーム長として決
定して、その最適フレーム長の周波数抽出成分を情報量
削減回路１６へ出力する。Therefore, the judgment circuit 13 is provided with the framing circuit 1
A plurality of types of frame lengths, which are division units in 1, are prepared in advance, and signals for sequentially specifying the frame lengths are output to the framing circuit 11 through the frame length setting circuit 14. And
The determination circuit 13 calculates the number of frequency extraction components for each frame length in the section of the same number of samples sequentially extracted from the frequency component extraction circuit 12. Decision circuit 15
Receives the frequency-extracted components of the frame length extracted from the determination circuit 13 and the number of frequency-extracted components at each frame length, compares the number of frequency-extracted components corresponding to each frame length, and obtains the minimum A frequency extraction component having a frame length from which the number of frequency extraction components is obtained is determined as an optimum frame length, and the frequency extraction component having the optimum frame length is output to the information amount reducing circuit 16.

【００２５】例えば、１０２４サンプルの区間を分析し
た結果、１０２４サンプルを一つのフレーム長（第１の
フレーム長）としたときの周波数抽出成分の本数が１０
０本であり、１０２４サンプルを２分割して５１２サン
プルのフレーム長（第２のフレーム長）としたときの１
０２４サンプル分の周波数抽出成分の本数が８５（＝４
０＋４５）本であり、１０２４サンプルを４分割して２
５６サンプルのフレーム長（第３のフレーム長）とした
ときの１０２４サンプル分の周波数抽出成分の本数が１
１０（＝３５＋２５＋３０＋２０）本であった場合は、
第２のフレーム長を最適フレーム長として決定する。For example, as a result of analyzing a section of 1024 samples, when the number of frequency extraction components when 1024 samples are one frame length (first frame length) is 10
0, which is 1 when 1024 samples are divided into two to make a frame length of 512 samples (second frame length).
The number of frequency extraction components for 024 samples is 85 (= 4
0 + 45) lines, and 1024 samples were divided into four
When the frame length of 56 samples (the third frame length) is 1, the number of frequency extraction components for 1024 samples is 1
If there are 10 (= 35 + 25 + 30 + 20),
The second frame length is determined as the optimum frame length.

【００２６】実際には、最も長いフレーム長を１０２４
〜４０９６サンプル程度に決めておき、その１／２、１
／４、１／８、１／１６程度の複数のフレーム長によっ
てそれぞれ仮分析して、最適なフレーム長を決定する。
このようにして、ディジタルオーディオ信号の性質に応
じた最適なフレーム長を決定する適応フレーム長の処理
が行われ、単位時間当たりできるだけ少ない本数の周波
数抽出成分が決定回路１５より取り出される。なお、上
記の適応フレーム長の処理は、最も長いフレーム長の整
数倍の区間長毎に行う。In practice, the longest frame length is 1024
About 4096 samples, 1/2, 1
Provisional analysis is performed for each of a plurality of frame lengths of about ４, ８, and 1/16 to determine an optimum frame length.
In this way, the processing of the adaptive frame length for determining the optimum frame length according to the characteristics of the digital audio signal is performed, and as few frequency extraction components as possible per unit time are extracted from the determination circuit 15. The processing of the adaptive frame length is performed for each section length that is an integral multiple of the longest frame length.

【００２７】上記の周波数抽出成分は、図１の情報量削
減回路１６に供給され、ここで設定パワーレベルより小
さいパワーレベルの抽出成分は出力されないことで、情
報量が削減されると共に、更に設定パワーレベル以上の
パワーレベルの周波数抽出成分に対して、ハフマン符号
化などの可逆圧縮符号化による情報量の削減がなされ
る。情報量削減回路１６により生成された符号化データ
は、ヘッダ情報付加回路１７に供給され、ここで周波数
抽出成分の本数とフレーム長情報をそれぞれ記録したヘ
ッダ情報が付加されてビットストリームとして任意の伝
送路へ送出される。The above-mentioned frequency extraction component is supplied to the information amount reduction circuit 16 of FIG. 1. Here, the extraction amount of the power level smaller than the set power level is not output, so that the information amount is reduced and the setting amount is further reduced. For a frequency extraction component having a power level equal to or higher than the power level, the amount of information is reduced by lossless compression encoding such as Huffman encoding. The coded data generated by the information amount reducing circuit 16 is supplied to a header information adding circuit 17, where header information in which the number of frequency extraction components and frame length information are respectively added is added, and arbitrarily transmitted as a bit stream. Sent to the road.

【００２８】これにより、例えば、上記の第１のフレー
ム長から第３のフレーム長の３つのフレーム長を例にと
ると、最も長い第１のフレーム長（１０２４サンプル）
の整数倍の予め設定した区間長として、１０２４サンプ
ルの区間毎に上記の適応フレーム長の処理を行った場合
は、３つのフレーム長のどのフレーム長で符号化したか
を示すフレーム長情報をヘッダ内に設けて伝送すること
により、復号化部（デコーダ）側では、フレーム長情報
から第２のフレーム長情報を得たときはそのフレーム長
が２回続くことが、また第３のフレーム長情報を得たと
きはそのフレーム長が４回続くことがわかる。Thus, for example, taking the three frame lengths from the first frame length to the third frame length as an example, the longest first frame length (1024 samples)
If the above adaptive frame length processing is performed for each section of 1024 samples as a preset section length of an integral multiple of the above, frame length information indicating which of the three frame lengths has been coded is included in the header. In the decoding unit (decoder) side, when the second frame length information is obtained from the frame length information, the frame length continues twice, and the third frame length information , It is understood that the frame length continues four times.

【００２９】[0029]

【発明の効果】以上説明したように、本発明によれば、
同じサンプル区間で最小の周波数抽出成分の本数が得ら
れるフレーム長を最適フレーム長として決定して、その
最適フレーム長における周波数抽出成分に対して情報量
を削減した符号化を行うことにより、単位時間当たりで
きるだけ少ない本数の周波数成分を抽出して符号化する
ようにしているため、符号化効率を大幅に向上できる。As described above, according to the present invention,
By determining the frame length at which the minimum number of frequency extraction components can be obtained in the same sample section as the optimal frame length, and performing encoding with a reduced amount of information on the frequency extraction components at the optimal frame length, the unit time Since the number of frequency components as small as possible is extracted and encoded, the encoding efficiency can be greatly improved.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の一実施の形態のブロック図である。FIG. 1 is a block diagram of an embodiment of the present invention.

【図２】図１中の要部の動作説明用フローチャートであ
る。FIG. 2 is a flowchart for explaining an operation of a main part in FIG. 1;

【図３】図２の要部の周波数成分抽出ステップを詳細に
説明するフローチャートである。FIG. 3 is a flowchart illustrating in detail a frequency component extraction step of a main part in FIG. 2;

【符号の説明】[Explanation of symbols]

１０一般調和解析回路（周波数分析手段）１１フレーム化回路１２周波数成分抽出回路１３判定回路（フレーム長設定手段）１４フレーム長設定回路（フレーム長設定手段）１５決定回路（決定手段）１６情報量削減回路（情報量削減手段）１７ヘッダ情報付加回路（情報量削減手段） DESCRIPTION OF SYMBOLS 10 General harmonic analysis circuit (frequency analysis means) 11 Framing circuit 12 Frequency component extraction circuit 13 Judgment circuit (frame length setting means) 14 Frame length setting circuit (frame length setting means) 15 Decision circuit (decision means) 16 Information amount reduction Circuit (information amount reducing means) 17 Header information adding circuit (information amount reducing means)

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5D044 AB05 DE03 GK08 GL28 5D045 DA20 5J064 AA02 BA16 BB05 BC02 BC14 BC18 BD01 9A001 BB02 CC02 EE02 EE04 FF05 GG01 GG03 HH15 ──────────────────────────────────────────────────続き Continued on the front page F term (reference) 5D044 AB05 DE03 GK08 GL28 5D045 DA20 5J064 AA02 BA16 BB05 BC02 BC14 BC18 BD01 9A001 BB02 CC02 EE02 EE04 FF05 GG01 GG03 HH15

Claims

Translated fromJapanese

【特許請求の範囲】[Claims]

【請求項１】外部から設定された複数のサンプルから
なるフレーム長の単位で、入力ディジタルオーディオ信
号の周波数成分を一般調和解析により周波数分析する周
波数分析手段と、前記周波数分析手段において設定可能な前記フレーム長
を予め複数種類用意し、それら複数種類のフレーム長を
前記周波数分析手段に対して順次設定して、この周波数
分析手段から得られた各フレーム長の周波数抽出成分の
本数を算出するフレーム長設定手段と、前記フレーム長設定手段を通して得られた前記各フレー
ム長の周波数抽出成分の本数のうち、同じサンプル区間
で最小の周波数抽出成分の本数が得られるフレーム長を
最適フレーム長として決定して、その最適フレーム長に
おける周波数抽出成分を出力する決定手段と、前記決定手段から取り出された複数の周波数抽出成分の
情報量を削減した符号化を行い、生成された符号化デー
タに少なくともフレーム長を示すヘッダ情報を付加して
ビットストリームとして伝送路へ送出する情報量削減手
段とを有することを特徴とする符号化装置。1. A frequency analysis means for frequency-analyzing a frequency component of an input digital audio signal by general harmonic analysis in units of a frame length composed of a plurality of samples set from outside, and said frequency analysis means being capable of being set by said frequency analysis means. A plurality of types of frame lengths are prepared in advance, and the plurality of types of frame lengths are sequentially set to the frequency analysis unit, and the number of frequency extraction components of each frame length obtained from the frequency analysis unit is calculated. Setting means, of the number of frequency extraction components of each of the frame lengths obtained through the frame length setting means, determine the frame length that provides the minimum number of frequency extraction components in the same sample section as the optimal frame length. Deciding means for outputting a frequency extraction component at the optimum frame length; Means for reducing the amount of information of the plurality of extracted frequency extracted components, adding header information indicating at least a frame length to the generated coded data, and transmitting the information as a bit stream to a transmission path. An encoding device, comprising:

【請求項２】前記フレーム長設定手段は、前記複数種
類用意したフレーム長のうち、最も長いフレーム長の整
数倍の区間長毎に、前記フレーム長の決定のための処理
を行うことを特徴とする請求項１記載の符号化装置。2. The method according to claim 1, wherein the frame length setting means performs a process for determining the frame length for each section length that is an integral multiple of the longest frame length among the plurality of types of prepared frame lengths. The encoding device according to claim 1, wherein

【請求項３】前記情報量削減手段は、入力された周波
数抽出成分のうち、設定パワーレベル以上のパワーレベ
ルの周波数抽出成分に対して、圧縮符号化を行って符号
化データを生成することを特徴とする請求項１記載の符
号化装置。3. The information amount reducing unit performs compression encoding on a frequency extraction component having a power level equal to or higher than a set power level among input frequency extraction components to generate encoded data. The encoding device according to claim 1, wherein