JPH0816192A

Movatterモバイル変換

Info

Publication number: JPH0816192A
Application number: JP6152006A
Authority: JP
Inventors: Akira Okamoto; 彰岡本
Original assignee: Kanebo Ltd
Current assignee: Kanebo Ltd
Priority date: 1994-07-04
Filing date: 1994-07-04
Publication date: 1996-01-19
Anticipated expiration: 2014-11-02
Also published as: JP2969047B2

Abstract

PURPOSE:To provide a voice data compression device in which the compression rate is improved while keeping a high tone quality. CONSTITUTION:Analogue voice signals are converted into a series of voice data by an A/D converter 2 and the converted voice data are temporarily stored in a register 3. By the address specification of the address selector 5, a series of voice data kept in the register 3 are arranged in a matrix memory 4. At this time, the voice data are arranged in a two-dimensional matrix so that the data are correlated in horizontal and vertical directions. The voice data arranged in the memory 4 are two-dimensional discrete cosine converted by a two-dimensional DCT section 6 and DCT coefficients are obtained.

Description

Translated fromJapanese

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、データ圧縮を行うデー
タ圧縮装置およびデータ圧縮方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data compression device and a data compression method for performing data compression.

【０００２】[0002]

【従来の技術】従来より、情報通信においてデータの伝
送量を削減して通信効率を高めるために、種々のデータ
圧縮技術が開発されている。画像データの圧縮技術とし
ては、例えば離散コサイン変換（ＤＣＴ）方式が国際標
準の１つとして採用されている。2. Description of the Related Art Conventionally, various data compression techniques have been developed in order to reduce the amount of data transmission and improve communication efficiency in information communication. As a technique for compressing image data, for example, the discrete cosine transform (DCT) method is adopted as one of international standards.

【０００３】一方、音声データ（オーディオデータ）の
圧縮技術としては、例えば適応差分ＰＣＭ（ＡＤＰＣ
Ｍ）方式がある。この適応差分ＰＣＭ方式では、高品質
の音声が得られるが、量子化ビット数はあまり削減でき
ない。On the other hand, as a compression technique of voice data (audio data), for example, adaptive differential PCM (ADPC)
M) method is available. With this adaptive difference PCM method, high quality speech can be obtained, but the number of quantization bits cannot be reduced so much.

【０００４】そこで、音声データの高能率なデータ圧縮
を実現するために、特開平６−６７６９９号公報に二次
元離散コサイン変換を音声データに適用する方法が開示
されている。Therefore, in order to realize highly efficient data compression of voice data, Japanese Patent Laid-Open No. 6-67699 discloses a method of applying two-dimensional discrete cosine transform to voice data.

【０００５】ここで、まず、図２５〜図２７を参照しな
がら画像データ圧縮のために用いられる二次元離散コサ
イン変換方式について説明する。図２５は離散コサイン
変換方式を実行するためのシステムの基本構成を示すブ
ロック図である。First, the two-dimensional discrete cosine transform method used for image data compression will be described with reference to FIGS. 25 to 27. FIG. 25 is a block diagram showing the basic configuration of a system for executing the discrete cosine transform method.

【０００６】符号化側では、ＤＣＴ装置１００が、入力
される原画像データの二次元離散コサイン変換を行い、
ＤＣＴ係数を出力する。量子化器２００は、量子化テー
ブル４００を参照してＤＣＴ係数に量子化処理を行い、
量子化されたＤＣＴ係数（以下、量子化ＤＣＴ係数と呼
ぶ）を出力する。エントロピー符号化器３００は、符号
化テーブル５００を参照して量子化ＤＣＴ係数にエント
ロピー符号化処理を行い、圧縮データを出力する。エン
トロピー符号化の方式としてハフマン符号化方式が用い
られる。On the encoding side, the DCT device 100 performs a two-dimensional discrete cosine transform of the input original image data,
Output the DCT coefficient. The quantizer 200 refers to the quantization table 400 to perform a quantization process on the DCT coefficient,
The quantized DCT coefficient (hereinafter referred to as a quantized DCT coefficient) is output. The entropy encoder 300 refers to the encoding table 500, performs entropy encoding processing on the quantized DCT coefficient, and outputs compressed data. The Huffman coding method is used as the entropy coding method.

【０００７】復号化側では、エントロピー復号器６００
が、符号化テーブル５００を参照して圧縮データにエン
トロピー復号化処理を行い、量子化ＤＣＴ係数を出力す
る。逆量子化器７００は、量子化テーブル４００を参照
して量子化ＤＣＴ係数に逆量子化処理を行い、ＤＣＴ係
数を出力する。逆ＤＣＴ装置８００は、ＤＣＴ係数に逆
二次元離散コサイン変換を行い、再生画像データを出力
する。On the decoding side, the entropy decoder 600
Performs the entropy decoding process on the compressed data with reference to the encoding table 500, and outputs the quantized DCT coefficient. The inverse quantizer 700 refers to the quantization table 400, performs inverse quantization processing on the quantized DCT coefficient, and outputs the DCT coefficient. The inverse DCT device 800 performs inverse two-dimensional discrete cosine transform on the DCT coefficient and outputs reproduced image data.

【０００８】次に、二次元離散コサイン変換方法を説明
する。まず、図２６に示すように、画像データを複数の
８×８画素ブロックに分割する。図２７に示すように、
１つの８×８画素ブロック内の各画素データの値をＰ
_{X Y}（Ｘ，Ｙ＝０，…，７）で示す。ここで、Ｘ，Ｙが
ブロック内の画素データの位置を表わしている。Next, a two-dimensional discrete cosine transform method will be described. First, as shown in FIG. 26, the image data is divided into a plurality of 8 × 8 pixel blocks. As shown in FIG. 27,
The value of each pixel data in one 8 × 8 pixel block is set to P
It is indicated by_XY (X, Y = 0, ..., 7). Here, X and Y represent the position of the pixel data in the block.

【０００９】分割された各８×８画素ブロックに対し
て、数１による二次元離散コサイン変換を行う。Two-dimensional discrete cosine transform according to equation 1 is performed on each of the divided 8 × 8 pixel blocks.

【００１０】[0010]

【数１】[Equation 1]

【００１１】ここで、Ｓ_{U V}（Ｕ，Ｖ＝０，…，７）は
ＤＣＴ係数を表わし、Ｕ，ＶはＤＣＴ係数の位置を表わ
す。Ｕ＝Ｖ＝０の場合にはＣ_U＝Ｃ_V＝１／√２とな
り、その他の場合にはＣ_U＝Ｃ_V＝１となる。さらに、
画素データＰ_{X Y}のビット精度が８ビットの場合にはＬ
_S＝１２８となり、画素データＰ_{X Y}のビット精度が１
２ビットの場合にはＬ_S＝２０４８となる。Here, S_UV (U, V = 0, ..., 7) represents the DCT coefficient, and U and V represent the position of the DCT coefficient. When U = V = 0, C_U = C_V = 1 / √2, and in other cases, C_U = C_V = 1. further,
If the bit precision of the pixel data P_XY is 8 bits, L
_S = 128 and the bit precision of pixel data P_XY is 1
In the case of 2 bits, L_S = 2048.

【００１２】二次元離散コサイン変換の結果、６４個の
ＤＣＴ係数Ｓ_{U V}が得られる。ＤＣＴ係数Ｓ_{0 0}はＤＣ
係数と呼ばれ、残りの６３個ＤＣＴ係数はＡＣ係数と呼
ばれる。As a result of the two-dimensional discrete cosine transform, 64 DCT coefficients S_UV are obtained. DCT coefficient S_{0 0} is DC
The remaining 63 DCT coefficients are called coefficients and are called AC coefficients.

【００１３】ＤＣ係数は８×８画素データの平均値（直
流成分）を示している。数１に示されるように、各画素
データＰ_{X Y}よりＬ_Sを引くことにより、ＤＣ係数の期
待値を０にレベルシフトしている。The DC coefficient indicates the average value (DC component) of 8 × 8 pixel data. As shown in Expression 1, the expected value of the DC coefficient is level-shifted to 0 by subtracting L_S from each pixel data P_XY .

【００１４】図２７に示すように、二次元離散コサイン
変換されたブロックの左上および右下がそれぞれＤＣＴ
係数Ｓ_{0 0}，Ｓ_{7 7}に対応している。二次元離散コサイ
ン変換されたブロックの左から右に進むにつれて高周波
の水平周波数成分を多く含み、上から下に進むにつれて
高周波の垂直周波数成分を多く含むことになる。As shown in FIG. 27, the upper left and lower right of the block subjected to the two-dimensional discrete cosine transform are DCT, respectively.
It corresponds to the coefficients S_{0 0} and S_{7 7} . The two-dimensional discrete cosine transformed block contains many high frequency horizontal frequency components as it goes from left to right, and many high frequency vertical frequency components as it goes from top to bottom.

【００１５】一方、数２に示す逆二次元離散コサイン変
換によりＤＣＴ係数Ｓ_{U V}から６４個の画素データＰ
_{X Y}（Ｘ，Ｙ＝０，…，７）を得ることができる。On the other hand, 64 pixel data P from the DCT coefficient S_UV by the inverse two-dimensional discrete cosine transform shown in Equation 2.
_XY (X, Y = 0, ..., 7) can be obtained.

【００１６】[0016]

【数２】[Equation 2]

【００１７】数２に示されるように、各画素データにＬ
_Sを加えることにより、レベルシフト分を元に戻してい
る。ＤＣＴ係数Ｓ_{U V}は、係数位置ごとに異なる量子化
テーブルＱ_{U V}を用いて、数３により線型量子化され、
量子化ＤＣＴ係数ｒ_{U V}が得られる。As shown in equation 2, each pixel data has L
By adding_S , the level shift amount is restored. The DCT coefficient S_UV is linearly quantized by Equation 3 using a quantization table Q_UV which is different for each coefficient position,
The quantized DCT coefficient r_UV is obtained.

【００１８】[0018]

【数３】ｒ_{U V}＝ｒｏｕｎｄ（Ｓ_{U V}／Ｑ_{U V}）ｒｏｕｎｄは、最も近い整数への整数化を意味する。## EQU00003 ## r_UV = round (S_UV / Q_UV ) round means integerization to the nearest integer.

【００１９】復号化側では、逆量子化が行われる。ハフ
マン復号化で得られた量子化ＤＣＴ係数をｒ_{U V}とする
と、数４により逆量子化を行う。On the decoding side, inverse quantization is performed. When the quantized DCT coefficient obtained by the Huffman decoding is r_UV , inverse quantization is performed by the equation 4.

【００２０】[0020]

【数４】Ｓ_{U V}＝ｒ_{U V}×Ｑ_{U V} 量子化テーブルＱ_{U V}の値を変化させることにより画質
を制御することができる。量子化テーブルＱ_{U V}の値を
小さく設定すると、量子化ＤＣＴ係数ｒ_{U V}の値が大き
くなり、画質の良い画像を符号化することができる。逆
に、量子化テーブルＱ_{U V}の値を大きく設定すると、量
子化ＤＣＴ係数ｒ_{U V}の値が小さくなり、符号化情報量
は減少するが、画質は劣化する。## EQU00004 ## The image quality can be controlled by changing the value of S_UV = r_UV × Q_UV quantization table Q_UV . When the value of the quantization table Q_UV is set to be small, the value of the quantized DCT coefficient r_UV becomes large, and an image with good image quality can be encoded. On the contrary, when the value of the quantization table Q_UV is set to be large, the value of the quantized DCT coefficient r_UV becomes small and the amount of encoded information decreases, but the image quality deteriorates.

【００２１】このように、量子化テーブルＱ_{U V}の値を
変えることにより、画質および符号化情報を自由に制御
することができる。次に、特開平６−６７６９９号公報
に開示された二次元離散コサイン変換を用いた音声デー
タの圧縮方法を図２８を参照して説明する。As described above, by changing the value of the quantization table Q_UV , the image quality and the coding information can be freely controlled. Next, a method of compressing audio data using the two-dimensional discrete cosine transform disclosed in JP-A-6-67699 will be described with reference to FIG.

【００２２】図２８の（ａ）に示すように、一連の音声
データをＤ１，Ｄ２，…，Ｄ６４とする。音声データは
一次元性を有するので、そのままの形では二次元離散コ
サイン変換を行うことはできない。As shown in FIG. 28 (a), a series of audio data is D1, D2, ..., D64. Since the voice data has one-dimensionality, the two-dimensional discrete cosine transform cannot be performed as it is.

【００２３】そこで、図２８の（ｂ）に示すように、一
連の音声データＤ１，Ｄ２，…，Ｄ６４を例えば８×８
のマトリクスに配列し、疑似的に二次元マトリクスを形
成する。ここで、マトリクスのＸ方向の座標およびＹ方
向の座標をそれじれＸ＝０，１，…，７、Ｙ＝０，１，
…，７とする。Therefore, as shown in FIG. 28B, a series of audio data D1, D2, ..., D64 is, for example, 8 × 8.
To form a pseudo two-dimensional matrix. Here, the coordinates in the X direction and the coordinates in the Y direction of the matrix are respectively changed to X = 0, 1, ..., 7, Y = 0, 1,
…, 7

【００２４】まず、マトリクスのＹ＝０の行に音声デー
タＤ１，Ｄ２，…，Ｄ８をＸ方向に順に配列し、Ｙ＝１
の行に音声データＤ９，Ｄ１０，…，Ｄ１５をＸ方向に
順に配列する。同様にして、Ｙ＝２，３，…，７の各行
に音声データＤ１７，Ｄ１８，…，Ｄ６４をＸ方向に順
に配列する。このようにして、音声データＤ１，Ｄ２，
…，Ｄ６４が二次元マトリクスに配列される。First, audio data D1, D2, ..., D8 are arranged in order in the X direction on the row of Y = 0 in the matrix, and Y = 1.
, D15 are arranged in this order in the X direction. Similarly, audio data D17, D18, ..., D64 are sequentially arranged in the X direction in each row of Y = 2, 3 ,. In this way, the audio data D1, D2
..., D64 are arranged in a two-dimensional matrix.

【００２５】このようにして配列された音声データＤ
１，Ｄ２，…，Ｄ６４に上記の二次元離散コサイン変換
を行うことにより音声データを圧縮することができる。The voice data D arranged in this way
Audio data can be compressed by performing the above-described two-dimensional discrete cosine transform on 1, D2, ..., D64.

【００２６】[0026]

【発明が解決しようとする課題】しかしながら、図２８
に示した従来の音声データ圧縮方法においては、Ｘ方向
に隣接する音声データどうしは相互に相関性を有する
が、Ｙ方向に隣接する音声データどうしは相関性を有し
ない。すなわち、Ｘ方向には連続する音声データが順に
配列されているが、Ｙ方向には全く関連のない音声デー
タが配列されることになる。したがって、このように配
列された音声データに二次元離散コサイン変換を行った
場合、圧縮比をあまり大きくすることができないという
問題がある。However, as shown in FIG.
In the conventional audio data compression method shown in FIG. 3, audio data adjacent in the X direction have mutual correlation, but audio data adjacent in the Y direction do not have correlation. That is, although continuous audio data are arranged in order in the X direction, completely unrelated audio data is arranged in the Y direction. Therefore, when the two-dimensional discrete cosine transform is performed on the audio data arranged in this way, there is a problem that the compression ratio cannot be increased so much.

【００２７】一方、音声データの聴覚重み付け符号化に
おいて、符号データのデータ長を一定の長さ以下に固定
化するように重み付けを行った場合、音声データの内容
によって良い音質および悪い音質が出現するという問題
がある。On the other hand, in the perceptual weighting encoding of voice data, when weighting is performed so that the data length of the code data is fixed to a fixed length or less, good sound quality and bad sound quality appear depending on the content of the voice data. There is a problem.

【００２８】さらに、図２５に示した方式では、ＤＣＴ
係数の量子化処理においてデータ量が削減されるが、そ
の量子化処理により量子化誤差が生じるという問題があ
る。それゆえに、本発明の目的は、高い音質を維持しつ
つ圧縮率が向上された音声データの圧縮装置を提供する
ことである。Further, in the system shown in FIG. 25, the DCT
Although the amount of data is reduced in the coefficient quantization processing, there is a problem in that the quantization processing causes a quantization error. Therefore, an object of the present invention is to provide an audio data compression apparatus having an improved compression rate while maintaining high sound quality.

【００２９】本発明の他の目的は、高い音質を維持しつ
つデータ長を一定の長さ以下に固定化しかつデータ長を
短縮することができる音声データの圧縮方法を提供する
ことである。Another object of the present invention is to provide a method of compressing audio data, which can fix the data length to a fixed length or less and shorten the data length while maintaining high sound quality.

【００３０】本発明のさらに他の目的は、量子化誤差が
低減された画像データおよび音声データの圧縮装置を提
供することである。Still another object of the present invention is to provide a compression device for image data and audio data with reduced quantization error.

【００３１】[0031]

【課題を解決するための手段】[Means for Solving the Problems]

（１）第１の発明第１の発明に係るデータ圧縮装置は、配列手段および変
換手段を備える。配列手段は、一連の音声データを水平
方向および垂直方向に相関性を有するように二次元マト
リクスに配列する。変換手段は、配列手段により配列さ
れた二次元マトリクスの音声データに二次元離散コサイ
ン変換を行う。(1) First Invention A data compression apparatus according to the first invention comprises an arranging means and a converting means. The arranging means arranges a series of audio data in a two-dimensional matrix so as to have correlation in the horizontal direction and the vertical direction. The converting means performs a two-dimensional discrete cosine transform on the sound data of the two-dimensional matrix arranged by the arranging means.

【００３２】（２）第２の発明第２の発明に係るデータ圧縮方法は、一連の音声データ
を複数の周波数帯域に分割し、その分割により得られた
各周波数帯域の音声データを二次元マトリクスに配列す
る。各周波数帯域ごとに二次元マトリクスの音声データ
に二次元離散コサイン変換を行い、二次元離散コサイン
変換により得られた各周波数帯域ごとのデータから主た
るデータを所定の選択条件で選択する。符号データの長
さが所定の長さ以下に固定化されるように選択条件を修
正しつつ選択された主たるデータを符号化する。(2) Second Invention A data compression method according to the second invention divides a series of audio data into a plurality of frequency bands, and the audio data of each frequency band obtained by the division is a two-dimensional matrix. To array. The two-dimensional discrete cosine transform is performed on the sound data of the two-dimensional matrix for each frequency band, and the main data is selected from the data for each frequency band obtained by the two-dimensional discrete cosine transform under a predetermined selection condition. The selected main data is encoded while correcting the selection condition so that the length of the coded data is fixed to a predetermined length or less.

【００３３】（３）第３の発明第３の発明に係るデータ圧縮装置は、第１の変換手段、
分離手段、第２の変換手段、第１の符号化手段および第
２の符号化手段を備える。(3) Third Invention A data compression apparatus according to a third invention is a first conversion means,
It comprises a separating means, a second converting means, a first encoding means and a second encoding means.

【００３４】第１の変換手段は、ｍビットのデータに二
次元離散コサイン変換を行ってｎビットのデータを得
る。分離手段は、第１の変換手段により得られたｎビッ
トのデータを上位ｎ−ｋビットのデータおよび下位ｋビ
ットのデータに分離する。第２の変換手段は、分離手段
により得られた下位ｋビットのデータに二次元離散コサ
イン変換を行う。第１の符号化手段は、第２の変換手段
により得られたデータを符号化する。第２の符号化手段
は、分離手段により得られた上位ｎ−ｋビットのデータ
を符号化する。The first converting means performs two-dimensional discrete cosine conversion on the m-bit data to obtain n-bit data. The separating means separates the n-bit data obtained by the first converting means into upper n−k bit data and lower k bit data. The second converting means performs a two-dimensional discrete cosine transform on the lower k-bit data obtained by the separating means. The first encoding means encodes the data obtained by the second converting means. The second encoding means encodes the upper nk-bit data obtained by the separating means.

【００３５】第１の変換手段および第２の変換手段は１
つの変換手段で兼用してもよい。また、第１の符号化手
段および第２の符号化手段は１つの符号化手段で兼用し
てもよい。The first converting means and the second converting means are 1
The two conversion means may be used in common. Further, the first encoding means and the second encoding means may be combined with one encoding means.

【００３６】[0036]

【作用】[Action]

（１）第１の発明第１の発明に係るデータ圧縮装置においては、一連の音
声データが水平方向および垂直方向に相関性を有するよ
うに配列が行われるので、二次元離散コサイン変換にお
いて高い圧縮率が得られる。したがって、音声データの
圧縮において、高い音質を維持しつつ圧縮率を向上させ
ることが可能となる。(1) First invention In the data compression apparatus according to the first invention, since a series of audio data is arranged so as to have correlation in the horizontal direction and the vertical direction, high compression is achieved in the two-dimensional discrete cosine transform. The rate is obtained. Therefore, in compression of audio data, it is possible to improve the compression rate while maintaining high sound quality.

【００３７】（２）第２の発明第２の発明に係るデータ圧縮方法においては、一連の音
声データが複数の周波数帯域に分割され、各周波数帯域
ごとに二次元離散コサイン変換、主たるデータの選択お
よび符号化が行われる。その際、符号データのデータ長
が所定の長さ以下に固定化されるように、選択条件を修
正しつつ符号化が行われるので、高い音質を維持しつつ
データ長を一定の長さ以下に固定化しかつデータ長を短
縮することができる。(2) Second Invention In the data compression method according to the second invention, a series of audio data is divided into a plurality of frequency bands, two-dimensional discrete cosine transform is performed for each frequency band, and main data is selected. And encoding is performed. At that time, since the encoding is performed while correcting the selection condition so that the data length of the coded data is fixed to a predetermined length or less, the data length is kept below a certain length while maintaining high sound quality. It can be fixed and the data length can be shortened.

【００３８】（３）第３の発明第３の発明に係るデータ圧縮装置においては、ｍビット
のデータに二次元離散コサイン変換が行われ、ｎビット
（ｎ＞ｍ）のデータ（ＤＣＴ係数）が得られる。ｎビッ
トのデータは、上位ｎ−ｋビットのデータと下位ｋビッ
トのデータとに分離される。下位ｋビットのデータは再
び二次元離散コサイン変換された後、符号化される。上
位ｎ−ｋビットのデータはそのまま符号化される。(3) Third Invention In the data compression apparatus according to the third invention, two-dimensional discrete cosine transform is performed on m-bit data, and n-bit (n> m) data (DCT coefficient) is obtained. can get. The n-bit data is separated into upper n-k bit data and lower k-bit data. The data of the lower k bits is coded after being subjected to the two-dimensional discrete cosine transform again. The upper nk-bit data is encoded as it is.

【００３９】二次元離散コサイン変換により得られたｎ
ビットのデータを量子化因子で量子化した場合、下位ｋ
ビットのデータは量子化誤差となる。第３の発明に係る
データ圧縮装置においては、量子化誤差となる下位ｋビ
ットのデータに二次元離散コサイン変換を再び行うの
で、量子化誤差の発生がなくなり、データ圧縮による誤
差は二次元離散コサイン変換による誤差のみとなる。し
たがって、データ圧縮において、量子化誤差を低減する
ことができる。N obtained by the two-dimensional discrete cosine transform
When bit data is quantized by a quantization factor, lower k
Bit data becomes a quantization error. In the data compressing device according to the third aspect of the present invention, since the two-dimensional discrete cosine transform is performed again on the data of the lower k bits which is the quantizing error, the quantizing error does not occur, and the error due to the data compression is the two-dimensional discrete cosine. Only the error due to the conversion. Therefore, quantization error can be reduced in data compression.

【００４０】[0040]

【実施例】【Example】

（１）第１の実施例図１は第１の実施例によるＤＣＴ装置の構成を示すブロ
ック図である。図１において、ＤＣＴ装置１は、Ａ／Ｄ
変換器２、レジスタ３、マトリクス用メモリ４、アドレ
スセレクタ５および二次元ＤＣＴ部６を含む。(1) First Embodiment FIG. 1 is a block diagram showing the configuration of a DCT device according to the first embodiment. In FIG. 1, the DCT device 1 is an A / D
It includes a converter 2, a register 3, a matrix memory 4, an address selector 5, and a two-dimensional DCT unit 6.

【００４１】Ａ／Ｄ変換器２は、アナログ音声信号を一
連のディジタル音声データに変換してレジスタ３に与え
る。レジスタ３はその音声データを一時的に保持する。
マトリクス用メモリ４はＮ×Ｎ（Ｎは正の整数）の記憶
領域を有する。ここでは、マトリクス用メモリ４が８×
８の記憶領域を有するものとする。アドレスセレクタ５
はマトリクス用メモリ４のアドレス指定を行う。The A / D converter 2 converts the analog voice signal into a series of digital voice data and gives it to the register 3. The register 3 temporarily holds the audio data.
The matrix memory 4 has N × N (N is a positive integer) storage area. Here, the matrix memory 4 is 8 ×
Assume that it has 8 storage areas. Address selector 5
Addresses the matrix memory 4.

【００４２】レジスタ３に保持された音声データは、ア
ドレスセレクタ５のアドレス指定によりマトリクス用メ
モリ４内に順次書き込まれて配列される。また、アドレ
スセレクタ５のアドレス指定により順次音声データが読
み出され、二次元ＤＣＴ部６に与えられる。二次元ＤＣ
Ｔ部６は、８×８の音声データに二次元離散コサイン変
換を行い、８×８のＤＣＴ係数を出力する。The audio data held in the register 3 is sequentially written and arranged in the matrix memory 4 by the address designation of the address selector 5. Also, the audio data is sequentially read by the address designation of the address selector 5 and given to the two-dimensional DCT unit 6. Two-dimensional DC
The T unit 6 performs a two-dimensional discrete cosine transform on the 8 × 8 audio data and outputs an 8 × 8 DCT coefficient.

【００４３】次に、マトリクス用メモリ４における音声
データの配列方法を説明する。図２は音声データの第１
の配列方法を示す図である。図２において、マトリクス
用メモリ４のＸ方向のアドレスをＸ＝０，１，…，７と
し、Ｙ方向のアドレスをＹ＝０，１，…，７とする。ま
た、一連の音声データをＤ０，Ｄ１，Ｄ２，…とする。Next, a method of arranging audio data in the matrix memory 4 will be described. FIG. 2 shows the first voice data.
It is a figure which shows the arrangement method of. 2, the addresses in the X direction of the matrix memory 4 are X = 0, 1, ..., 7 and the addresses in the Y direction are Y = 0, 1 ,. Further, a series of audio data is D0, D1, D2, ....

【００４４】第１の配列方法では、Ｙ＝０の行に音声デ
ータＤ０，Ｄ１，…，Ｄ７をＸ方向に順に配列し、Ｙ＝
１の行に音声データＤ１，Ｄ２，…，Ｄ８をＸ方向に順
に配列し、Ｙ＝２の行に音声データＤ２，Ｄ３，…，Ｄ
９をＸ方向に順に配列する。同様にして、Ｙ＝３，４，
…，７の各行に先頭の音声データを１つずつシフトしな
がら連続する８個の音声データをＸ方向に順に配列す
る。In the first arrangement method, the audio data D0, D1, ..., D7 are arranged in order in the X direction in the row of Y = 0, and Y =
Audio data D1, D2, ..., D8 are sequentially arranged in the row 1 in the X direction, and audio data D2, D3 ,.
9 are arranged in order in the X direction. Similarly, Y = 3,4
, 8 consecutive voice data are arranged in order in the X direction while shifting the voice data at the beginning one by one in each row.

【００４５】その結果、１５個の音声データＤ０，Ｄ
１，…，Ｄ１４がＸ方向およびＹ方向に相関性を有する
ように二次元に配列される。例えば、Ｙ＝０の行には連
続する音声データＤ０，Ｄ１，…，Ｄ７が配列され、Ｘ
＝０の列にも連続する音声データＤ０，Ｄ１，…，Ｄ７
が配列される。この音声データの配列は、図１のアドレ
スセレクタ５によるアドレス指定により行われる。As a result, 15 pieces of voice data D0, D
, ..., D14 are arranged two-dimensionally so as to have a correlation in the X direction and the Y direction. For example, continuous audio data D0, D1, ..., D7 are arranged in a row of Y = 0, and X
= 0, continuous audio data D0, D1, ..., D7
Are arranged. The arrangement of the audio data is performed by the address designation by the address selector 5 in FIG.

【００４６】このように、一連の音声データがＸ方向お
よびＹ方向に相関性を有するように配列されているの
で、これらの音声データを図１の二次元ＤＣＴ部６によ
り二次元離散コサイン変換した場合、圧縮率が高くな
る。As described above, since a series of audio data are arranged so as to have correlation in the X and Y directions, these audio data are two-dimensional discrete cosine transformed by the two-dimensional DCT unit 6 in FIG. In this case, the compression rate becomes high.

【００４７】二次元ＤＣＴ部６により得られたＤＣＴ係
数は、必要に応じて図２６に示した量子化器２００によ
り量子化された後、エントロピー符号化器３００により
ハフマン符号化される。それにより、ハフマン符号が得
られる。なお、量子化器２００による量子化を行わなく
てもよい。The DCT coefficient obtained by the two-dimensional DCT unit 6 is quantized by the quantizer 200 shown in FIG. 26, if necessary, and then Huffman coded by the entropy coder 300. Thereby, a Huffman code is obtained. The quantizer 200 may not perform the quantization.

【００４８】図３は音声データの第２の配列方法を示す
図である。第２の配列方法では、Ｙ＝０の行に音声デー
タＤ０，Ｄ１，…，Ｄ７をＸ方向に順に配列し、Ｙ＝１
の行に音声データＤ２，Ｄ３，…Ｄ９をＸ方向に順に配
列し、Ｙ＝２の行に音声データＤ４，Ｄ５，…，Ｄ１１
をＸ方向に順に配列する。同様にして、先頭の音声デー
タを２つずつシフトしながらＹ＝３，４，…，７の各行
に連続する８個の音声データをＸ方向に順に配列する。FIG. 3 is a diagram showing a second arrangement method of voice data. In the second arrangement method, the audio data D0, D1, ..., D7 are arranged in order in the X direction in the row of Y = 0, and Y = 1.
, D9 are sequentially arranged in the X direction, and voice data D4, D5, ..., D11 are arranged in the Y = 2 line.
Are sequentially arranged in the X direction. Similarly, eight audio data consecutive in each row of Y = 3, 4, ..., 7 are arranged in the X direction while shifting the audio data at the beginning by two.

【００４９】その結果、Ｘ方向に連続する音声データが
順に配列され、Ｙ方向に１つおきの音声データが順に配
列される。このようにして、２２個の音声データＤ０，
Ｄ１，…，Ｄ２１がＸ方向およびＹ方向に相関性を有す
るように二次元マトリクスに配列される。As a result, audio data consecutive in the X direction are arranged in order, and every other audio data is arranged in order in the Y direction. In this way, 22 pieces of audio data D0,
, D21 are arranged in a two-dimensional matrix so as to have correlation in the X direction and the Y direction.

【００５０】第２の配列方法では、第１の配列方法に比
べてＹ方向の音声データの相関性が若干低くなるが、１
つのマトリクスに配列される音声データの数が増加す
る。図４は音声データの第３の配列方法を示す図であ
る。図４において、“＋”は２つの音声データの平均を
表わしている。In the second arrangement method, the correlation of the audio data in the Y direction is slightly lower than that in the first arrangement method.
The number of audio data arranged in one matrix increases. FIG. 4 is a diagram showing a third arrangement method of voice data. In FIG. 4, "+" represents the average of two audio data.

【００５１】第３の配列方法では、Ｙ＝０の行に音声デ
ータＤ０，Ｄ１の平均値，音声データＤ２，Ｄ３の平均
値，…，音声データＤ１４，Ｄ１５の平均値をＸ方向に
順に配列する。次に、Ｙ＝１の行に音声データＤ２，Ｄ
３の平均値，音声データＤ４，Ｄ５の平均値，…，音声
データＤ１６，Ｄ１７の平均値をＸ方向に順に配列す
る。さらに、Ｙ＝２の行に音声データＤ４，Ｄ５の平均
値，音声データＤ６，Ｄ７の平均値，…，音声データＤ
１８，Ｄ１９の平均値を順に配列する。同様にして、Ｙ
＝３，４，…，７の各行に連続する２つの音声データの
平均値を順に配列する。In the third arrangement method, the average value of the audio data D0 and D1, the average value of the audio data D2 and D3, ..., The average value of the audio data D14 and D15 are arranged in the X direction in order in the row of Y = 0. To do. Next, in the row of Y = 1, the voice data D2, D
3, the average value of the voice data D4, D5, ..., And the average value of the voice data D16, D17 are sequentially arranged in the X direction. Further, in the row of Y = 2, the average value of the voice data D4, D5, the average value of the voice data D6, D7, ..., The voice data D
The average values of 18 and D19 are arranged in order. Similarly, Y
= 3, 4, ..., 7 average values of two consecutive audio data are arranged in order.

【００５２】このようにして、音声データＤ０，Ｄ１，
…，Ｄ２９において各連続する２つの音声データの平均
値がＸ方向およびＹ方向に相関性を有するように配列さ
れる。第３の配列方法によれば、１つのマトリクスに配
列される音声データの数が３０個となる。In this way, the voice data D0, D1,
.., D29 are arranged so that the average values of the two consecutive audio data are correlated in the X and Y directions. According to the third arrangement method, the number of audio data arranged in one matrix is 30.

【００５３】図５、図６、図７、図８および図９はそれ
ぞれ１ｋＨｚ、２．５ｋＨｚ、５ｋＨｚ、９ｋＨｚおよ
び１７ｋＨｚの正弦波を４０ｋＨｚでサンプリングして
図１のＤＣＴ装置１により二次元離散コサイン変換した
結果を示す。5, FIG. 6, FIG. 7, FIG. 8 and FIG. 9 are two-dimensional discrete cosine by the DCT apparatus 1 of FIG. 1 by sampling sine waves of 1 kHz, 2.5 kHz, 5 kHz, 9 kHz and 17 kHz at 40 kHz. The result of conversion is shown.

【００５４】図５に示すように、１ｋＨｚの正弦波のＤ
ＣＴ係数は低域に集中している。また、図６に示すよう
に、２．５ｋＨｚの正弦波のＤＣＴ係数は低域から中域
に分布しており、図７に示すように、５ｋＨｚの正弦波
のＤＣＴ係数は主として中域に分布しており、図８に示
すように、９ｋＨｚの正弦波のＤＣＴ係数は中域から高
域に分布している。さらに、図９に示すように、１７ｋ
Ｈｚの正弦波のＤＣＴ係数は主として高域に分布してい
る。As shown in FIG. 5, a 1 kHz sine wave D
The CT coefficient is concentrated in the low range. Further, as shown in FIG. 6, the DCT coefficient of the sine wave of 2.5 kHz is distributed from the low range to the middle range, and as shown in FIG. 7, the DCT coefficient of the sine wave of 5 kHz is mainly distributed in the middle range. Therefore, as shown in FIG. 8, the DCT coefficients of the 9 kHz sine wave are distributed from the middle range to the high range. Furthermore, as shown in FIG.
The DCT coefficient of the sine wave of Hz is mainly distributed in the high range.

【００５５】このように、図１のＤＣＴ装置１によれ
ば、音声信号の周波数に応じてＤＣＴ係数がある周波数
領域に偏って現れるので、データの圧縮率が高くなる。
従来のＡＤＰＣＭによれば、音声データの圧縮比は１／
２となるが、第１の実施例によれば、音声データの圧縮
比は１／６となった。また、第１の実施例によれば、Ａ
ＤＰＣＭとほぼ同等のＳ／Ｎが得られ、音質はＡＤＰＣ
Ｍと同程度となった。As described above, according to the DCT device 1 of FIG. 1, the DCT coefficient appears in a certain frequency region in a biased manner in accordance with the frequency of the audio signal, resulting in a high data compression rate.
According to the conventional ADPCM, the compression ratio of audio data is 1 /
However, according to the first embodiment, the compression ratio of the audio data is 1/6. Further, according to the first embodiment, A
S / N almost equal to DPCM is obtained, and sound quality is ADPC
It was about the same as M.

【００５６】（２）第２の実施例図１０は本発明の第２の実施例によるデータ圧縮方法を
示す図である。まず、図１０の（ａ）に示すように、Ｑ
ＭＦ（Quartz Mirror Filter) 等を用いて音声データを
複数の帯域に分割する。ここでは、音声データの全帯域
を０〜２０ｋＨｚとする。０〜２０ｋＨｚの帯域を０〜
１０ｋＨｚの帯域と１０〜２０ｋＨｚの帯域とに分割す
る。さらに、０〜１０ｋＨｚの帯域を０〜５ｋＨｚの帯
域と５〜１０ｋＨｚの帯域とに分割し、１０〜２０ｋＨ
ｚの帯域を１０〜１５ｋＨｚの帯域と１５〜２０ｋＨｚ
の帯域とに分割する。このようにして、音声データを４
つの帯域に分割する。(2) Second Embodiment FIG. 10 is a diagram showing a data compression method according to a second embodiment of the present invention. First, as shown in FIG.
Audio data is divided into a plurality of bands using MF (Quartz Mirror Filter) or the like. Here, the entire band of the audio data is 0 to 20 kHz. 0 to 20 kHz band
It is divided into a band of 10 kHz and a band of 10 to 20 kHz. Furthermore, the band of 0 to 10 kHz is divided into a band of 0 to 5 kHz and a band of 5 to 10 kHz, and 10 to 20 kHz
Band of z is 10 to 15 kHz and 15 to 20 kHz
And the band of. In this way, the voice data
Divide into two bands.

【００５７】次に、図１０の（ｂ）に示すように、分割
された帯域ごとに音声データを二次元マトリクスに配列
し、二次元離散コサイン変換を行ってＤＣＴ係数を得
る。二次元マトリクスへの配列方法は、第１の実施例に
おける配列方法を用いる。Next, as shown in FIG. 10B, the voice data is arranged in a two-dimensional matrix for each divided band, and two-dimensional discrete cosine transform is performed to obtain DCT coefficients. The arrangement method in the two-dimensional matrix uses the arrangement method in the first embodiment.

【００５８】さらに、図１０の（ｃ）に示すように、二
次元離散コサイン変換により得られた各帯域のＤＣＴ係
数から主たるＤＣＴ係数を、次の（Ａ）〜（Ｃ）のいず
れかの方法を用いてあるいはそれらの方法を組み合わせ
て選択する。Further, as shown in (c) of FIG. 10, the main DCT coefficient from the DCT coefficients of each band obtained by the two-dimensional discrete cosine transform is calculated by any of the following methods (A) to (C). Or using a combination of these methods.

【００５９】（Ａ）あるしきい値以上のＤＣＴ係数のみ
を選択する。（Ｂ）ＤＣＴ係数の二次元マトリクスを低域から高域に
かけて複数の領域に分割し、各領域内で代表的なＤＣＴ
係数を選択する。(A) Select only DCT coefficients above a certain threshold. (B) A two-dimensional matrix of DCT coefficients is divided into a plurality of regions from a low band to a high band, and a typical DCT in each region.
Select a coefficient.

【００６０】（Ｃ）ある値以上の高周波数成分またはあ
る値以下の低周波成分を一律にカットして残りのＤＣＴ
係数を選択する。各帯域ごとに上記の（Ａ），（Ｂ），（Ｃ）のいずれか
または２つ以上を組み合わせてＤＣＴ係数の選択を行
う。(C) A high frequency component above a certain value or a low frequency component below a certain value is uniformly cut and the remaining DCT
Select a coefficient. For each band, any one of the above (A), (B), (C) or a combination of two or more is selected to select the DCT coefficient.

【００６１】次に、ある帯域において選択されたＤＣＴ
係数を仮にハフマン符号化し、得られた符号データのデ
ータ長からすべての帯域の符号データのデータ長を推測
する。そして、図１０の（ｄ），（ｅ）に示すように、
ＤＣＴ係数の選択条件を修正することにより符号データ
のデータ長が所定の長さ以下になるようにデータ長を固
定化する。すなわち、方法（Ａ）ではしきい値を変更
し、方法（Ｂ）では領域の分割方法を変更し、方法
（Ｃ）では高周波成分または低周波成分をカットする周
波数の値を変更する。修正された選択条件で、図１０の
（ｃ）示すＤＣＴ係数の選択を行う。Next, the DCT selected in a band
The coefficients are provisionally Huffman-encoded, and the data lengths of the code data in all bands are estimated from the obtained data lengths of the code data. Then, as shown in (d) and (e) of FIG.
By correcting the selection condition of the DCT coefficient, the data length of the coded data is fixed so that the data length becomes equal to or shorter than a predetermined length. That is, the method (A) changes the threshold value, the method (B) changes the area dividing method, and the method (C) changes the frequency value for cutting the high frequency component or the low frequency component. The DCT coefficient shown in FIG. 10C is selected under the modified selection condition.

【００６２】その後、図１０の（ｄ）に示すように、各
帯域において選択されたＤＣＴ係数をハフマン符号化
し、符号データを得る。このようにして、所定の長さ以
下のデータ長に固定化された符号データが得られる。Thereafter, as shown in FIG. 10D, the DCT coefficient selected in each band is Huffman-coded to obtain coded data. In this way, code data fixed to a data length equal to or shorter than the predetermined length is obtained.

【００６３】例えば、１ｋＨｚの正弦波をデータ圧縮す
る例を説明する。図１１は１ｋＨｚの音声データを第１
の実施例の図２に示す方法で二次元マトリクスに配列し
た状態を示す。図１１に示すように配列された音声デー
タに二次元離散コサイン変換を行うと、図１２に示すよ
うなＤＣＴ係数が得られる。For example, an example of data compression of a 1 kHz sine wave will be described. FIG. 11 shows the first 1 kHz audio data.
2 shows a state of being arranged in a two-dimensional matrix by the method shown in FIG. When the two-dimensional discrete cosine transform is performed on the audio data arranged as shown in FIG. 11, DCT coefficients as shown in FIG. 12 are obtained.

【００６４】図１２に示すＤＣＴ係数は、主として低域
Ｌ１に集中している。したがって、上記の方法（Ｃ）を
用いて高域Ｈ１の高周波成分を一律にカットし、低域Ｌ
１の“１５８９”，“−２９９”，“−２９９”を選択
する。それにより、図１３に示すようなＤＣＴ係数が得
られる。The DCT coefficients shown in FIG. 12 are mainly concentrated in the low range L1. Therefore, by using the above method (C), the high frequency component of the high frequency range H1 is uniformly cut to obtain the low frequency range L1.
1 “1589”, “−299”, and “−299” are selected. Thereby, the DCT coefficient as shown in FIG. 13 is obtained.

【００６５】図１３のＤＣＴ係数を二次元逆離散コサイ
ン変換により復号化すると、図１４に示すような音声デ
ータが得られる。図１４に示す復号化後の音声データ
は、図１１に示す二次元離散コサイン変換前の音声デー
タとほぼ一致していることがわかる。When the DCT coefficient of FIG. 13 is decoded by the two-dimensional inverse discrete cosine transform, the voice data as shown in FIG. 14 is obtained. It can be seen that the decoded voice data shown in FIG. 14 substantially matches the voice data before the two-dimensional discrete cosine transform shown in FIG.

【００６６】図１５は２．５ｋＨｚの音声データのＤＣ
Ｔ係数に上記の方法（Ｂ）を適用した状態を示す図であ
る。図１５に示すように、ＤＣＴ係数のマトリクスを低
域Ｌ２、中域Ｍ２および高域Ｈ２に分割し、低域Ｌ２の
代表値“３２０”および中域Ｍ２の代表値“−３８９”
を選択する。それにより、図１６に示すようなＤＣＴ係
数が得られる。FIG. 15 shows DC of 2.5 kHz audio data.
It is a figure which shows the state which applied the said method (B) to T coefficient. As shown in FIG. 15, the matrix of DCT coefficients is divided into a low range L2, a middle range M2, and a high range H2, and a representative value “320” of the low range L2 and a representative value “−389” of the middle range M2.
Select Thereby, the DCT coefficient as shown in FIG. 16 is obtained.

【００６７】図１７は５ｋＨｚの音声信号のＤＣＴ係数
に上記の（Ｂ）を適用した状態を示す図である。図１７
に示すように、ＤＣＴ係数のマトリクスを低域Ｌ３、中
域Ｍ３および高域Ｈ３に分割し、中域Ｍ３の代表値“８
６０”および“−６７０”を選択する。それにより、図
１８に示すＤＣＴ係数が得られる。FIG. 17 is a diagram showing a state in which the above (B) is applied to the DCT coefficient of a 5 kHz audio signal. FIG. 17
As shown in, the matrix of DCT coefficients is divided into a low range L3, a middle range M3, and a high range H3, and the representative value "8" of the middle range M3
60 "and" -670 "are selected to obtain the DCT coefficient shown in FIG.

【００６８】図１９は９ｋＨｚの音声信号のＤＣＴ係数
に上記の方法（Ｂ）を適用した状態を示す図である。図
１９に示すように、ＤＣＴ係数のマトリクスを低域Ｌ
４、中域Ｍ４および高域Ｈ４に分割し、中域Ｍ４の代表
値“９０４”および高域Ｈ４の代表値“−７２３”を選
択する。それにより、図２０に示すＤＣＴ係数が得られ
る。FIG. 19 is a diagram showing a state in which the above method (B) is applied to the DCT coefficient of a 9 kHz audio signal. As shown in FIG. 19, the DCT coefficient matrix is set to the low range L.
4, the middle range M4 and the high range H4 are divided, and the representative value "904" of the middle range M4 and the representative value "-723" of the high range H4 are selected. Thereby, the DCT coefficient shown in FIG. 20 is obtained.

【００６９】図２１は１７ｋＨｚの音声データのＤＣＴ
係数に上記の方法（Ｃ）を適用した状態を示す図であ
る。図２１に示すように、ＤＣＴ係数のマトリクスにお
いて低域Ｌ５の周波数成分を一律にカットし、高域Ｈ５
の“７０７”，“７０８”，“−４７７”を選択する。
それにより、図２２に示すＤＣＴ係数が得られる。FIG. 21 shows a DCT of 17 kHz audio data.
It is a figure which shows the state which applied the said method (C) to a coefficient. As shown in FIG. 21, in the matrix of DCT coefficients, the frequency components in the low frequency band L5 are uniformly cut, and the high frequency band H5 is cut.
“707”, “708”, and “−477” are selected.
As a result, the DCT coefficient shown in FIG. 22 is obtained.

【００７０】（３）第３の実施例図２３は第３の実施例によるデータ圧縮装置の構成の一
例を示すブロック図である。図２３に示すように、デー
タ圧縮装置２０は、二次元ＤＣＴ部２１，２３、分離部
２２およびハフマン符号化部２４，２５を含む。(3) Third Embodiment FIG. 23 is a block diagram showing an example of the arrangement of a data compression apparatus according to the third embodiment. As shown in FIG. 23, the data compression device 20 includes two-dimensional DCT units 21 and 23, a separation unit 22, and Huffman coding units 24 and 25.

【００７１】まず、データ圧縮装置２０による８ビット
の画像データのデータ圧縮を説明する。二次元ＤＣＴ部
２１は、８ビットの入力データに二次元離散コサイン変
換を行い、１１ビットのデータ（ＤＣＴ係数）を出力す
る。分離部２２は、二次元ＤＣＴ部２１から出力された
１１ビットのデータを量子化因子“１２８”で割ること
により、１１ビットのデータを上位７ビットのデータお
よび下位４ビットのデータに分離する。下位４ビットの
データが量子化誤差となる。First, the data compression of 8-bit image data by the data compression device 20 will be described. The two-dimensional DCT unit 21 performs two-dimensional discrete cosine transform on the 8-bit input data and outputs 11-bit data (DCT coefficient). The separation unit 22 divides the 11-bit data output from the two-dimensional DCT unit 21 by the quantization factor “128” to separate the 11-bit data into upper 7-bit data and lower 4-bit data. The lower 4-bit data becomes the quantization error.

【００７２】二次元ＤＣＴ部２３は、分離部２２により
得られた下位４ビットのデータに二次元離散コサイン変
換を行い、７ビットのデータ（ＤＣＴ係数）を得る。ハ
フマン符号化部２４は、二次元ＤＣＴ部２３により得ら
れた７ビットのデータにハフマン符号化を行い、下位側
の符号データＤＤを得る。The two-dimensional DCT section 23 performs a two-dimensional discrete cosine transform on the lower 4-bit data obtained by the separating section 22 to obtain 7-bit data (DCT coefficient). The Huffman coding unit 24 performs Huffman coding on the 7-bit data obtained by the two-dimensional DCT unit 23 to obtain lower-order code data DD.

【００７３】一方、ハフマン符号化部２５は、分離部２
２により得られた上位７ビットのデータにハフマン符号
化を行い、上位側の符号データＵＤを得る。セレクタ２
６は、ハフマン符号化部２５から得られた上位側の符号
データＵＤとハフマン符号化部２４により得られた下位
側の符号データＤＤとを８×８のブロックごとに交互に
選択して伝送する。On the other hand, the Huffman coding unit 25 has the separating unit 2
Huffman coding is performed on the higher-order 7-bit data obtained in step 2 to obtain higher-order code data UD. Selector 2
6 alternately selects the higher-order code data UD obtained from the Huffman coding unit 25 and the lower-order code data DD obtained by the Huffman coding unit 24 for each 8 × 8 block, and transmits the block. .

【００７４】図２３において、括弧内のビット数は音声
データ（オーディオデータ）のビット数を示す。次に、
データ圧縮装置２０による１６ビットの音声データのデ
ータ圧縮を説明する。In FIG. 23, the number of bits in parentheses indicates the number of bits of audio data (audio data). next,
Data compression of 16-bit audio data by the data compression device 20 will be described.

【００７５】二次元ＤＣＴ部２１は、１６ビットの音声
データに二次元離散コサイン変換を行い、１９ビットの
データ（ＤＣＴ係数）を出力する。分離部２２は、二次
元ＤＣＴ部２１から出力された１９ビットのデータを上
位１１ビットのデータと下位８ビットのデータとに分離
する。二次元ＤＣＴ部２３は、分離部２２により得られ
た下位８ビットのデータに二次元離散コサイン変換を行
い、１１ビットのデータ（ＤＣＴ係数）を得る。ハフマ
ン符号化部２４は、二次元ＤＣＴ部２３により得られた
１１ビットのデータにハフマン符号化を行い、下位側の
符号データＤＤを出力する。The two-dimensional DCT section 21 performs two-dimensional discrete cosine transform on 16-bit voice data and outputs 19-bit data (DCT coefficient). The separation unit 22 separates the 19-bit data output from the two-dimensional DCT unit 21 into upper 11-bit data and lower 8-bit data. The two-dimensional DCT unit 23 performs two-dimensional discrete cosine transform on the lower 8-bit data obtained by the separating unit 22 to obtain 11-bit data (DCT coefficient). The Huffman coding unit 24 performs Huffman coding on the 11-bit data obtained by the two-dimensional DCT unit 23, and outputs the lower code data DD.

【００７６】一方、ハフマン符号化部２５は、分離部２
２により得られた上位１１ビットのデータにハフマン符
号化を行い、上位側の符号データをＵＤを出力する。セ
レクタ２６は、ハフマン符号化部２５により得られた上
位側の符号データＵＤとハフマン符号化部２４により得
られた下位側の符号データＤＤとを８×８のブロックご
とに交互に選択して伝送する。On the other hand, the Huffman coding unit 25 has the separating unit 2
Huffman coding is performed on the upper 11-bit data obtained by 2, and UD is output as the upper-side code data. The selector 26 alternately selects the higher-order code data UD obtained by the Huffman coding unit 25 and the lower-order code data DD obtained by the Huffman coding unit 24 for each 8 × 8 block and transmits it. To do.

【００７７】このように、本実施例のデータ圧縮装置２
０では、従来の量子化において量子化誤差となる下位４
ビットのデータまたは下位８ビットのデータに再び二次
元離散コサイン変換を行った後、ハフマン符号化してい
るので、データ圧縮における誤差が二次元離散コサイン
変換による誤差のみとなる。したがって、高品質な画像
データまたは音声データが得られる。In this way, the data compression device 2 of this embodiment is
At 0, the lower 4 which is a quantization error in the conventional quantization.
Since the Huffman coding is performed again after the two-dimensional discrete cosine transform is performed on the bit data or the lower 8-bit data, the error in the data compression is only the error due to the two-dimensional discrete cosine transform. Therefore, high quality image data or audio data can be obtained.

【００７８】図２４は本発明の第３の実施例によるデー
タ圧縮装置の構成の他の例を示すブロック図である。図
２４において、データ圧縮装置３０は、セレクタ３１，
３４、二次元ＤＣＴ部３２、分離部３３およびハフマン
符号化部３５を含む。FIG. 24 is a block diagram showing another example of the structure of the data compression apparatus according to the third embodiment of the present invention. In FIG. 24, the data compression device 30 includes a selector 31,
34, a two-dimensional DCT unit 32, a separation unit 33, and a Huffman coding unit 35.

【００７９】ここでは、入力データが８ビットの画像デ
ータの場合を説明する。まず、セレクタ３１が、その８
ビットの入力データを選択して出力する。二次元ＤＣＴ
部３２は、セレクタ３１から出力される８ビットの入力
データに二次元離散コサイン変換を行い、１１ビットの
データ（ＤＣＴ係数）を出力する。分離部３３は、二次
元ＤＣＴ部３２から出力された１１ビットのデータを上
位７ビットのデータと下位４ビットのデータとに分離
し、下位４ビットのデータをセレクタ３１に戻し、上位
７ビットのデータをセレクタ３４に与える。Here, the case where the input data is 8-bit image data will be described. First, the selector 31
Selects and outputs bit input data. Two-dimensional DCT
The unit 32 performs a two-dimensional discrete cosine transform on the 8-bit input data output from the selector 31 and outputs 11-bit data (DCT coefficient). The separating unit 33 separates the 11-bit data output from the two-dimensional DCT unit 32 into upper 7-bit data and lower 4-bit data, returns the lower 4-bit data to the selector 31, and returns the upper 7-bit data. The data is given to the selector 34.

【００８０】セレクタ３４は、まず、分離部３３から出
力された上位７ビットのデータを選択して出力する。ハ
フマン符号化部３５は、セレクタ３４から出力された上
位７ビットのデータにハフマン符号化を行い、符号デー
タを出力する。The selector 34 first selects and outputs the upper 7-bit data output from the separating unit 33. The Huffman coding unit 35 performs Huffman coding on the upper 7-bit data output from the selector 34, and outputs the coded data.

【００８１】一方、セレクタ３１は、分離部３３により
得られた下位４ビットのデータを選択して出力する。二
次元ＤＣＴ部３２は、セレクタ３１から出力された下位
４ビットのデータに二次元離散コサイン変換を行い、７
ビットのデータ（ＤＣＴ係数）を出力し、セレクタ３４
に与える。セレクタ３４は、二次元ＤＣＴ部３２から出
力された７ビットのデータを選択して出力する。ハフマ
ン符号化部３５は、セレクタ３４から出力された７ビッ
トのデータにハフマン符号化を行い、符号データを出力
する。On the other hand, the selector 31 selects and outputs the lower 4-bit data obtained by the separating unit 33. The two-dimensional DCT unit 32 performs a two-dimensional discrete cosine transform on the lower 4-bit data output from the selector 31,
The bit data (DCT coefficient) is output, and the selector 34
Give to. The selector 34 selects and outputs the 7-bit data output from the two-dimensional DCT unit 32. The Huffman coding unit 35 performs Huffman coding on the 7-bit data output from the selector 34, and outputs the coded data.

【００８２】このように、図２４のデータ圧縮装置３０
においては、セレクタ３１，３４を用いることにより二
次元ＤＣＴ部３２が図２３のデータ圧縮装置２０におけ
る２つの二次元ＤＣＴ部２１，２３の働きを行い、ハフ
マン符号化部３５が２つのハフマン符号化部２４，２５
の働きを行う。As described above, the data compression device 30 of FIG.
23, the two-dimensional DCT unit 32 functions as the two two-dimensional DCT units 21 and 23 in the data compression device 20 of FIG. 23 by using the selectors 31 and 34, and the Huffman coding unit 35 performs two Huffman coding. Parts 24 and 25
Perform the work of.

【００８３】なお、図２４においても、括弧内は音声デ
ータのビット数を示す。入力データが１６ビットの音声
データの場合にも、同様にしてデータ圧縮が行われる。In FIG. 24, the number of bits of audio data is shown in parentheses. Even when the input data is 16-bit audio data, the data compression is performed similarly.

【００８４】[0084]

【発明の効果】第１の発明によれば、一連の音声データ
が水平方向および垂直方向に相関性を有するように二次
元マトリクスに配列され、二次元離散コサイン変換が行
われるので、データの圧縮率が高くなる。したがって、
高い音質を維持しつつビットレートを改善することがで
きる。According to the first aspect of the present invention, a series of audio data are arranged in a two-dimensional matrix so as to have correlation in the horizontal and vertical directions, and the two-dimensional discrete cosine transform is performed. Therefore, data compression is performed. The rate is high. Therefore,
It is possible to improve the bit rate while maintaining high sound quality.

【００８５】第２の発明によれば、一連の音声データが
複数の周波数帯域に分割され、各周波数帯域ごとに二次
元離散コサイン変換および主たるデータの選択が行わ
れ、かつ符号データのデータ長が所定の長さ以下になる
ように選択条件を修正しつつ符号化が行われるので、低
ビットレートで良好な音質が得られるとともに、符号デ
ータのデータ長が一定の長さ以下に固定化され、かつデ
ータ長が短縮される。According to the second invention, a series of voice data is divided into a plurality of frequency bands, two-dimensional discrete cosine transform and main data selection are performed for each frequency band, and the data length of code data is Since encoding is performed while correcting the selection condition so that the length is equal to or less than a predetermined length, good sound quality is obtained at a low bit rate, and the data length of the code data is fixed to a certain length or less, And the data length is shortened.

【００８６】第３の発明によれば、二次元離散コサイン
変換により得られたデータのうち量子化誤差となるデー
タにさらに二次元離散コサイン変換が行われた後、符号
化が行われるので、データ圧縮の誤差が二次元離散コサ
イン変換による誤差のみとなる。したがって、高品質な
データ圧縮が可能となる。According to the third aspect of the invention, of the data obtained by the two-dimensional discrete cosine transform, the data that is the quantization error is further subjected to the two-dimensional discrete cosine transform and then encoded, so that the data is The compression error is only the error due to the two-dimensional discrete cosine transform. Therefore, high quality data compression becomes possible.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の第１の実施例によるＤＣＴ装置の構成
を示すブロック図である。FIG. 1 is a block diagram showing the configuration of a DCT device according to a first embodiment of the present invention.

【図２】第１の実施例における音声データの第１の配列
方法を示す図である。FIG. 2 is a diagram showing a first arrangement method of audio data in the first embodiment.

【図３】第１の実施例における音声データの第２の配列
方法を示す図である。FIG. 3 is a diagram showing a second arrangement method of audio data according to the first embodiment.

【図４】第１の実施例における音声データの第３の配列
方法を示す図である。FIG. 4 is a diagram showing a third arrangement method of audio data according to the first embodiment.

【図５】１ｋＨｚの正弦波を二次元離散コサイン変換す
ることにより得られたＤＣＴ係数の一例を示す図であ
る。FIG. 5 is a diagram showing an example of DCT coefficients obtained by two-dimensional discrete cosine transform of a 1 kHz sine wave.

【図６】２．５ｋＨｚの正弦波を二次元離散コサイン変
換することにより得られたＤＣＴ係数の一例を示す図で
ある。FIG. 6 is a diagram showing an example of DCT coefficients obtained by subjecting a 2.5 kHz sine wave to two-dimensional discrete cosine transform.

【図７】５ｋＨｚの正弦波を二次元離散コサイン変換す
ることにより得られたＤＣＴ係数の一例を示す図であ
る。FIG. 7 is a diagram showing an example of DCT coefficients obtained by subjecting a 5 kHz sine wave to two-dimensional discrete cosine transform.

【図８】９ｋＨｚの正弦波を二次元離散コサイン変換す
ることにより得られたＤＣＴ係数の一例を示す図であ
る。FIG. 8 is a diagram showing an example of DCT coefficients obtained by performing a two-dimensional discrete cosine transform on a 9 kHz sine wave.

【図９】１７ｋＨｚの正弦波を二次元離散コサイン変換
することにより得られたＤＣＴ係数の一例を示す図であ
る。FIG. 9 is a diagram showing an example of DCT coefficients obtained by subjecting a 17 kHz sine wave to two-dimensional discrete cosine transform.

【図１０】本発明の第２の実施例によるデータ圧縮方法
を示す図である。FIG. 10 is a diagram showing a data compression method according to a second embodiment of the present invention.

【図１１】１ｋＨｚの音声データを第１の実施例におけ
る第１の配列方法により配列した状態を示す図である。FIG. 11 is a diagram showing a state in which 1 kHz audio data is arranged by the first arrangement method in the first embodiment.

【図１２】図１１の音声データを二次元離散コサイン変
換することにより得られたＤＣＴ係数を示す図である。12 is a diagram showing DCT coefficients obtained by performing two-dimensional discrete cosine transform on the audio data of FIG.

【図１３】図１２のＤＣＴ係数から主たるＤＣＴ係数を
第２の実施例における選択方法により選択した状態を示
す図である。FIG. 13 is a diagram showing a state in which a main DCT coefficient is selected from the DCT coefficients shown in FIG. 12 by the selection method according to the second embodiment.

【図１４】図１３のＤＣＴ係数のハフマン符号化により
得られた符号データを二次元逆離散コサイン変換するこ
とにより復号された音声データを示す図である。14 is a diagram showing speech data decoded by performing two-dimensional inverse discrete cosine transform on the code data obtained by the Huffman coding of the DCT coefficient shown in FIG.

【図１５】２．５ｋＨｚの音声データを二次元離散コサ
イン変換することにより得られたＤＣＴ係数を示す図で
ある。FIG. 15 is a diagram showing DCT coefficients obtained by two-dimensional discrete cosine transform of 2.5 kHz voice data.

【図１６】図１５に示すＤＣＴ係数から主たるＤＣＴ係
数を第２の実施例における選択方法により選択した状態
を示す図である。16 is a diagram showing a state in which a main DCT coefficient is selected from the DCT coefficients shown in FIG. 15 by the selection method in the second embodiment.

【図１７】５ｋＨｚの音声データを二次元離散コサイン
変換することにより得られたＤＣＴ係数を示す図であ
る。FIG. 17 is a diagram showing DCT coefficients obtained by two-dimensional discrete cosine transform of 5 kHz voice data.

【図１８】図１７のＤＣＴ係数から主たるＤＣＴ係数を
第２の実施例における選択方法により選択した状態を示
す図である。FIG. 18 is a diagram showing a state in which a main DCT coefficient is selected from the DCT coefficients of FIG. 17 by the selection method according to the second embodiment.

【図１９】９ｋＨｚの音声データを二次元離散コサイン
変換することにより得られたＤＣＴ係数を示す図であ
る。FIG. 19 is a diagram showing DCT coefficients obtained by two-dimensional discrete cosine transform of 9 kHz voice data.

【図２０】図１９のＤＣＴ係数から主たるＤＣＴ係数を
第２の実施例における選択方法により選択した状態を示
す図である。20 is a diagram showing a state in which a main DCT coefficient is selected from the DCT coefficients of FIG. 19 by the selection method in the second embodiment.

【図２１】１７ｋＨｚの音声データを二次元離散コサイ
ン変換することにより得られたＤＣＴ係数を示す図であ
る。FIG. 21 is a diagram showing DCT coefficients obtained by two-dimensional discrete cosine transform of 17 kHz audio data.

【図２２】図２１のＤＣＴ係数から主たるＤＣＴ係数を
第２の実施例における選択方法により選択した状態を示
す図である。FIG. 22 is a diagram showing a state in which a main DCT coefficient is selected from the DCT coefficients of FIG. 21 by the selection method in the second embodiment.

【図２３】本発明の第３の実施例によるデータ圧縮装置
の構成の一例を示すブロック図である。FIG. 23 is a block diagram showing an example of a configuration of a data compression device according to a third embodiment of the present invention.

【図２４】本発明の第３の実施例によるデータ圧縮装置
の構成の他の例を示すブロック図である。FIG. 24 is a block diagram showing another example of the configuration of the data compression apparatus according to the third embodiment of the present invention.

【図２５】ＤＣＴ方式の画像データ圧縮システムの基本
構成を示すブロック図である。FIG. 25 is a block diagram showing the basic configuration of a DCT image data compression system.

【図２６】画像データのブロック化を示す図である。FIG. 26 is a diagram showing how image data is divided into blocks.

【図２７】８×８画素ブロックおよび二次元離散コサイ
ン変換されたブロックを示す図である。FIG. 27 is a diagram showing an 8 × 8 pixel block and a block subjected to a two-dimensional discrete cosine transform.

【図２８】二次元離散コサイン変換を用いた従来の音声
データの圧縮方法を示す図である。FIG. 28 is a diagram showing a conventional audio data compression method using a two-dimensional discrete cosine transform.

【符号の説明】[Explanation of symbols]

１ＤＣＴ装置２Ａ／Ｄ変換器３レジスタ４マトリクス用メモリ５アドレスセレクタ６二次元ＤＣＴ部２０，３０データ圧縮装置２１，２３，３２二次元ＤＣＴ部２２，３３分離部２４，２５，３５ハフマン符号化部２６，３１，３４セレクタなお、各図中同一符号は同一または相当部分を示す。 1 DCT device 2 A / D converter 3 Register 4 Memory for matrix 5 Address selector 6 Two-dimensional DCT unit 20,30 Data compression device 21,23,32 Two-dimensional DCT unit 22,33 Separation unit 24,25,35 Huffman code Converting unit 26, 31, 34 Selector In the drawings, the same reference numerals indicate the same or corresponding parts.

Claims

Translated fromJapanese

【特許請求の範囲】[Claims]

【請求項１】一連の音声データを水平方向および垂直
方向に相関性を有するように二次元マトリクスに配列す
る配列手段と、前記配列手段により配列された二次元マトリクスの音声
データに二次元離散コサイン変換を行う変換手段とを備
えたデータ圧縮装置。1. Arrangement means for arranging a series of audio data in a two-dimensional matrix so as to have correlation in the horizontal and vertical directions, and two-dimensional discrete cosine to the audio data of the two-dimensional matrix arranged by the arrangement means. A data compression device comprising a conversion means for performing conversion.

【請求項２】一連の音声データを複数の周波数帯域に
分割し、前記分割により得られた各周波数帯域の音声デ
ータを二次元マトリクスに配列し、各周波数帯域ごとに
二次元マトリクスの音声データに二次元離散コサイン変
換を行い、前記二次元離散コサイン変換により得られた
各周波数帯域ごとのデータから主たるデータを所定の選
択条件に基づいて選択し、符号データの長さが所定の長
さ以下になるように前記選択条件を修正しつつ選択され
た主たるデータを符号化することを特徴とするデータ圧
縮方法。2. A series of audio data is divided into a plurality of frequency bands, the audio data of each frequency band obtained by the division is arranged in a two-dimensional matrix, and each frequency band is converted into a two-dimensional matrix of audio data. Performing a two-dimensional discrete cosine transform, select the main data from the data for each frequency band obtained by the two-dimensional discrete cosine transform based on a predetermined selection condition, the length of the code data is less than a predetermined length A data compression method, characterized in that the selected main data is encoded while the selection condition is corrected so that

【請求項３】ｍビットのデータに二次元離散コサイン
変換を行ってｎビットのデータを得る第１の変換手段
と、前記第１の変換手段により得られたデータを上位ｎ−ｋ
ビットのデータおよび下位ｋビットのデータに分離する
分離手段と、前記分離手段により得られた前記下位ｋビットのデータ
に二次元離散コサイン変換を行う第２の変換手段と、前記第２の変換手段により得られたデータを符号化する
第１の符号化手段と、前記分離手段により得られた前記上位ｎ−ｋビットのデ
ータを符号化する第２の符号化手段とを備えたデータ圧
縮装置。3. A first conversion unit for performing two-dimensional discrete cosine transformation on m-bit data to obtain n-bit data, and the data obtained by the first conversion unit is higher-order nk.
Separation means for separating the low-order k-bit data and the low-order k-bit data, a second conversion means for performing a two-dimensional discrete cosine transform on the low-order k-bit data obtained by the separation means, and the second conversion means. A data compression apparatus comprising: a first encoding unit that encodes the data obtained by the above; and a second encoding unit that encodes the data of the higher order nk bits obtained by the separating unit.