JP6904209B2

Movatterモバイル変換

Info

Publication number: JP6904209B2
Application number: JP2017199673A
Authority: JP
Inventors: 鈴木　政直; 政直鈴木; 晃釜野; 洋平岸; 美由紀白川
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2017-07-28
Filing date: 2017-10-13
Publication date: 2021-07-14
Anticipated expiration: 2037-10-13
Also published as: JP2019028419A

Description

本発明は、オーディオ符号化装置等に関する。 The present invention relates to an audio coding device or the like.

近年、テレビ放送、ラジオ放送、インターネットラジオ、音楽配信等にＳＢＲ（Spectral Band Replication）が用いられている。ＳＢＲは、音声・音楽などの音信号を圧縮・伸長する符号化技術である。 In recent years, SBR (Spectral Band Replication) has been used for television broadcasting, radio broadcasting, Internet radio, music distribution, and the like. SBR is a coding technology that compresses and decompresses sound signals such as voice and music.

ＳＢＲに基づく符号化を行う従来の符号化装置と、従来の復号装置について説明する。 A conventional coding device that performs coding based on SBR and a conventional decoding device will be described.

図３５は、従来の符号化装置の一例を示す図である。図３５に示すように、従来の符号化装置１０は、低域信号抽出部１１、低域符号化部１２、高域情報抽出部１３、高域符号化部１４、多重化部１５を有する。 FIG. 35 is a diagram showing an example of a conventional coding device. As shown in FIG. 35, theconventional coding device 10 includes a low-frequency signal extraction unit 11, a low-frequency coding unit 12, a high-frequency information extraction unit 13, a high-frequency coding unit 14, and a multiplexing unit 15.

低域信号抽出部１１は、外部装置から音信号を取得し、音信号の低域信号を抽出する処理部である。低域信号抽出部１１は、低域信号を、低域符号化部１２に出力する。 The low-frequency signal extraction unit 11 is a processing unit that acquires a sound signal from an external device and extracts the low-frequency signal of the sound signal. The low frequency signal extraction unit 11 outputs the low frequency signal to the lowfrequency coding unit 12.

図３６は、音信号の周波数スペクトルを示す図である。図３６の横軸は、周波数に対応する軸であり、縦軸は、音信号の電力（値）に対応する軸である。たとえば、所定の周波数未満の周波数帯域を「低域」とし、所定の周波数以上の周波数帯域を「高域」とする。低域の音信号を「低域信号」とし、高域の音信号を「高域信号」とする。図３６に示す例では、帯域５ａが低域となり、帯域５ｂが高域となる。 FIG. 36 is a diagram showing a frequency spectrum of a sound signal. The horizontal axis of FIG. 36 is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value) of the sound signal. For example, a frequency band below a predetermined frequency is defined as a "low frequency band", and a frequency band above a predetermined frequency is defined as a "high frequency band". The low-frequency sound signal is referred to as a "low-frequency signal", and the high-frequency sound signal is referred to as a "high-frequency signal". In the example shown in FIG. 36, theband 5a is the low frequency band and the band 5b is the high frequency band.

低域符号化部１２は、低域信号を符号化することで、「低域符号」を生成する処理部である。たとえば、低域符号化部１２は、ＡＡＣ（Advanced Audio Coding）に基づいて、符号化を行う。低域符号化部１２は、低域符号を、多重化部１５に出力する。 The low-frequency coding unit 12 is a processing unit that generates a "low-frequency code" by encoding a low-frequency signal. For example, the lowfrequency coding unit 12 performs coding based on AAC (Advanced Audio Coding). The lowfrequency coding unit 12 outputs the low frequency code to the multiplexing unit 15.

高域情報抽出部１３は、外部装置から音信号を取得し、音信号に基づいて高域情報を抽出する処理部である。高域情報抽出部１３は、高域情報を高域符号化部１４に出力する。 The high-frequency information extraction unit 13 is a processing unit that acquires a sound signal from an external device and extracts high-frequency information based on the sound signal. The high-frequency information extraction unit 13 outputs high-frequency information to the high-frequency coding unit 14.

高域情報には、包絡電力、トーン周波数、周波数分解能が含まれる。包絡電力は、音信号の周波数スペクトルの高域について、包絡線を示すものであり、たとえば、図３６の包絡電力６ａに対応する。 High frequency information includes envelope power, tone frequency, and frequency resolution. The envelope power indicates an envelope in the high frequency range of the frequency spectrum of the sound signal, and corresponds to, for example, theenvelope power 6a in FIG. 36.

トーン周波数は、トーンの存在する周波数を示す。たとえば、トーンは、電力の値が突出して大きい電力である。図３６に示す例では、トーン６ｂに示すものとなり、トーン周波数は、線分７に対応する周波数となる。周波数分解能は、周波数の分解能（最小単位）を示すものである。 The tone frequency indicates the frequency at which the tone is present. For example, a tone is a power with an outstandingly large power value. In the example shown in FIG. 36, thetone 6b is shown, and the tone frequency is a frequency corresponding to theline segment 7. The frequency resolution indicates the resolution (minimum unit) of the frequency.

高域符号化部１４は、高域情報を符号化することで、「高域符号」を生成する処理部である。高域符号化部１４は、高域符号を、多重化部１５に出力する。 The high-frequency coding unit 14 is a processing unit that generates a “high-frequency code” by encoding high-frequency information. The highfrequency coding unit 14 outputs the high frequency code to the multiplexing unit 15.

多重化部１５は、低域符号と、高域符号とを多重化することで、ストリームを生成する処理部である。多重化部１５は、ストリームを、ネットワークを介して、復号装置に送信する。 The multiplexing unit 15 is a processing unit that generates a stream by multiplexing the low frequency code and the high frequency code. The multiplexing unit 15 transmits the stream to the decoding device via the network.

図３７は、従来の復号装置の一例を示す図である。図３７に示すように、従来の復号装置２０は、分離部２１、低域復号部２２、高域生成部２３、高域復号部２４、高域整形部２５を有する。 FIG. 37 is a diagram showing an example of a conventional decoding device. As shown in FIG. 37, theconventional decoding device 20 includes aseparation unit 21, a lowfrequency decoding unit 22, a highfrequency generation unit 23, a highfrequency decoding unit 24, and a highfrequency shaping unit 25.

分離部３１は、符号化装置１０からストリームを取得し、取得したストリームを、低域符号と高域符号とに分離する処理部である。分離部２１は、低域符号を、低域復号部２２に出力する。分離部２１は、高域符号を、高域復号部２４に出力する。 The separation unit 31 is a processing unit that acquires a stream from thecoding device 10 and separates the acquired stream into a low frequency code and a high frequency code. Theseparation unit 21 outputs the low frequency code to the lowfrequency decoding unit 22. Theseparation unit 21 outputs the high frequency code to the highfrequency decoding unit 24.

低域復号部２２は、低域符号を復号することで、低域信号を抽出する処理部である。低域復号部２２は、低域信号を、高域生成部２３に出力する。 The low-frequency decoding unit 22 is a processing unit that extracts a low-frequency signal by decoding a low-frequency code. The lowfrequency decoding unit 22 outputs a low frequency signal to the highfrequency generation unit 23.

高域生成部２３は、低域信号の波形を、高域側に複製することで、高域信号を生成する処理部である。高域生成部２３は、低域信号と高域信号とを合わせた信号情報を、高域整形部２５に出力する。 The highfrequency generation unit 23 is a processing unit that generates a high frequency signal by duplicating the waveform of the low frequency signal on the high frequency side. The high-frequency generation unit 23 outputs signal information including the low-frequency signal and the high-frequency signal to the high-frequency shaping unit 25.

高域復号部２４は、高域符号を復号することで、高域情報を抽出する処理部である。高域復号部２４は、高域情報を、高域整形部２５に出力する。上記のように、高域情報には、包絡電力、トーン周波数、周波数分解能が含まれる。 The high-frequency decoding unit 24 is a processing unit that extracts high-frequency information by decoding a high-frequency code. The highfrequency decoding unit 24 outputs high frequency information to the highfrequency shaping unit 25. As mentioned above, the high frequency information includes the envelope power, the tone frequency, and the frequency resolution.

高域整形部２５は、高域情報を基にして、信号情報の高域信号を整形する処理部である。高域整形部２５は、整形した信号情報を、外部装置に出力する。 The high-frequency shaping unit 25 is a processing unit that shapes the high-frequency signal of the signal information based on the high-frequency information. The highfrequency shaping unit 25 outputs the shaped signal information to an external device.

図３８は、従来の復号装置の処理を説明するための図である。図３８のステップＳ１０〜Ｓ１１に示す周波数スペクトルの横軸は周波数に対応する軸であり、縦軸は電力（値）に対応する軸である。図３８のステップＳ１０について説明する。復号装置２０の高域生成部２３は、低域信号８ａの波形を、高域側に複製することで、高域信号８ｂを生成する。 FIG. 38 is a diagram for explaining the processing of the conventional decoding device. The horizontal axis of the frequency spectrum shown in steps S10 to S11 of FIG. 38 is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value). Step S10 of FIG. 38 will be described. The highfrequency generation unit 23 of thedecoding device 20 generates thehigh frequency signal 8b by duplicating the waveform of thelow frequency signal 8a on the high frequency side.

図３８のステップＳ１１について説明する。復号装置２０の高域整形部２５は、粗い分解能で、包絡電力に合わせて、高域信号８ｂを整形することで、信号８ｃを生成する。 Step S11 of FIG. 38 will be described. The high-frequency shaping unit 25 of thedecoding device 20 generates thesignal 8c by shaping the high-frequency signal 8b according to the envelope power with a coarse resolution.

図３８のステップＳ１２について説明する。復号装置２０の高域整形部２５は、信号８ｃに、トーン周波数に対応する周波数位置にトーン８ｄを付加することで、信号情報８ｅを生成する。この信号情報８ｅが復号した音信号となる。 Step S12 of FIG. 38 will be described. The highfrequency shaping unit 25 of thedecoding device 20 generatessignal information 8e by adding the tone 8d to thesignal 8c at a frequency position corresponding to the tone frequency. Thissignal information 8e becomes the decoded sound signal.

国際公開第２０１４／１９９６３２号International Publication No. 2014/199632特開２０１６−１７３５９７号公報Japanese Unexamined Patent Publication No. 2016-173597

しかしながら、上述した従来技術では、音信号の音質が劣化するという問題がある。 However, the above-mentioned conventional technique has a problem that the sound quality of the sound signal is deteriorated.

たとえば、トーンが低域と高域との境界にある場合、高域側の分解能が粗いことにより、復号時に、低域とずれた周波数にトーンが生成されることがある。低域とずれた周波数にトーンが生成されると、隣接した２つのトーンが生成され、うなりが発生し、音質が劣化する。 For example, when the tone is at the boundary between the low frequency band and the high frequency band, the tone may be generated at a frequency deviated from the low frequency band at the time of decoding due to the coarse resolution on the high frequency band side. When a tone is generated at a frequency deviated from the low frequency range, two adjacent tones are generated, a beat occurs, and the sound quality deteriorates.

図３９は、従来技術の問題を説明するための図である。たとえば、入力音の時間波形および周波数スペクトルを、時間波形３０ａ、周波数スペクトル３１ａとする。復号音の時間波形および周波数スペクトルを時間波形３０ｂ、周波数スペクトル３１ｂとする。時間波形３０ａ，３０ｂの横軸は時間に対応する軸であり、縦軸は、電力（値）に対応する軸である。周波数スペクトル３１ａ，３１ｂの横軸は周波数に対応する軸であり、縦軸は電力（値）に対応する軸である。 FIG. 39 is a diagram for explaining a problem of the prior art. For example, the time waveform and frequency spectrum of the input sound are the time waveform 30a and thefrequency spectrum 31a. Let the time waveform and frequency spectrum of the decoded sound be thetime waveform 30b and thefrequency spectrum 31b. The horizontal axis of thetime waveforms 30a and 30b is the axis corresponding to time, and the vertical axis is the axis corresponding to electric power (value). The horizontal axis of thefrequency spectra 31a and 31b is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value).

たとえば、入力音自体にはうなりは発生していないが、低域と高域の境界に１つのトーンが存在している。ここで、図３８で説明したように、復号装置２０が信号情報を生成すると、信号情報には、２つのトーン３２ａ，３２ｂが含まれてしまい、うなりの原因となる。 For example, the input sound itself does not growl, but there is one tone at the boundary between the low and high frequencies. Here, as described with reference to FIG. 38, when thedecoding device 20 generates the signal information, the signal information includes the twotones 32a and 32b, which causes a beat.

図４０は、高域のトーンがずれる理由を説明するための図である。ステップＳ２１について説明する。たとえば、低域信号には、ある電力値３５ａと、トーン３６ａが存在し、トーン３６ａが存在する周波数を境界とする。復号装置２０の高域生成部２３は、低域信号を高域側に複製することで、高域信号を生成する。たとえば、高域信号には、電力値３５ａに基づき複製された電力値３５ｂと、トーン３６ａに基づき複製された電力値（トーン）３６ｂが含まれる。 FIG. 40 is a diagram for explaining the reason why the high frequency tone shifts. Step S21 will be described. For example, in the low frequency signal, acertain power value 35a and atone 36a exist, and the frequency at which thetone 36a exists is defined as a boundary. The high-frequency generation unit 23 of thedecoding device 20 generates a high-frequency signal by duplicating the low-frequency signal on the high-frequency side. For example, the high frequency signal includes apower value 35b duplicated based on thepower value 35a and a power value (tone) 36b duplicated based on thetone 36a.

ステップＳ２２について説明する。復号装置２０の高域整形部２５は、高域信号を包絡情報９に基づいて整形する。たとえば、分解能が粗いと、包絡情報９は、トーン３６ａの影響により、境界の値が大きくなり、右端側の値が小さくなるように調整されている。このため、電力値３５ｂが、トーン３６ａと同等の大きさとなる電力値３５ｂ’に整形され、トーン３６ｂが、電力値３６ｂ’に整形される。このうち、トーン３６ａと、電力値３５ｂ’とがうなり成分となり、音質が劣化する。 Step S22 will be described. The high-frequency shaping unit 25 of thedecoding device 20 shapes the high-frequency signal based on the envelope information 9. For example, when the resolution is coarse, the envelope information 9 is adjusted so that the boundary value becomes large and the value on the right end side becomes small due to the influence of thetone 36a. Therefore, thepower value 35b is shaped into a power value 35b'having the same magnitude as thetone 36a, and thetone 36b is shaped into apower value 36b'. Of these, thetone 36a and the power value 35b'become beat components, and the sound quality deteriorates.

１つの側面では、本発明は、音信号の音質が劣化することを抑止することができるオーディオ符号化装置、オーディオ符号化方法およびオーディオ符号化プログラムを提供することを目的とする。 In one aspect, it is an object of the present invention to provide an audio coding device, an audio coding method, and an audio coding program capable of suppressing deterioration of the sound quality of a sound signal.

第１の案では、オーディオ符号化装置は、判定部と、符号化部と、多重化部とを有する。判定部は、入力信号の低域と高域との境界にトーンが含まれるか否かを判定する。符号化部は、境界にトーンが含まれている場合に、低域、又は、高域の一方のトーンを抑圧し、入力信号の低域を符号化することで、低域符号を生成し、入力信号の高域を符号化することで、高域符号を生成する。多重化部は、低域符号と高域符号を多重化することで、符号化ストリームを生成する。 In the first plan, the audio coding device has a determination unit, a coding unit, and a multiplexing unit. The determination unit determines whether or not a tone is included in the boundary between the low frequency band and the high frequency band of the input signal. When a tone is included in the boundary, the coding unit suppresses one of the low-frequency or high-frequency tones and encodes the low-frequency of the input signal to generate a low-frequency code. A high frequency code is generated by encoding the high frequency band of the input signal. The multiplexing unit generates a coded stream by multiplexing the low-frequency code and the high-frequency code.

音信号の音質が劣化することを抑止することができる。 It is possible to prevent the sound quality of the sound signal from deteriorating.

図１は、本実施例１に係るシステムの構成を示す図である。FIG. 1 is a diagram showing a configuration of a system according to the first embodiment.図２は、本実施例１に係るオーディオ符号化装置の構成を示す機能ブロック図である。FIG. 2 is a functional block diagram showing a configuration of an audio coding device according to the first embodiment.図３は、本実施例１に係る判定部の構成を示す機能ブロック図である。FIG. 3 is a functional block diagram showing the configuration of the determination unit according to the first embodiment.図４は、ＢＰＦを説明するための図である。FIG. 4 is a diagram for explaining BPF.図５は、本実施例１に係る低域補正部の構成を示す機能ブロック図である。FIG. 5 is a functional block diagram showing a configuration of a low frequency correction unit according to the first embodiment.図６は、動的マスキング閾値を説明するための図である。FIG. 6 is a diagram for explaining the dynamic masking threshold value.図７は、本実施例１に係る低域補正部の処理を説明するための図である。FIG. 7 is a diagram for explaining the processing of the low frequency correction unit according to the first embodiment.図８は、本実施例１に係る高域補正部の構成を示す機能ブロック図である。FIG. 8 is a functional block diagram showing a configuration of a high frequency correction unit according to the first embodiment.図９は、本実施例１に係る高域補正部の処理を説明するための図である。FIG. 9 is a diagram for explaining the processing of the high frequency correction unit according to the first embodiment.図１０は、本実施例１に係る判定部の処理手順を示すフローチャート（１）である。FIG. 10 is a flowchart (1) showing a processing procedure of the determination unit according to the first embodiment.図１１は、本実施例１に係る判定部の処理手順を示すフローチャート（２）である。FIG. 11 is a flowchart (2) showing a processing procedure of the determination unit according to the first embodiment.図１２は、本実施例１に係るオーディオ符号化装置の処理手順を示すフローチャートである。FIG. 12 is a flowchart showing a processing procedure of the audio coding apparatus according to the first embodiment.図１３は、本実施例１に係るオーディオ符号化装置の効果を説明するための図である。FIG. 13 is a diagram for explaining the effect of the audio coding device according to the first embodiment.図１４は、本実施例２に係るオーディオ符号化装置の構成を示す機能ブロック図である。FIG. 14 is a functional block diagram showing the configuration of the audio coding device according to the second embodiment.図１５は、本実施例２に係る入力信号補正部の構成を示す機能ブロック図である。FIG. 15 is a functional block diagram showing a configuration of an input signal correction unit according to the second embodiment.図１６Ａは、本実施例３に係るオーディオ符号化装置の構成を示す機能ブロック図である。FIG. 16A is a functional block diagram showing the configuration of the audio coding device according to the third embodiment.図１６Ｂは、本実施例３に係る補正制御部の処理を説明するための図である。FIG. 16B is a diagram for explaining the processing of the correction control unit according to the third embodiment.図１７Ａは、本実施例４に係るオーディオ符号化装置の構成を示す機能ブロック図である。FIG. 17A is a functional block diagram showing the configuration of the audio coding device according to the fourth embodiment.図１７Ｂは、本実施例４に係る補正制御部の処理を説明するための図である。FIG. 17B is a diagram for explaining the processing of the correction control unit according to the fourth embodiment.図１８は、本実施例５に係るオーディオ符号化装置の構成を示す機能ブロック図である。FIG. 18 is a functional block diagram showing the configuration of the audio coding device according to the fifth embodiment.図１９は、本実施例５に係る高域補正部の構成を示す機能ブロック図である。FIG. 19 is a functional block diagram showing a configuration of a high frequency correction unit according to the fifth embodiment.図２０は、本実施例５に係る高域補正部の処理を説明するための図である。FIG. 20 is a diagram for explaining the processing of the high frequency correction unit according to the fifth embodiment.図２１は、判定部のその他の処理手順を示すフローチャートである。FIG. 21 is a flowchart showing other processing procedures of the determination unit.図２２は、オーディオ符号化装置の問題を説明するための図である。FIG. 22 is a diagram for explaining a problem of the audio coding device.図２３は、低域の信号を非相関化したことによる問題を説明するための図である。FIG. 23 is a diagram for explaining the problem caused by uncorrelating the low frequency signal.図２４は、本実施例６に係るシステムの構成を示す図である。FIG. 24 is a diagram showing a configuration of the system according to the sixth embodiment.図２５は、本実施例６に係るオーディオ符号化装置の構成を示す機能ブロック図である。FIG. 25 is a functional block diagram showing the configuration of the audio coding device according to the sixth embodiment.図２６は、時間周波数信号のデータ構造の一例を示す図である。FIG. 26 is a diagram showing an example of a data structure of a time frequency signal.図２７は、逆フィルタレベルの判定手順を示すフローチャートである。FIG. 27 is a flowchart showing a procedure for determining the inverse filter level.図２８は、本実施例６に係る低域補正部の処理手順を示すフローチャートである。FIG. 28 is a flowchart showing a processing procedure of the low frequency correction unit according to the sixth embodiment.図２９は、符号化ストリームのデータ構造の一例を示す図である。FIG. 29 is a diagram showing an example of the data structure of the coded stream.図３０は、本実施例６に係る復号装置の構成を示す機能ブロック図である。FIG. 30 is a functional block diagram showing the configuration of the decoding device according to the sixth embodiment.図３１は、本実施例６に係るオーディオ符号化装置の処理手順を示すフローチャートである。FIG. 31 is a flowchart showing a processing procedure of the audio coding apparatus according to the sixth embodiment.図３２は、本実施例６に係る復号装置の処理手順を示すフローチャートである。FIG. 32 is a flowchart showing a processing procedure of the decoding apparatus according to the sixth embodiment.図３３は、オーディオ符号化装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。FIG. 33 is a diagram showing an example of a hardware configuration of a computer that realizes a function similar to that of an audio coding device.図３４は、復号装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。FIG. 34 is a diagram showing an example of a hardware configuration of a computer that realizes the same function as the decoding device.図３５は、従来の符号化装置の一例を示す図である。FIG. 35 is a diagram showing an example of a conventional coding device.図３６は、音信号の周波数スペクトルを示す図である。FIG. 36 is a diagram showing a frequency spectrum of a sound signal.図３７は、従来の復号装置の一例を示す図である。FIG. 37 is a diagram showing an example of a conventional decoding device.図３８は、従来の復号装置の処理を説明するための図である。FIG. 38 is a diagram for explaining the processing of the conventional decoding device.図３９は、従来技術の問題を説明するための図である。FIG. 39 is a diagram for explaining a problem of the prior art.図４０は、高域のトーンがずれる理由を説明するための図である。FIG. 40 is a diagram for explaining the reason why the high frequency tone shifts.

以下に、本願の開示するオーディオ符号化装置、オーディオ符号化方法およびオーディオ符号化プログラムの実施例を図面に基づいて詳細に説明する。なお、この実施例によりこの発明が限定されるものではない。 Hereinafter, examples of the audio coding apparatus, the audio coding method, and the audio coding program disclosed in the present application will be described in detail with reference to the drawings. The present invention is not limited to this embodiment.

図１は、本実施例１に係るシステムの構成を示す図である。図１に示すように、このシステムは、オーディオ符号化装置１００と、復号装置２０とを有する。オーディオ符号化装置１００は、ネットワーク５０を介して、復号装置２０に接続される。 FIG. 1 is a diagram showing a configuration of a system according to the first embodiment. As shown in FIG. 1, this system has anaudio coding device 100 and adecoding device 20. Theaudio coding device 100 is connected to thedecoding device 20 via thenetwork 50.

オーディオ符号化装置１００は、外部装置から音信号を取得し、音信号を符号化する装置である。たとえば、オーディオ符号化装置１００は、トーンが低域・高域の境界にあることを検出した場合に、低域側のトーンまたは高域側のトーンの一方を抑圧した後、低域符号および高域符号を多重化してストリームを生成する。オーディオ符号化装置１００は、ストリームを復号装置２０に送信する。ストリームは、符号化ストリームに対応する。 Theaudio coding device 100 is a device that acquires a sound signal from an external device and encodes the sound signal. For example, when theaudio coding device 100 detects that the tone is at the boundary between the low frequency band and the high frequency band, theaudio coding device 100 suppresses either the low frequency band tone or the high frequency band tone, and then suppresses the low frequency code and the high frequency band. Generate a stream by multiplexing the region codes. Theaudio coding device 100 transmits the stream to thedecoding device 20. The stream corresponds to a coded stream.

復号装置２０は、オーディオ符号化装置１００からストリームを受信し、ストリームを復号する装置である。復号装置２０の説明は、図３７で説明した復号装置２０の説明と同様である。 Thedecoding device 20 is a device that receives a stream from theaudio coding device 100 and decodes the stream. The description of thedecoding device 20 is the same as the description of thedecoding device 20 described with reference to FIG. 37.

図２は、本実施例１に係るオーディオ符号化装置の構成を示す機能ブロック図である。図２に示すように、このオーディオ符号化装置１００は、低域信号抽出部１１０、高域情報抽出部１２０、判定部１３０、低域補正部１４０、低域符号化部１５０、高域補正部１６０、高域符号化部１７０、多重化部１８０を有する。たとえば、低域信号抽出部１１０、高域情報抽出部１２０、低域補正部１４０、低域符号化部１５０、高域補正部１６０、高域符号化部１７０は、符号化部に対応する。 FIG. 2 is a functional block diagram showing a configuration of an audio coding device according to the first embodiment. As shown in FIG. 2, theaudio coding device 100 includes a low-frequencysignal extraction unit 110, a high-frequencyinformation extraction unit 120, adetermination unit 130, a low-frequency correction unit 140, a low-frequency coding unit 150, and a high-frequency correction unit. It has 160, a highfrequency coding unit 170, and amultiplexing unit 180. For example, the low-frequencysignal extraction unit 110, the high-frequencyinformation extraction unit 120, the low-frequency correction unit 140, the low-frequency coding unit 150, the high-frequency correction unit 160, and the high-frequency coding unit 170 correspond to the coding unit.

低域信号抽出部１１０は、外部装置から音信号を取得し、音信号の低域に含まれる低域信号を抽出する処理部である。低域信号抽出部１１０は、低域信号を、低域補正部１４０に出力する。管理者は、低域の上限周波数を予め設定しておくものとする。 The low frequencysignal extraction unit 110 is a processing unit that acquires a sound signal from an external device and extracts a low frequency signal included in the low frequency of the sound signal. The low frequencysignal extraction unit 110 outputs the low frequency signal to the lowfrequency correction unit 140. The administrator shall set the upper limit frequency of the low frequency in advance.

高域情報抽出部１２０は、外部装置から音信号を取得し、音信号の高域から、高域情報を抽出する処理部である。高域情報抽出部１２０は、高域情報を、高域補正部１６０に出力する。高域情報には、包絡電力、トーン周波数、周波数分解能が含まれる。管理者は、高域の下限周波数を予め設定しておくものとする。また、高域の下限周波数は、低域の上限周波数より低い帯域でも良い。 The high-frequencyinformation extraction unit 120 is a processing unit that acquires a sound signal from an external device and extracts high-frequency information from the high-frequency range of the sound signal. The high frequencyinformation extraction unit 120 outputs high frequency information to the highfrequency correction unit 160. High frequency information includes envelope power, tone frequency, and frequency resolution. The administrator shall set the lower limit frequency of the high frequency band in advance. Further, the lower limit frequency of the high frequency band may be a band lower than the upper limit frequency of the low frequency band.

たとえば、高域情報抽出部１２０は、音信号を周波数スペクトルに変換し、周波数スペクトルの高域側の包絡線の形状を包絡電力として抽出する。高域情報抽出部１２０は、周波数スペクトルの高域について、電力が閾値以上となる周波数を、トーン周波数として抽出する。周波数分解能は、予め設定されているものとする。 For example, the high frequencyinformation extraction unit 120 converts a sound signal into a frequency spectrum and extracts the shape of the envelope on the high frequency side of the frequency spectrum as the envelope power. The high frequencyinformation extraction unit 120 extracts a frequency at which the power is equal to or higher than a threshold value as a tone frequency for the high frequency of the frequency spectrum. It is assumed that the frequency resolution is set in advance.

判定部１３０は、外部装置から音信号を取得し、音信号の低域と高域との境界にトーンが含まれるか否かを判定する処理部である。また、判定部１３０は、境界にトーンが含まれると判定した場合に、低域のトーンを抑圧するのか、高域のトーンを抑圧するのかを判定する。低域と高域との境界は、低域の上限から高域の下限の間の帯域とする。また、低域の上限から高域の下限の間の帯域の上下に幅を持たせても良い。例えば、「境界の帯の下限−ε〜境界の帯域の上限＋ε」としても良い。 Thedetermination unit 130 is a processing unit that acquires a sound signal from an external device and determines whether or not a tone is included in the boundary between the low frequency band and the high frequency band of the sound signal. Further, when it is determined that the boundary includes a tone, thedetermination unit 130 determines whether to suppress the low frequency tone or the high frequency tone. The boundary between the low range and the high range is the band between the upper limit of the low range and the lower limit of the high range. Further, a width may be provided above and below the band between the upper limit of the low frequency band and the lower limit of the high frequency band. For example, "lower limit of boundary band-ε to upper limit of boundary band + ε" may be set.

図３は、本実施例１に係る判定部の構成を示す機能ブロック図である。図３に示すように、この判定部１３０は、ＢＰＦ（Band Pass Filter）１３１と、トーン検出部１３２と、補正判定部１３３とを有する。 FIG. 3 is a functional block diagram showing the configuration of the determination unit according to the first embodiment. As shown in FIG. 3, thedetermination unit 130 includes a BPF (Band Pass Filter) 131, a tone detection unit 132, and a correction determination unit 133.

ＢＰＦ１３１は、音信号のうち、低域と高域との境界付近の音信号を通過させるフィルタである。ＢＰＦ１３１を通過した音信号は、トーン検出部１３２に出力される。 TheBPF 131 is a filter that passes a sound signal near the boundary between the low frequency band and the high frequency band among the sound signals. The sound signal that has passed through theBPF 131 is output to the tone detection unit 132.

図４は、ＢＰＦを説明するための図である。図４において、横軸は周波数に対応する軸であり、縦軸は電力に対応する軸である。低域と高域との境界６０を含むように、幅６０ａのＢＰＦを適用する。幅６０ａは、低域の上限と、高域の下限を基にして、決定されても良い。たとえば、幅６０ａを「低域の上限−α〜高域の下限＋α」としても良い。また、高域の下限周波数≦低域の下限周波数の場合は、幅６０ａを「高域の下限−α〜低域の上限＋α」としても良い。 FIG. 4 is a diagram for explaining BPF. In FIG. 4, the horizontal axis is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power. A BPF with a width of 60a is applied so as to include theboundary 60 between the low frequency band and the high frequency band. Thewidth 60a may be determined based on the upper limit of the low frequency band and the lower limit of the high frequency band. For example, thewidth 60a may be set to "upper limit of low frequency range-α to lower limit of high frequency range + α". Further, when the lower limit frequency of the high range ≤ the lower limit frequency of the low range, thewidth 60a may be set to "lower limit of high range −α to upper limit of low range + α".

ここでは一例として、音信号から、境界付近の音信号を抽出する場合に、ＢＰＦ１３１を用いたが、これに限定されるものではない。たとえば、ＦＦＴ（Fast Fourier Transform）、ＭＤＣＴ（Modified Discrete Cosine Transform）、ＱＭＦ（Quadrature Mirror Filter）変換等を用いて、境界付近の音信号を抽出しても良い。 Here, as an example, BPF131 is used when extracting a sound signal near the boundary from the sound signal, but the present invention is not limited to this. For example, a sound signal near the boundary may be extracted by using FFT (Fast Fourier Transform), MDCT (Modified Discrete Cosine Transform), QMF (Quadrature Mirror Filter) transform, or the like.

トーン検出部１３２は、境界付近の音信号にトーンが含まれているか否かを判定する処理部である。たとえば、トーン検出部１３２は、境界付近の音信号を基にして、トーン性を示す数値を算出し、トーン性を示す数値が閾値以上である場合に、トーンが含まれていると判定する。トーン検出部１３２に関する以下の説明では、境界付近の音信号を、単に音信号と表記する。トーン検出部１３２は、第１のトーン検出処理または第２のトーン検出処理を実行することで、トーンの有無を検出する。 The tone detection unit 132 is a processing unit that determines whether or not the sound signal near the boundary contains a tone. For example, the tone detection unit 132 calculates a numerical value indicating the tone property based on the sound signal near the boundary, and determines that the tone is included when the numerical value indicating the tone property is equal to or higher than the threshold value. In the following description of the tone detection unit 132, the sound signal near the boundary is simply referred to as a sound signal. The tone detection unit 132 detects the presence or absence of a tone by executing the first tone detection process or the second tone detection process.

第１のトーン検出処理の一例について説明する。トーン検出部１３２は、式（１）に基づいて、音信号のパワースペクトルの平坦性の逆数を、トーン性を示す数値Ｔ１として算出する。数値Ｔ１が小さいほど、音信号の周波数スペクトルの波形がより平坦であり、トーンが含まれている可能性が低いと言える。式（１）において、Ｘ（ω）は、周波数ωに対応する音信号の電力を示す。 An example of the first tone detection process will be described. Based on the equation (1), the tone detection unit 132 calculates the reciprocal of the flatness of the power spectrum of the sound signal as the numerical value T1 indicating the tone property. It can be said that the smaller the numerical value T1, the flatter the waveform of the frequency spectrum of the sound signal, and the less likely it is that the tone is included. In equation (1), X (ω) indicates the power of the sound signal corresponding to the frequency ω.

トーン検出部１３２は、数値Ｔ１が閾値ＴＨ１よりも大きい場合に、音信号にトーンが含まれていると判定する。一方、トーン検出部１３２は、数値Ｔ１が閾値ＴＨ１よりも大きくない場合に、音信号にトーンが含まれていないと判定する。 When the numerical value T1 is larger than the threshold value TH1, the tone detection unit 132 determines that the sound signal contains a tone. On the other hand, the tone detection unit 132 determines that the sound signal does not include a tone when the numerical value T1 is not larger than the threshold value TH1.

第２のトーン検出処理の一例について説明する。トーン検出部１３２は、式（２）、（３ａ）に基づいて、音信号の時間領域に関して、時刻ｉにおける音信号の値ｘ（ｉ）での自己相関Ｒ（ｊ）を求め、自己相関Ｒ（ｊ）の最大値を、トーン性を示す数値Ｔ２として算出する。トーン検出部１３２は、数値Ｔ２が閾値ＴＨ２よりも大きい場合に、音信号にトーンが含まれていると判定する。一方、トーン検出部１３２は、数値Ｔ２が閾値ＴＨ２よりも大きくない場合に、音信号にトーンが含まれていないと判定する。 An example of the second tone detection process will be described. Based on the equations (2) and (3a), the tone detection unit 132 obtains the autocorrelation R (j) at the sound signal value x (i) at time i with respect to the time domain of the sound signal, and the autocorrelation R The maximum value of (j) is calculated as the numerical value T2 indicating the tone property. When the numerical value T2 is larger than the threshold value TH2, the tone detection unit 132 determines that the sound signal contains a tone. On the other hand, the tone detection unit 132 determines that the sound signal does not include a tone when the numerical value T2 is not larger than the threshold value TH2.

Ｔ２＝ｍａｘ（Ｒ（ｊ））・・・（３ａ） T2 = max (R (j)) ... (3a)

トーン検出部１３２は、第１のトーン検出処理または第２のトーン検出処理を実行し、トーン有りと判定した場合には、トーン有の情報を、補正判定部１３３に出力する。また、トーン検出部１３２は、トーン電力を、低域補正部１４０および高域補正部１６０に出力する。トーン電力は、低域と高域との境界に存在するトーンの電力である。 The tone detection unit 132 executes the first tone detection process or the second tone detection process, and when it is determined that there is a tone, the tone detection unit 132 outputs the information with the tone to the correction determination unit 133. Further, the tone detection unit 132 outputs the tone power to the lowfrequency correction unit 140 and the highfrequency correction unit 160. Tone power is the power of the tone that exists at the boundary between the low frequency band and the high frequency band.

一方、トーン検出部１３２は、トーン無と判定した場合には、トーン無の情報を、補正判定部１３３に出力する。 On the other hand, when the tone detection unit 132 determines that there is no tone, the tone detection unit 132 outputs the information of no tone to the correction determination unit 133.

補正判定部１３３は、トーン検出部１３２から「トーン有」である旨の情報を取得した場合には、符号化条件を取得し、符号化条件を基にして、音信号の低域のトーンを抑圧するのか、高域のトーンを抑圧するのかを判定する処理部である。符号化条件は、たとえば、符号化のビットレートの情報を含む。符号化条件の情報は、管理者が入力しても良いし、補正判定部１３３に予め設定しておいても良い。 When the correction determination unit 133 acquires the information indicating that "there is a tone" from the tone detection unit 132, the correction determination unit 133 acquires the coding condition, and based on the coding condition, determines the low-frequency tone of the sound signal. It is a processing unit that determines whether to suppress or suppress high-frequency tones. The coding condition includes, for example, information on the bit rate of coding. The information of the coding condition may be input by the administrator, or may be set in advance in the correction determination unit 133.

補正判定部１３３は、符号化条件に含まれるビットレートの値が、閾値以上である場合に、符号化条件が高レートであると判定する。補正判定部１３３は、高レートであると判定した場合に、高域のトーンを抑圧すると判定し、制御信号を、高域補正部１６０に出力する。 The correction determination unit 133 determines that the coding condition is a high rate when the value of the bit rate included in the coding condition is equal to or greater than the threshold value. When the correction determination unit 133 determines that the rate is high, it determines that the high frequency tone is suppressed, and outputs a control signal to the highfrequency correction unit 160.

補正判定部１３３は、符号化条件に含まれるビットレートの値が、閾値未満である場合に、符号化条件が低レートであると判定する。補正判定部１３３は、低レートであると判定した場合に、低域のトーンを抑圧すると判定し、制御信号を、低域補正部１４０に出力する。 The correction determination unit 133 determines that the coding condition is a low rate when the value of the bit rate included in the coding condition is less than the threshold value. When the correction determination unit 133 determines that the rate is low, it determines that the low frequency tone is suppressed, and outputs a control signal to the lowfrequency correction unit 140.

図２の説明に戻る。低域補正部１４０は、判定部１３０から制御信号を受け付けた場合に、低域信号に含まれる境界のトーン成分を抑圧することで、低域信号を補正する処理部である。低域補正部１４０は、補正した低域信号を、低域符号化部１５０に出力する。 Returning to the description of FIG. The low-frequency correction unit 140 is a processing unit that corrects the low-frequency signal by suppressing the tone component of the boundary included in the low-frequency signal when the control signal is received from thedetermination unit 130. The lowfrequency correction unit 140 outputs the corrected low frequency signal to the lowfrequency coding unit 150.

低域補正部１４０は、判定部１３０から制御信号を受け付けていない場合には、低域信号抽出部１１０から受け付ける低域信号を、そのまま、低域符号化部１５０に出力する。 When the low-frequency correction unit 140 does not receive the control signal from thedetermination unit 130, the low-frequency correction unit 140 outputs the low-frequency signal received from the low-frequencysignal extraction unit 110 to the low-frequency coding unit 150 as it is.

図５は、本実施例１に係る低域補正部の構成を示す機能ブロック図である。図５に示すように、この低域補正部１４０は、スイッチ１４１と、抑圧ゲイン算出部１４２と、平滑化部１４３と、トーン抑圧部１４４とを有する。 FIG. 5 is a functional block diagram showing a configuration of a low frequency correction unit according to the first embodiment. As shown in FIG. 5, the lowfrequency correction unit 140 includes aswitch 141, a suppressiongain calculation unit 142, a smoothingunit 143, and atone suppression unit 144.

スイッチ１４１は、判定部１３０から取得する制御信号に応じて、低域信号の経路を切り替えるスイッチである。スイッチ１４１は、制御信号を受け付けていない場合には、端子１４１ａと端子１４１ｂとを接続することで、低域信号をそのまま通過させる。スイッチ１４１は、制御信号を受け付けた場合には、端子１４１ａと端子１４１ｃとを接続することで、低域信号を、トーン抑圧部１４４に入力する。 Theswitch 141 is a switch that switches the path of the low frequency signal according to the control signal acquired from thedetermination unit 130. When theswitch 141 does not receive the control signal, theswitch 141 connects the terminal 141a and the terminal 141b so that the low frequency signal is passed as it is. When theswitch 141 receives the control signal, theswitch 141 inputs the low frequency signal to thetone suppression unit 144 by connecting the terminal 141a and the terminal 141c.

抑圧ゲイン算出部１４２は、低域信号のトーンを動的マスキング閾値以下に抑圧するゲインを算出する処理部である。動的マスキング閾値は、抑圧対象のトーンが存在する周波数と、トーン電力との組に応じて決定される閾値である。 The suppressiongain calculation unit 142 is a processing unit that calculates a gain that suppresses the tone of the low-frequency signal below the dynamic masking threshold value. The dynamic masking threshold is a threshold determined according to the combination of the frequency in which the tone to be suppressed exists and the tone power.

図６は、動的マスキング閾値を説明するための図である。図６において、横軸は周波数に対応する軸であり、縦軸は電力に対応する軸である。たとえば、トーンが隣接する場合であっても、トーン電力が動的マスキング閾値未満である場合には、トーンが聞こえなくなる。 FIG. 6 is a diagram for explaining the dynamic masking threshold value. In FIG. 6, the horizontal axis is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power. For example, even if the tones are adjacent, if the tone power is less than the dynamic masking threshold, the tones are inaudible.

トーン６５Ａの動的マスキング閾値は、閾値６６となる。トーン６５Ａのトーン電力は閾値６６以上であるため、トーン６５Ａの音は聞こえる。一方、トーン６５Ａのトーン電力を抑圧して、トーン６５Ｂに補正すると、閾値６６未満となり、トーン６５Ｂの音は聞こえない。 The dynamic masking threshold of thetone 65A is thethreshold 66. Since the tone power of thetone 65A is equal to or higher than thethreshold value 66, the sound of thetone 65A can be heard. On the other hand, when the tone power of thetone 65A is suppressed and corrected to thetone 65B, the threshold value is less than 66, and the sound of thetone 65B cannot be heard.

トーン６５Ｃの動的マスキング閾値は、閾値６７となる。トーン６５Ｃのトーン電力は閾値６７以上であるため、トーン６５Ｃの音は聞こえる。一方、トーン６５Ｃのトーン電力を抑圧して、トーン６５Ｄに補正すると、閾値６７未満となり、トーン６５Ｄの音は聞こえない。 The dynamic masking threshold of thetone 65C is thethreshold 67. Since the tone power of thetone 65C is thethreshold value 67 or more, the sound of thetone 65C can be heard. On the other hand, when the tone power of thetone 65C is suppressed and corrected to thetone 65D, the threshold value is less than 67, and the sound of thetone 65D cannot be heard.

抑圧ゲイン算出部１４２は、トーンの周波数およびトーン電力と、動的マスキング閾値とを対応づけたテーブルを参照して、動的マスキング閾値を特定する。たとえば、トーンの周波数を、低域と高域との境界の周波数とする。抑圧ゲイン算出部１４２は、トーン電力と、動的マスキング閾値とを比較して、トーン電力が動的マスキング閾値未満となる抑圧ゲインを特定する。抑圧ゲイン算出部１４２は、抑圧ゲインを平滑化部１４３に出力する。 The suppressiongain calculation unit 142 specifies the dynamic masking threshold value with reference to a table in which the tone frequency and tone power are associated with the dynamic masking threshold value. For example, let the tone frequency be the frequency at the boundary between the low and high frequencies. The suppressiongain calculation unit 142 compares the tone power with the dynamic masking threshold value to specify the suppression gain at which the tone power is less than the dynamic masking threshold value. The suppressiongain calculation unit 142 outputs the suppression gain to thesmoothing unit 143.

平滑化部１４３は、低域信号のトーン成分をなだらかに抑圧するために、段階的に大きくする抑圧ゲインを、トーン抑圧部１４４に出力する処理部である。たとえば、平滑化部１４３は、抑圧ゲインを初期値から徐々に大きくしていき、最終的に、抑圧ゲインの大きさを、抑圧ゲイン算出部１４２から通知される抑圧ゲインの大きさになるように調整する。 The smoothingunit 143 is a processing unit that outputs a suppression gain that is gradually increased to thetone suppression unit 144 in order to gently suppress the tone component of the low frequency signal. For example, the smoothingunit 143 gradually increases the suppression gain from the initial value, and finally sets the magnitude of the suppression gain to the magnitude of the suppression gain notified from the suppressiongain calculation unit 142. adjust.

トーン抑圧部１４４は、平滑化部１４３から取得する抑圧ゲインを、トーン成分に乗算することで、境界のトーンを抑圧し、低域信号を補正する処理部である。トーン抑圧部１４４は、補正した低域信号を、低域符号化部１５０に出力する。 Thetone suppression unit 144 is a processing unit that suppresses the boundary tone and corrects the low frequency signal by multiplying the tone component by the suppression gain acquired from the smoothingunit 143. Thetone suppression unit 144 outputs the corrected low-frequency signal to the low-frequency coding unit 150.

図７は、本実施例１に係る低域補正部の処理を説明するための図である。図７において、補正前の低域信号の周波数スペクトルを、周波数スペクトル７０ａとする。補正後の低域信号の周波数スペクトルを、周波数スペクトル７０ｂとする。周波数スペクトル７０ａ，７０ｂの横軸は、周波数に対応する軸であり、周波数スペクトル７０ａ，７０ｂの縦軸は、電力に対応する軸である。 FIG. 7 is a diagram for explaining the processing of the low frequency correction unit according to the first embodiment. In FIG. 7, the frequency spectrum of the low frequency signal before correction is defined as thefrequency spectrum 70a. The frequency spectrum of the corrected low frequency signal is defined as thefrequency spectrum 70b. The horizontal axis of thefrequency spectra 70a and 70b is the axis corresponding to the frequency, and the vertical axis of thefrequency spectra 70a and 70b is the axis corresponding to the electric power.

周波数スペクトル７０ａに示すように、境界にトーン７１ａがある。トーン７１ａに対応する動的マスキング閾値を、動的マスキング閾値７２とする。トーン抑圧部１４４は、トーン７１ａが動的マスキング閾値７２未満となるような抑圧ゲインを与えることで、トーン７１ａを、トーン７１ｂに補正する。これにより、トーン７１ｂが、動的マスキング閾値７２未満となり、聞こえなくなるため、音信号の音質が劣化することができる。 As shown in thefrequency spectrum 70a, there is atone 71a at the boundary. The dynamic masking threshold corresponding to thetone 71a is defined as thedynamic masking threshold 72. Thetone suppression unit 144 corrects thetone 71a to thetone 71b by giving a suppression gain so that thetone 71a becomes less than the dynamicmasking threshold value 72. As a result, thetone 71b becomes less than the dynamicmasking threshold value 72 and cannot be heard, so that the sound quality of the sound signal can be deteriorated.

図２の説明に戻る。低域符号化部１５０は、低域補正部から低域信号を取得し、低域信号をビット列に符号化することで、低域符号を生成する処理部である。たとえば、低域符号化部１５０は、ＡＡＣに基づいて、符号化を行う。低域符号化部１５０は、低域符号を、多重化部１８０に出力する。 Returning to the description of FIG. The low-frequency coding unit 150 is a processing unit that generates a low-frequency code by acquiring a low-frequency signal from the low-frequency correction unit and encoding the low-frequency signal into a bit string. For example, the lowfrequency coding unit 150 performs coding based on AAC. The lowfrequency coding unit 150 outputs the low frequency code to themultiplexing unit 180.

高域補正部１６０は、判定部１３０から制御信号を受け付けた場合に、高域情報に含まれる境界の包絡電力を抑圧することで、高域情報を補正する処理部である。高域補正部１６０は、補正した高域情報を、高域符号化部１７０に出力する。 The highfrequency correction unit 160 is a processing unit that corrects the high frequency information by suppressing the envelope power of the boundary included in the high frequency information when the control signal is received from thedetermination unit 130. The highfrequency correction unit 160 outputs the corrected high frequency information to the highfrequency coding unit 170.

高域補正部１６０は、判定部１３０から制御信号を受け付けていない場合には、高域情報抽出部１２０から取得する高域情報を、そのまま、高域符号化部１７０に出力する。 When the high-frequency correction unit 160 does not receive the control signal from thedetermination unit 130, the high-frequency correction unit 160 outputs the high-frequency information acquired from the high-frequencyinformation extraction unit 120 to the high-frequency coding unit 170 as it is.

図８は、本実施例１に係る高域補正部の構成を示す機能ブロック図である。図８に示すように、この高域補正部１６０は、スイッチ１６１と、抑圧ゲイン算出部１６２と、平滑化部１６３と、トーン抑圧部１６４とを有する。 FIG. 8 is a functional block diagram showing a configuration of a high frequency correction unit according to the first embodiment. As shown in FIG. 8, the highfrequency correction unit 160 includes aswitch 161, a suppressiongain calculation unit 162, a smoothingunit 163, and atone suppression unit 164.

スイッチ１６１は、判定部１３０から取得する制御信号に応じて、高域情報の経路を切り替えるスイッチである。スイッチ１６１は、制御信号を受け付けていない場合には、端子１６１ａと端子１６１ｂとを接続することで、高域情報をそのまま通過させる。スイッチ１６１は、制御信号を受け付けた場合には、端子１６１ａと端子１６１ｃとを接続することで、高域情報を、トーン抑圧部１６４に入力する。 Theswitch 161 is a switch that switches the path of high frequency information according to the control signal acquired from thedetermination unit 130. When theswitch 161 does not receive the control signal, theswitch 161 connects the terminal 161a and the terminal 161b so that the high frequency information is passed as it is. When theswitch 161 receives the control signal, theswitch 161 connects the terminal 161a and the terminal 161c to input high frequency information to thetone suppression unit 164.

抑圧ゲイン算出部１６２は、高域情報に含まれる境界の包絡電力（トーン電力）を動的マスキング閾値以下に抑圧するゲインを算出する処理部である。動的マスキング閾値は、境界の周波数と、境界の包絡電力との組に応じて決定される閾値である。 The suppressiongain calculation unit 162 is a processing unit that calculates a gain that suppresses the boundary envelopment power (tone power) included in the high frequency information to be equal to or lower than the dynamic masking threshold value. The dynamic masking threshold is a threshold determined according to the combination of the boundary frequency and the boundary envelope power.

抑圧ゲイン算出部１６２は、境界の周波数および境界の包絡電力と、動的マスキング閾値とを対応づけたテーブルを参照して、動的マスキング閾値を特定する。抑圧ゲイン算出部１６２は、境界の包絡電力と、動的マスキング閾値とを比較して、包絡電力が動的マスキング閾値未満となる抑圧ゲインを特定する。抑圧ゲイン算出部１６２は、抑圧ゲインを平滑化部１６３に出力する。 The suppressiongain calculation unit 162 specifies the dynamic masking threshold value with reference to a table in which the boundary frequency and the boundary envelopment power are associated with the dynamic masking threshold value. The suppressiongain calculation unit 162 compares the envelope power at the boundary with the dynamic masking threshold value to specify the suppression gain at which the envelope power is less than the dynamic masking threshold value. The suppressiongain calculation unit 162 outputs the suppression gain to thesmoothing unit 163.

平滑化部１６３は、包絡電力の値をなだらかに抑圧するために、段階的に大きくする抑圧ゲインを、トーン抑圧部１６４に出力する処理部である。たとえば、平滑化部１６３は、抑圧ゲインを初期値から徐々に大きくしていき、最終的に、抑圧ゲインの大きさを、抑圧ゲイン算出部１６２から通知される抑圧ゲインの大きさになるように調整する。 The smoothingunit 163 is a processing unit that outputs a suppression gain that is gradually increased to thetone suppression unit 164 in order to gently suppress the value of the envelope power. For example, the smoothingunit 163 gradually increases the suppression gain from the initial value, and finally sets the magnitude of the suppression gain to the magnitude of the suppression gain notified from the suppressiongain calculation unit 162. adjust.

トーン抑圧部１６４は、平滑化部１６３から取得する抑圧ゲインを、境界の包絡電力に乗算することで、高域情報を補正する処理部である。境界の包絡電力を抑制することで、復号装置２０で復号される境界のトーンが、動的マスキング閾値未満となる。トーン抑圧部１６４は、補正した高域情報を、高域符号化部１７０に出力する。なお、トーン抑圧部１６４は、高域情報に含まれる包絡電力、トーン周波数、周波数分解能のうち、包絡電力のみ補正し、トーン周波数、周波数分解能の補正は行わないものとする。 Thetone suppression unit 164 is a processing unit that corrects high frequency information by multiplying the suppression gain acquired from the smoothingunit 163 by the envelope power of the boundary. By suppressing the envelope power of the boundary, the tone of the boundary decoded by thedecoding device 20 becomes less than the dynamic masking threshold. Thetone suppression unit 164 outputs the corrected high frequency information to the highfrequency coding unit 170. Thetone suppression unit 164 corrects only the envelope power among the envelope power, tone frequency, and frequency resolution included in the high frequency information, and does not correct the tone frequency and frequency resolution.

図９は、本実施例１に係る高域補正部の処理を説明するための図である。図９において、補正前の包絡電力７６ａを、周波数スペクトル７５ａ上に示す。補正後の包絡電力７６ｂを、周波数スペクトル７５ｂ上に示す。周波数スペクトル７５ａ，７５ｂの横軸は、周波数に対応する軸であり、周波数スペクトル７５ａ，７５ｂの縦軸は、電力に対応する軸である。また、低域と高域との境界を、境界７７とする。 FIG. 9 is a diagram for explaining the processing of the high frequency correction unit according to the first embodiment. In FIG. 9, theenvelope power 76a before correction is shown on thefrequency spectrum 75a. The correctedenvelope power 76b is shown on thefrequency spectrum 75b. The horizontal axis of thefrequency spectra 75a and 75b is the axis corresponding to the frequency, and the vertical axis of thefrequency spectra 75a and 75b is the axis corresponding to the electric power. Further, the boundary between the low range and the high range is defined as theboundary 77.

たとえば、境界７７付近の包絡電力７６ａに対応する動的マスキング閾値を、動的マスキング閾値７８とする。トーン抑圧部１６４は、境界７７の包絡電力７６ａが、動的マスキング閾値７８未満となるように、包絡電力７６ａを抑圧した包絡電力７６ｂを生成することで、高域情報を補正する。包絡電力７６ｂは、動的マスキング閾値７８未満であるため、この包絡電力７６ｂを基に復号される境界のトーン成分が抑えられる。 For example, the dynamic masking threshold value corresponding to theenvelope power 76a near theboundary 77 is set to the dynamicmasking threshold value 78. Thetone suppression unit 164 corrects high frequency information by generating anenvelope power 76b that suppresses theenvelope power 76a so that theenvelope power 76a at theboundary 77 becomes less than the dynamicmasking threshold value 78. Since theenvelope power 76b is less than the dynamicmasking threshold value 78, the tone component of the boundary decoded based on theenvelope power 76b is suppressed.

図２の説明に戻る。多重化部１８０は、低域符号と、高域符号とを多重化することで、ストリームを生成する処理部である。多重化部１８０は、ストリームを、ネットワーク５０を介して、復号装置２０に送信する。 Returning to the description of FIG. Themultiplexing unit 180 is a processing unit that generates a stream by multiplexing the low frequency code and the high frequency code. Themultiplexing unit 180 transmits the stream to thedecoding device 20 via thenetwork 50.

次に、本実施例１に係るオーディオ符号化装置１００の判定部１３０の処理手順について説明する。図１０は、本実施例１に係る判定部の処理手順を示すフローチャート（１）である。図１０に示すように、オーディオ符号化装置１００の判定部１３０は、トーン性Ｔを算出する（ステップＳ１０１）。ステップＳ１０１において、判定部１３０は、第１のトーン検出処理によりトーン性Ｔ１を算出しても良いし、第２のトーン検出処理によりトーン性Ｔ２を算出しても良い。 Next, the processing procedure of thedetermination unit 130 of theaudio coding device 100 according to the first embodiment will be described. FIG. 10 is a flowchart (1) showing a processing procedure of the determination unit according to the first embodiment. As shown in FIG. 10, thedetermination unit 130 of theaudio coding device 100 calculates the tone property T (step S101). In step S101, thedetermination unit 130 may calculate the tone property T1 by the first tone detection process, or may calculate the tone property T2 by the second tone detection process.

判定部１３０は、トーン性Ｔが、閾値ＴＨよりも大きいか否かを判定する（ステップＳ１０２）。ステップＳ１０２において、判定部１３０は、トーン性Ｔ１を算出した場合には、閾値ＴＨ１と比較する。判定部１３０は、トーン性Ｔ２を算出した場合には、閾値ＴＨ２と比較する。 Thedetermination unit 130 determines whether or not the tone property T is larger than the threshold value TH (step S102). In step S102, when thedetermination unit 130 calculates the tone property T1, it compares it with the threshold value TH1. When thedetermination unit 130 calculates the tone property T2, it compares it with the threshold value TH2.

判定部１３０は、トーン性Ｔが閾値ＴＨよりも大きい場合には（ステップＳ１０２，Ｙｅｓ）、トーン有と判定する（ステップＳ１０４）。一方、判定部１３０は、トーン性Ｔが閾値ＴＨよりも大きくない場合には（ステップＳ１０２，Ｎｏ）、トーン無と判定する（ステップＳ１０３）。判定部１３０は、トーン電力を計算する（ステップＳ１０５）。 When the tone property T is larger than the threshold value TH (step S102, Yes), thedetermination unit 130 determines that the tone is present (step S104). On the other hand, when the tone property T is not larger than the threshold value TH (step S102, No), thedetermination unit 130 determines that there is no tone (step S103). Thedetermination unit 130 calculates the tone power (step S105).

図１１は、本実施例１に係る判定部の処理手順を示すフローチャート（２）である。図１１に示すように、オーディオ符号化装置１００の判定部１３０は、トーン検出結果が、トーン有りであるか否かを判定する（ステップＳ２０１）。判定部１３０は、トーン検出結果が、トーン有りでない場合には（ステップＳ２０１，Ｎｏ）、補正処理を実行しない旨の制御信号を出力する（ステップＳ２０２）。なお、ステップＳ２０２において、判定部１３０は、補正処理を実行しないと判定した場合に、制御信号の出力を抑止しても良い。 FIG. 11 is a flowchart (2) showing a processing procedure of the determination unit according to the first embodiment. As shown in FIG. 11, thedetermination unit 130 of theaudio coding device 100 determines whether or not the tone detection result has a tone (step S201). If the tone detection result does not have a tone (steps S201, No), thedetermination unit 130 outputs a control signal indicating that the correction process is not executed (step S202). In step S202, thedetermination unit 130 may suppress the output of the control signal when it is determined that the correction process is not executed.

判定部１３０は、トーン検出結果が、トーン有りの場合には（ステップＳ２０１，Ｙｅｓ）、符号化条件のビットレートが所定値以上であるか否かを判定する（ステップＳ２０３）。判定部１３０は、符号化条件のビットレートが所定値以上である場合には（ステップＳ２０３，Ｙｅｓ）、高域補正を行う旨の、制御信号を高域補正部１６０に出力する（ステップＳ２０４）。 When the tone detection result has a tone (step S201, Yes), thedetermination unit 130 determines whether or not the bit rate of the coding condition is equal to or higher than a predetermined value (step S203). When the bit rate of the coding condition is equal to or higher than a predetermined value (step S203, Yes), thedetermination unit 130 outputs a control signal to the highfrequency correction unit 160 to perform high frequency correction (step S204). ..

判定部１３０は、符号化条件のビットレートが所定値以上でない場合には（ステップＳ２０３，Ｎｏ）、低域補正を行う旨の、制御信号を低域補正部１４０に出力する（ステップＳ２０５）。 When the bit rate of the coding condition is not equal to or higher than a predetermined value (step S203, No), thedetermination unit 130 outputs a control signal to the lowfrequency correction unit 140 to perform low frequency correction (step S205).

次に、本実施例１に係るオーディオ符号化装置１００の処理手順の一例について説明する。図１２は、本実施例１に係るオーディオ符号化装置の処理手順を示すフローチャートである。図１２に示すように、このオーディオ符号化装置１００は、音信号を受け付ける（ステップＳ３０１）。 Next, an example of the processing procedure of theaudio coding apparatus 100 according to the first embodiment will be described. FIG. 12 is a flowchart showing a processing procedure of the audio coding apparatus according to the first embodiment. As shown in FIG. 12, theaudio coding device 100 receives a sound signal (step S301).

オーディオ符号化装置１００の低域信号抽出部１１０は、音信号から低域信号を抽出する（ステップＳ３０２）。オーディオ符号化装置１００の高域情報抽出部１２０は、音信号から高域情報を抽出する（ステップＳ３０３）。 The low-frequencysignal extraction unit 110 of theaudio coding device 100 extracts the low-frequency signal from the sound signal (step S302). The high-frequencyinformation extraction unit 120 of theaudio coding device 100 extracts high-frequency information from the sound signal (step S303).

オーディオ符号化装置１００の判定部１３０は、境界におけるトーンの有無を判定する。判定部１３０は、トーン有の場合には、低域を補正するのか、高域を補正するのかを判定する（ステップＳ３０４）。 Thedetermination unit 130 of theaudio coding device 100 determines the presence or absence of a tone at the boundary. When there is a tone, thedetermination unit 130 determines whether to correct the low frequency band or the high frequency band (step S304).

オーディオ符号化装置１００の低域補正部１４０は、低域を補正すると判定された場合に、低域信号を補正する（ステップＳ３０５）。オーディオ符号化装置１００の高域補正部１６０は、高域を補正すると判定された場合に、高域情報の包絡電力を補正する（ステップＳ３０６）。 The low-frequency correction unit 140 of theaudio coding device 100 corrects the low-frequency signal when it is determined to correct the low-frequency signal (step S305). The highfrequency correction unit 160 of theaudio coding apparatus 100 corrects the envelope power of the high frequency information when it is determined to correct the high frequency (step S306).

オーディオ符号化装置１００の低域符号化部１５０は、低域信号を符号化して、低域符号を生成する（ステップＳ３０７）。オーディオ符号化装置１００の高域符号化部１７０は、高域情報を符号化して、高域符号を生成する（ステップＳ３０８）。 The low-frequency coding unit 150 of theaudio coding device 100 encodes the low-frequency signal to generate a low-frequency code (step S307). The high-frequency coding unit 170 of theaudio coding device 100 encodes the high-frequency information to generate a high-frequency code (step S308).

オーディオ符号化装置１００の多重化部１８０は、低域符号と高域符号とを多重化したストリームを生成する（ステップＳ３０９）。多重化部１８０は、ストリームを復号装置２０に送信する（ステップＳ３１０）。 Themultiplexing unit 180 of theaudio coding apparatus 100 generates a stream in which the low frequency code and the high frequency code are multiplexed (step S309). Themultiplexing unit 180 transmits the stream to the decoding device 20 (step S310).

次に、本実施例１に係るオーディオ符号化装置１００の効果について説明する。オーディオ符号化装置１００は、トーンが低域・高域の境界にあることを検出した場合に、低域側または高域側のトーンの一方を抑圧した後に、低域符号および高域符号を多重化したストリームを生成する。これにより、音信号の音質が劣化することを抑止することができる。 Next, the effect of theaudio coding device 100 according to the first embodiment will be described. When theaudio coding device 100 detects that the tone is at the boundary between the low and high frequencies, it suppresses either the low or high tone and then multiplexes the low and high frequencies. Generate a converted stream. As a result, it is possible to prevent the sound quality of the sound signal from deteriorating.

たとえば、オーディオ符号化装置１００は、トーンが境界にあることを検出し、低域信号のトーンを抑圧することで、たとえば、図３９のトーン３２ａが小さくなる。これにより、うなり成分がなくなり、音質劣化を抑止できる。オーディオ符号化装置１００は、トーンが境界にあることを検出し、高域情報のトーン（包絡電力）を抑圧することで、たとえば、図３９のトーン３２ｂが小さくなる。これにより、うなり成分がなくなり、音質劣化を抑止できる。 For example, theaudio coding device 100 detects that the tone is at the boundary and suppresses the tone of the low frequency signal, so that thetone 32a in FIG. 39 becomes smaller, for example. As a result, the beat component is eliminated and the deterioration of sound quality can be suppressed. Theaudio coding device 100 detects that the tone is at the boundary and suppresses the tone (envelope power) of the high frequency information, so that thetone 32b in FIG. 39 becomes smaller, for example. As a result, the beat component is eliminated and the deterioration of sound quality can be suppressed.

オーディオ符号化装置１００は、符号化条件のビットレートと閾値との比較により、低域のトーンを抑圧するのか、高域のトーンを抑圧するのかを判定し、判定結果に応じた帯域のトーンを抑圧する。これにより、ビットレートに応じて、音質の劣る方の帯域で、補正を行うことができる。たとえば、ビットレートが高レートの場合には、高域の音質が劣るため、高域を補正する。一方、ビットレートが低レートの場合には、低域の音質が劣るため、低域を補正する。 Theaudio coding apparatus 100 determines whether to suppress the low-frequency tone or the high-frequency tone by comparing the bit rate of the coding condition with the threshold value, and determines whether to suppress the high-frequency tone, and determines the tone in the band according to the determination result. Suppress. As a result, correction can be performed in the band with inferior sound quality according to the bit rate. For example, when the bit rate is high, the sound quality of the high frequency band is inferior, so the high frequency band is corrected. On the other hand, when the bit rate is low, the sound quality of the low frequency band is inferior, so the low frequency band is corrected.

図１３は、本実施例１に係るオーディオ符号化装置の効果を説明するための図である。図１３において、スペクトラム８１ａ、時間波形８２ａは、原音（正解）のスペクトラムおよび時間波形である。一例として、チェンバロの響きが衰退するトーン（１６ｂｉｔ、４８ｋＨｚ、ｍｏｎｏ）を原音として用いた。また、低域と高域との境界を６．７ｋＨｚとした。 FIG. 13 is a diagram for explaining the effect of the audio coding device according to the first embodiment. In FIG. 13, the spectrum 81a and the time waveform 82a are the spectrum and the time waveform of the original sound (correct answer). As an example, a tone (16 bit, 48 kHz, mono) in which the harpsichord reverberates is used as the original sound. The boundary between the low range and the high range was set to 6.7 kHz.

スペクトラム８１ｂ、時間波形８２ｂは、従来の符号化装置１０が符号化したストリームを復号装置２０が復号した信号に関する、スペクトラムおよび時間波形である。スペクトラム８１ｃ、時間波形８２ｃは、オーディオ符号化装置１００が符号化したストリームを復号装置２０が復号した信号に関する、スペクトラムおよび時間波形である。 The spectrum 81b and thetime waveform 82b are spectrum and time waveforms relating to a signal obtained by decoding the stream encoded by theconventional coding device 10 by thedecoding device 20. The spectrum 81c and thetime waveform 82c are spectrum and time waveforms relating to a signal obtained by decoding the stream encoded by theaudio coding device 100 by thedecoding device 20.

スペクトラム８１ａ〜８１ｃの横軸は時間に対応する軸であり、縦軸は周波数に対応する軸である。また、スペクトラム８１ａ〜８１ｃでは、明暗により電力値の大小を表しており、明るい部分は、電力が大きく、暗い部分は、電力が小さい。時間波形８２ａ〜８２ｃの横軸は時間に対応する軸であり、縦軸は振幅に対応する軸である。 The horizontal axis of the spectra 81a to 81c is the axis corresponding to time, and the vertical axis is the axis corresponding to frequency. Further, in the spectra 81a to 81c, the magnitude of the electric power value is indicated by light and dark, and the bright portion has a large electric power and the dark portion has a small electric power. The horizontal axis of the time waveforms 82a to 82c is the axis corresponding to time, and the vertical axis is the axis corresponding to the amplitude.

スペクトラム８１ａ〜８１ｃを比較、および、時間波形８２ａ〜８２ｃを比較すると、従来技術と比較して、オーディオ符号化装置１００の符号化は、ゆらぎが抑えられ、音質劣化を抑止できている。 Comparing the spectra 81a to 81c and comparing the time waveforms 82a to 82c, the coding of theaudio coding apparatus 100 suppresses fluctuations and suppresses deterioration of sound quality as compared with the prior art.

ところで、図２に示したオーディオ符号化装置１００は、低域補正部１４０および高域補正部１６０のいずれか一方を有していれば良く、必ずしも、低域補正部１４０および高域補正部１６０の双方を有していなくても良い。 By the way, theaudio coding device 100 shown in FIG. 2 need only have either the lowfrequency correction unit 140 or the highfrequency correction unit 160, and does not necessarily have the lowfrequency correction unit 140 and the highfrequency correction unit 160. It is not necessary to have both of them.

たとえば、オーディオ符号化装置１００が、低域補正部１４０を有し、高域補正部１６０を有していない場合には、境界のトーンが検出される度に、低域補正部１４０が、低域信号の補正を行う。一方、オーディオ符号化装置１００が、低域補正部１４０を有しておらず、高域補正部１６０を有している場合には、境界のトーンが検出される度に、高域補正部１６０が、高域情報の包絡電力の補正を行う。このように構成することで、オーディオ符号化装置１００のハードウェア資源を節約しつつ、音信号の劣化を抑止することができる。 For example, when theaudio coding device 100 has a lowfrequency correction unit 140 and does not have a highfrequency correction unit 160, the lowfrequency correction unit 140 is low every time a boundary tone is detected. Correct the area signal. On the other hand, when theaudio coding device 100 does not have the lowfrequency correction unit 140 and has the highfrequency correction unit 160, the highfrequency correction unit 160 is used every time the boundary tone is detected. However, the envelope power of the high frequency information is corrected. With this configuration, it is possible to suppress the deterioration of the sound signal while saving the hardware resources of theaudio coding device 100.

図１４は、本実施例２に係るオーディオ符号化装置の構成を示す機能ブロック図である。図１４に示すように、このオーディオ符号化装置２００は、判定部２１０と、入力信号補正部２２０とを有する。オーディオ符号化装置２００は、低域信号抽出部１１０、高域情報抽出部１２０、低域符号化部１５０、高域符号化部１７０、多重化部１８０を有する。 FIG. 14 is a functional block diagram showing the configuration of the audio coding device according to the second embodiment. As shown in FIG. 14, theaudio coding device 200 includes adetermination unit 210 and an inputsignal correction unit 220. Theaudio coding device 200 includes a low-frequencysignal extraction unit 110, a high-frequencyinformation extraction unit 120, a low-frequency coding unit 150, a high-frequency coding unit 170, and amultiplexing unit 180.

判定部２１０は、外部装置から音信号を取得し、音信号の低域と高域との境界にトーンが含まれるか否かを判定する処理部である。また、判定部２１０は、境界にトーンが含まれると判定した場合に、制御信号と、トーン電力を入力信号補正部２２０に出力する。判定部２１０が、境界にトーンが含まれるか否かを判定する処理は、実施例１に示した判定部１３０の処理と同様である。 Thedetermination unit 210 is a processing unit that acquires a sound signal from an external device and determines whether or not a tone is included in the boundary between the low frequency band and the high frequency band of the sound signal. Further, thedetermination unit 210 outputs the control signal and the tone power to the inputsignal correction unit 220 when it is determined that the boundary includes the tone. The process of determining whether or not the boundary includes a tone by thedetermination unit 210 is the same as the process of thedetermination unit 130 shown in the first embodiment.

入力信号補正部２２０は、判定部２１０から制御信号を受け付けた場合に、音信号に含まれる境界のトーン成分を抑圧することで、音信号を補正する処理部である。入力信号補正部２２０は、補正した音信号を、低域信号抽出部１１０に出力する。 The inputsignal correction unit 220 is a processing unit that corrects the sound signal by suppressing the tone component of the boundary included in the sound signal when the control signal is received from thedetermination unit 210. The inputsignal correction unit 220 outputs the corrected sound signal to the low frequencysignal extraction unit 110.

図１５は、本実施例２に係る入力信号補正部の構成を示す機能ブロック図である。図１５に示すように、この入力信号補正部２２０は、スイッチ２２１と、抑圧ゲイン算出部２２２と、平滑化部２２３と、トーン抑圧部２２４とを有する。 FIG. 15 is a functional block diagram showing a configuration of an input signal correction unit according to the second embodiment. As shown in FIG. 15, the inputsignal correction unit 220 includes aswitch 221, a suppressiongain calculation unit 222, a smoothingunit 223, and atone suppression unit 224.

スイッチ２２１は、判定部２１０から取得する制御信号に応じて、音信号の経路を切り替えるスイッチである。スイッチ２２１は、制御信号を受け付けていない場合には、端子２２１ａと端子２２１ｂとを接続することで、音信号をそのまま通過させる。スイッチ２２１は、制御信号を受け付けた場合には、端子２２１ａと端子２２１ｃとを接続することで、音信号を、トーン抑圧部２２４に入力する。 Theswitch 221 is a switch that switches the path of the sound signal according to the control signal acquired from thedetermination unit 210. When theswitch 221 does not receive the control signal, theswitch 221 connects the terminal 221a and the terminal 221b so that the sound signal can be passed as it is. When theswitch 221 receives the control signal, theswitch 221 connects the terminal 221a and the terminal 221c to input the sound signal to thetone suppression unit 224.

抑圧ゲイン算出部２２２は、音信号の境界に位置するトーンを動的マスキング閾値以下に抑圧するゲインを算出する処理部である。抑圧ゲイン算出部２２２は、抑圧ゲインを平滑化部２２３に出力する。抑圧ゲイン算出部２２２が、抑圧ゲインを算出する処理は、実施例１に示した抑圧ゲイン算出部１４２の処理に対応する。 The suppressiongain calculation unit 222 is a processing unit that calculates a gain that suppresses the tone located at the boundary of the sound signal to the dynamic masking threshold value or less. The suppressiongain calculation unit 222 outputs the suppression gain to thesmoothing unit 223. The process of calculating the suppression gain by the suppressiongain calculation unit 222 corresponds to the processing of the suppressiongain calculation unit 142 shown in the first embodiment.

平滑化部２２３は、音信号のトーン成分をなだらかに抑圧するために、段階的に大きくする抑圧ゲインを、トーン抑圧部２２４に出力する処理部である。たとえば、平滑化部２２３は、抑圧ゲインを初期値から徐々に大きくしていき、最終的に、抑圧ゲインの大きさを、抑圧ゲイン算出部２２２から通知される抑圧ゲインの大きさになるように調整する。 The smoothingunit 223 is a processing unit that outputs a suppression gain that is gradually increased to thetone suppression unit 224 in order to gently suppress the tone component of the sound signal. For example, the smoothingunit 223 gradually increases the suppression gain from the initial value, and finally sets the magnitude of the suppression gain to the magnitude of the suppression gain notified from the suppressiongain calculation unit 222. adjust.

トーン抑圧部２２４は、平滑化部２２３から取得する抑圧ゲインを、音信号の境界のトーン成分に乗算することで、境界のトーンを抑圧し、低域信号を補正する処理部である。トーン抑圧部２２４は、補正した音信号を、低域信号抽出部１１０に出力する。 Thetone suppression unit 224 is a processing unit that suppresses the boundary tone and corrects the low frequency signal by multiplying the suppression gain acquired from the smoothingunit 223 by the tone component at the boundary of the sound signal. Thetone suppression unit 224 outputs the corrected sound signal to the low frequencysignal extraction unit 110.

図１４の説明に戻る。低域信号抽出部１１０、高域情報抽出部１２０、低域符号化部１５０、高域符号化部１７０、多重化部１８０に関する説明は、実施例１に示した、低域信号抽出部１１０、高域情報抽出部１２０、低域符号化部１５０、高域符号化部１７０、多重化部１８０に関する説明と同様であるため、同一の符号を付して説明を省略する。 Returning to the description of FIG. The description of the low-frequencysignal extraction unit 110, the high-frequencyinformation extraction unit 120, the low-frequency coding unit 150, the high-frequency coding unit 170, and themultiplexing unit 180 is described in the low-frequencysignal extraction unit 110, which is shown in the first embodiment. Since the description is the same as that for the high frequencyinformation extraction unit 120, the lowfrequency coding unit 150, the highfrequency coding unit 170, and themultiplexing unit 180, the same reference numerals are given and the description thereof will be omitted.

次に、本実施例２に係るオーディオ符号化装置２００の効果について説明する。トーンが低域・高域の境界にあることを検出した場合に、音信号の境界のトーンを抑圧した後に、低域符号および高域符号を多重化したストリームを生成する。これにより、音信号の音質が劣化することを抑止することができる。また、原音となる音信号のトーンを抑圧しておくため、低域、高域のいずれのトーンを抑圧するのかを判定する処理をスキップすることができるため、処理負荷を軽減することができる。また、ハードウェア資源を節約することも可能になる。 Next, the effect of theaudio coding device 200 according to the second embodiment will be described. When it is detected that the tone is at the boundary between the low and high frequencies, the tone at the boundary of the sound signal is suppressed, and then a stream in which the low and high frequencies are multiplexed is generated. As a result, it is possible to prevent the sound quality of the sound signal from deteriorating. Further, since the tone of the sound signal as the original sound is suppressed, it is possible to skip the process of determining which tone of the low frequency band or the high frequency band is suppressed, so that the processing load can be reduced. It also makes it possible to save hardware resources.

図１６Ａは、本実施例３に係るオーディオ符号化装置の構成を示す機能ブロック図である。図１６Ａに示すように、オーディオ符号化装置３００は、低域信号抽出部１１０、高域情報抽出部１２０、高域符号化部１７０、多重化部１８０、補正制御部３１０、低域符号化部３２０を有する。 FIG. 16A is a functional block diagram showing the configuration of the audio coding device according to the third embodiment. As shown in FIG. 16A, theaudio coding device 300 includes a low frequencysignal extraction unit 110, a high frequencyinformation extraction unit 120, a highfrequency coding unit 170, amultiplexing unit 180, acorrection control unit 310, and a low frequency coding unit. It has 320.

低域信号抽出部１１０、高域情報抽出部１２０、高域符号化部１７０、多重化部１８０に関する説明は、実施例１に説明した低域信号抽出部１１０、高域情報抽出部１２０、高域符号化部１７０、多重化部１８０に関する説明と同様である。 The description of the low-frequencysignal extraction unit 110, the high-frequencyinformation extraction unit 120, the high-frequency coding unit 170, and themultiplexing unit 180 is described in the low-frequencysignal extraction unit 110, the high-frequencyinformation extraction unit 120, and the high-frequencyinformation extraction unit 120 described in the first embodiment. This is the same as the description regarding theregion coding unit 170 and themultiplexing unit 180.

補正制御部３１０は、低域信号を符号化する際の対象となる帯域を制限する処理部である。補正制御部３１０は、符号化部の一例である。実施例３に関して、以下の説明では、低域信号を符号化する際の対象となる帯域を、「符号化対象帯域」と表記する。 Thecorrection control unit 310 is a processing unit that limits a band that is a target when encoding a low-frequency signal. Thecorrection control unit 310 is an example of a coding unit. With respect to the third embodiment, in the following description, the target band when encoding the low frequency signal is referred to as “encoding target band”.

図１６Ｂは、本実施例３に係る補正制御部の処理を説明するための図である。図１６Ｂに示す周波数スペクトル８５の横軸は、周波数に対応する軸であり、縦軸は、音信号の電力（値）に対応する軸である。たとえば、音信号の境界８６には、トーン８６ａが存在している。 FIG. 16B is a diagram for explaining the processing of the correction control unit according to the third embodiment. The horizontal axis of thefrequency spectrum 85 shown in FIG. 16B is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value) of the sound signal. For example, thetone 86a exists at theboundary 86 of the sound signal.

たとえば、符号化対象帯域のデフォルトの帯域は、符号化対象帯域８７ａとなる。補正制御部３１０は、符号化対象帯域８７ａを、符号化対象帯域８７ｂに補正する。たとえば、補正制御部３１０は、符号化対象帯域８７ｂは、符号化対象帯域８７ａの上限を１サブバンド分低域にずらしたものに対応する。補正制御部３１０は、補正した符号化対象帯域の情報を、低域符号化部３２０に出力する。 For example, the default band of the coding target band is thecoding target band 87a. Thecorrection control unit 310 corrects thecoding target band 87a to thecoding target band 87b. For example, thecorrection control unit 310 corresponds to thecoding target band 87b in which the upper limit of thecoding target band 87a is shifted to a low band by one subband. Thecorrection control unit 310 outputs the corrected coding target band information to the lowfrequency coding unit 320.

低域符号化部３２０は、低域信号抽出部１１０から低域信号を取得し、低域信号をビット列に符号化することで、低域符号を生成する処理部である。低域符号化部３２０は、低域符号を、多重化部１８０に出力する。なお、低域符号化部３２０は、補正制御部３１０から受け付ける、符号化対象帯域８７ｂに含まれる低域信号を符号化する。符号化対象帯域８７ｂには、境界８６のトーン８６ａが含まれないので、このトーン８６ａが、低域符号に含まれないこととなり、結果として、音質劣化を抑止することが可能となる。 The low-frequency coding unit 320 is a processing unit that generates a low-frequency code by acquiring a low-frequency signal from the low-frequencysignal extraction unit 110 and encoding the low-frequency signal into a bit string. The lowfrequency coding unit 320 outputs the low frequency code to themultiplexing unit 180. The low-frequency coding unit 320 encodes the low-frequency signal included in thecoding target band 87b received from thecorrection control unit 310. Since thecoding target band 87b does not include thetone 86a at theboundary 86, thetone 86a is not included in the low frequency code, and as a result, deterioration of sound quality can be suppressed.

次に、本実施例３に係るオーディオ符号化装置３００の効果について説明する。オーディオ符号化装置３００は、低域信号を符号化する場合に、トーンの存在する境界を除いた、符号化対象帯域の音信号に対して符号化を行う。これにより、境界のトーンが、低域信号に含まれないため、音質劣化を抑止することが可能となる。 Next, the effect of theaudio coding device 300 according to the third embodiment will be described. When encoding a low-frequency signal, theaudio coding device 300 encodes the sound signal in the coding target band excluding the boundary where the tone exists. As a result, the boundary tone is not included in the low-frequency signal, so that deterioration of sound quality can be suppressed.

図１７Ａは、本実施例４に係るオーディオ符号化装置の構成を示す機能ブロック図である。図１７Ａに示すように、オーディオ符号化装置３０１は、低域信号抽出部１１０、低域符号化部１５０、高域符号化部１７０、多重化部１８０、補正制御部３０２、高域情報抽出部３０３を有する。 FIG. 17A is a functional block diagram showing the configuration of the audio coding device according to the fourth embodiment. As shown in FIG. 17A, the audio coding device 301 includes a low frequencysignal extraction unit 110, a lowfrequency coding unit 150, a highfrequency coding unit 170, amultiplexing unit 180, acorrection control unit 302, and a high frequency information extraction unit. It has 303.

低域信号抽出部１１０、低域符号化部１５０、高域符号化部１７０、多重化部１８０に関する説明は、実施例１に説明した低域信号抽出部１１０、低域符号化部１５０、高域符号化部１７０、多重化部１８０に関する説明と同様である。 The description of the low-frequencysignal extraction unit 110, the low-frequency coding unit 150, the high-frequency coding unit 170, and themultiplexing unit 180 describes the low-frequencysignal extraction unit 110, the low-frequency coding unit 150, and the high-frequency coding unit 150 described in the first embodiment. This is the same as the description regarding theregion coding unit 170 and themultiplexing unit 180.

補正制御部３０２は、高域信号を符号化する際の対象となる帯域を制限する処理部である。補正制御部３０２は、符号化部の一例である。実施例４に関して、以下の説明では、高域信号を符号化する際の対象となる帯域を、「符号化対象帯域」と表記する。 Thecorrection control unit 302 is a processing unit that limits a band that is a target when encoding a high-frequency signal. Thecorrection control unit 302 is an example of a coding unit. With respect to the fourth embodiment, in the following description, the target band when encoding the high frequency signal is referred to as “encoding target band”.

図１７Ｂは、本実施例４に係る補正制御部の処理を説明するための図である。図１７Ｂに示す周波数スペクトル８５の横軸は、周波数に対応する軸であり、縦軸は、音信号の電力（値）に対応する軸である。たとえば、音信号の境界８６には、トーン８６ａが存在している。 FIG. 17B is a diagram for explaining the processing of the correction control unit according to the fourth embodiment. The horizontal axis of thefrequency spectrum 85 shown in FIG. 17B is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value) of the sound signal. For example, thetone 86a exists at theboundary 86 of the sound signal.

たとえば、符号化対象帯域のデフォルトの帯域は、符号化対象帯域８９ａとなる。補正制御部３０２は、符号化対象帯域８９ａを、符号化対象帯域８９ｂに補正する。たとえば、符号化対象帯域８９ｂは、符号化対象帯域８９ａの下限を１サブバンド分高域にずらしたものに対応する。補正制御部３０２は、補正した符号化対象帯域の情報を、高域情報抽出部３０３に出力する。 For example, the default band of the coding target band is thecoding target band 89a. Thecorrection control unit 302 corrects thecoding target band 89a to thecoding target band 89b. For example, thecoding target band 89b corresponds to the one in which the lower limit of thecoding target band 89a is shifted to the height region by one subband. Thecorrection control unit 302 outputs the corrected coded target band information to the high frequencyinformation extraction unit 303.

高域情報抽出部３０３は、外部装置から音信号を取得し、音信号の高域（図１７Ｂに示した符号化対象帯域８９ｂ）から、高域情報を抽出する処理部である。高域情報抽出部３０３は、高域情報を、高域符号化部１７０に出力する。図１７Ｂで説明したように、符号化対象帯域８９ｂには、トーン８６ａが存在していない。 The high frequencyinformation extraction unit 303 is a processing unit that acquires a sound signal from an external device and extracts high frequency information from the high frequency band of the sound signal (encodingtarget band 89b shown in FIG. 17B). The high frequencyinformation extraction unit 303 outputs high frequency information to the highfrequency coding unit 170. As described with reference to FIG. 17B, thetone 86a does not exist in thecoding target band 89b.

次に、本実施例４に係るオーディオ符号化装置３０１の効果について説明する。オーディオ符号化装置３０１は、高域信号を符号化する場合に、トーンの存在する境界を除いた、符号化対象帯域の音信号に対して符号化を行う。これにより、境界のトーンが、高域信号に含まれないため、音質劣化を抑止することが可能となる。 Next, the effect of the audio coding device 301 according to the fourth embodiment will be described. When encoding a high-frequency signal, the audio coding device 301 encodes the sound signal in the coding target band excluding the boundary where the tone exists. As a result, since the boundary tone is not included in the high frequency signal, it is possible to suppress the deterioration of sound quality.

図１８は、本実施例５に係るオーディオ符号化装置の構成を示す機能ブロック図である。図１８に示すように、このオーディオ符号化装置４００の構成は、低域信号抽出部１１０、高域情報抽出部１２０、判定部１３０、低域補正部１４０、低域符号化部１５０、高域符号化部１７０、多重化部１８０、高域補正部４１０を有する。高域補正部４１０は、符号化部の一例である。 FIG. 18 is a functional block diagram showing the configuration of the audio coding device according to the fifth embodiment. As shown in FIG. 18, the configuration of theaudio coding device 400 includes a low frequencysignal extraction unit 110, a high frequencyinformation extraction unit 120, adetermination unit 130, a lowfrequency correction unit 140, a lowfrequency coding unit 150, and a high frequency band. It has acoding unit 170, amultiplexing unit 180, and a highfrequency correction unit 410. The highfrequency correction unit 410 is an example of a coding unit.

低域信号抽出部１１０、高域情報抽出部１２０、判定部１３０、低域補正部１４０、低域符号化部１５０、高域符号化部１７０、多重化部１８０に関する説明は、図２に示した各処理部に関する説明と同様であるため、同様の符号を付して説明を省略する。 A description of the low-frequencysignal extraction unit 110, the high-frequencyinformation extraction unit 120, thedetermination unit 130, the low-frequency correction unit 140, the low-frequency coding unit 150, the high-frequency coding unit 170, and themultiplexing unit 180 is shown in FIG. Since the description is the same as that for each processing unit, the description will be omitted with the same reference numerals.

高域補正部４１０は、判定部１３０から制御信号を受け付けた場合に、高域情報に含まれるトーン周波数を補正することで、高域情報を補正する処理部である。たとえば、トーン周波数の情報には、分解能に応じて分割された高域の複数の帯域について、トーンの有無に関する情報が含まれている。高域補正部４１０は、境界に対応する帯域のトーンの有無が「有」となっている場合に、境界に対応する帯域のトーンの有無を「無」に補正する。 The highfrequency correction unit 410 is a processing unit that corrects the high frequency information by correcting the tone frequency included in the high frequency information when the control signal is received from thedetermination unit 130. For example, the tone frequency information includes information on the presence or absence of tones for a plurality of high frequency bands divided according to the resolution. When the presence / absence of the tone in the band corresponding to the boundary is "Yes", the highfrequency correction unit 410 corrects the presence / absence of the tone in the band corresponding to the boundary to "No".

図１９は、本実施例５に係る高域補正部の構成を示す機能ブロック図である。図１９に示すように、この高域補正部４１０は、スイッチ４１１と、追加トーン抑圧部４１２とを有する。 FIG. 19 is a functional block diagram showing a configuration of a high frequency correction unit according to the fifth embodiment. As shown in FIG. 19, the highfrequency correction unit 410 includes aswitch 411 and an additionaltone suppression unit 412.

スイッチ４１１は、判定部１３０から取得する制御信号に応じて、高域情報の経路を切り替えるスイッチである。スイッチ４１１は、制御信号を受け付けていない場合には、端子４１１ａと端子４１１ｂとを接続することで、高域情報をそのまま通過させる。スイッチ４１１は、制御信号を受け付けた場合には、端子４１１ａと端子４１１ｃとを接続することで、高域情報を、追加トーン抑圧部４１２に入力する。 Theswitch 411 is a switch that switches the path of high frequency information according to the control signal acquired from thedetermination unit 130. When theswitch 411 does not receive the control signal, theswitch 411 connects the terminal 411a and the terminal 411b so that the high frequency information can be passed as it is. When theswitch 411 receives the control signal, theswitch 411 connects the terminal 411a and the terminal 411c to input high frequency information to the additionaltone suppression unit 412.

追加トーン抑圧部４１２は、高域情報に含まれるトーン周波数を補正する処理部である。図２０は、本実施例５に係る高域補正部の処理を説明するための図である。図２０において、周波数スペクトル９０の横軸は、周波数に対応する軸であり、縦軸は、信号の電力に対応する軸である。図２０に示す例では、境界９１にトーン９２が含まれている。 The additionaltone suppression unit 412 is a processing unit that corrects the tone frequency included in the high frequency information. FIG. 20 is a diagram for explaining the processing of the high frequency correction unit according to the fifth embodiment. In FIG. 20, the horizontal axis of thefrequency spectrum 90 is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power of the signal. In the example shown in FIG. 20, theboundary 91 includes thetone 92.

たとえば、トーン周波数は、該当する帯域にトーンがあるか否かを「０」または「１」で示す情報であり、分割される帯域の細かさは、周波数分解能に応じたものとなる。トーンがある場合には、該当する帯域のブロックに「１」が設定され、トーンがない場合には、該当する帯域のブロックに「０」が設定される。 For example, the tone frequency is information indicating whether or not there is a tone in the corresponding band by "0" or "1", and the fineness of the divided band depends on the frequency resolution. If there is a tone, "1" is set in the block of the corresponding band, and if there is no tone, "0" is set in the block of the corresponding band.

図２０に示したトーン周波数９５ａ，９５ｂは、各帯域対応するブロック２１〜２５が含まれる。このうち、ブロック２１が、境界９１の帯域に対応するブロックとなる。トーン周波数９５ａは、補正前のトーン周波数であり、トーン周波数９５ｂは、補正後のトーン周波数である。 Thetone frequencies 95a and 95b shown in FIG. 20 includeblocks 21 to 25 corresponding to each band. Of these, theblock 21 is a block corresponding to the band of theboundary 91. Thetone frequency 95a is the tone frequency before correction, and thetone frequency 95b is the tone frequency after correction.

追加トーン抑圧部４１２は、トーン周波数９５ａのブロック２１に「１」が設定されている場合には、ブロック２１を「０」に補正することで、トーン周波数９５ｂを生成する。追加トーン抑圧部４１２は、補正したトーン周波数９５ｂと、包絡電力と、周波数分解能とを含む高域情報を、高域符号化部１７０に出力する。 When theblock 21 of thetone frequency 95a is set to "1", the additionaltone suppression unit 412 generates thetone frequency 95b by correcting theblock 21 to "0". The additionaltone suppression unit 412 outputs high frequency information including the correctedtone frequency 95b, the envelope power, and the frequency resolution to the highfrequency coding unit 170.

次に、本実施例５に係るオーディオ符号化装置４００の効果について説明する。オーディオ符号化装置４００は、境界にトーンが存在する場合には、高域情報のトーン周波数を補正することで、境界にトーンが存在しないことにする。これにより、補正された高域情報に基づいて復号される高域信号の境界には、トーンが生成されなくなるため、音質劣化を抑止することが可能となる。 Next, the effect of theaudio coding device 400 according to the fifth embodiment will be described. When the tone exists at the boundary, theaudio coding apparatus 400 corrects the tone frequency of the high frequency information so that the tone does not exist at the boundary. As a result, no tone is generated at the boundary of the high frequency signal decoded based on the corrected high frequency information, so that deterioration of sound quality can be suppressed.

ところで、実施例１〜５に示したオーディオ符号化装置１００〜４００の処理は一例である。ここでは、オーディオ符号化装置のその他の処理について説明を行う。ここでは、図２に示したオーディオ符号化装置１００のブロック図を用いて説明を行う。 By the way, the processing of theaudio coding devices 100 to 400 shown in Examples 1 to 5 is an example. Here, other processing of the audio coding apparatus will be described. Here, a block diagram of theaudio coding apparatus 100 shown in FIG. 2 will be used for description.

オーディオ符号化装置１００の判定部１３０は、低域の誤差電力と、高域の誤差電力とを比較して、低域のトーンを抑圧するのか、高域のトーンを抑圧するのかを判定しても良い。 Thedetermination unit 130 of theaudio coding device 100 compares the low-frequency error power with the high-frequency error power, and determines whether to suppress the low-frequency tone or the high-frequency tone. Is also good.

たとえば、音信号（原音）の低域信号を第１低域信号と表記し、低域符号を復号した低域信号を第２低域信号と表記する。低域の誤差電力を、第１低域信号と第２低域信号との差分値とする。音信号（原音）の高域信号を第１高域信号と表記し、高域符号に基づき復号した高域信号を第２高域信号と表記する。高域の誤差電力を、第１高域信号と第２高域信号との差分値とする。 For example, the low-frequency signal of the sound signal (original sound) is referred to as a first low-frequency signal, and the low-frequency signal obtained by decoding the low-frequency code is referred to as a second low-frequency signal. The low-frequency error power is defined as the difference value between the first low-frequency signal and the second low-frequency signal. The high-frequency signal of the sound signal (original sound) is referred to as a first high-frequency signal, and the high-frequency signal decoded based on the high-frequency code is referred to as a second high-frequency signal. The high frequency error power is defined as the difference value between the first high frequency signal and the second high frequency signal.

判定部１３０は、低域の誤差電力＞高域の誤差電力となる場合には、高域のトーンを抑圧すると判定する。一方、判定部１３０は、低域の誤差電力≦高域の誤差電力となる場合には、低域のトーンを抑圧すると判定する。 Thedetermination unit 130 determines that the high frequency tone is suppressed when the low frequency error power> the high frequency error power. On the other hand, thedetermination unit 130 determines that the low frequency tone is suppressed when the low frequency error power ≤ the high frequency error power.

図２１は、判定部のその他の処理手順を示すフローチャートである。図２１に示すように、オーディオ符号化装置１００の判定部１３０は、トーン検出結果が、トーン有りであるか否かを判定する（ステップＳ４０１）。判定部１３０は、トーン検出結果が、トーン有りでない場合には（ステップＳ４０１，Ｎｏ）、補正処理を実行しない旨の制御信号を出力する（ステップＳ４０２）。なお、ステップＳ４０２において、判定部１３０は、補正処理を実行しないと判定した場合に、制御信号の出力を抑止しても良い。 FIG. 21 is a flowchart showing other processing procedures of the determination unit. As shown in FIG. 21, thedetermination unit 130 of theaudio coding device 100 determines whether or not the tone detection result has a tone (step S401). If the tone detection result does not have a tone (step S401, No), thedetermination unit 130 outputs a control signal indicating that the correction process is not executed (step S402). In step S402, thedetermination unit 130 may suppress the output of the control signal when it is determined that the correction process is not executed.

判定部１３０は、トーン検出結果が、トーン有りの場合には（ステップＳ４０１，Ｙｅｓ）、低域の誤差電力＞高域の誤差電力となるか否かを判定する（ステップＳ４０３）。判定部１３０は、低域の誤差電力＞高域の誤差電力となる場合には（ステップＳ４０３，Ｙｅｓ）、高域補正を行う旨の、制御信号を高域補正部１６０に出力する（ステップＳ４０４）。 When the tone detection result has a tone (step S401, Yes), thedetermination unit 130 determines whether or not the low frequency error power> the high frequency error power (step S403). When the low frequency error power> the high frequency error power (step S403, Yes), thedetermination unit 130 outputs a control signal to the highfrequency correction unit 160 to perform high frequency correction (step S404). ).

判定部１３０は、低域の誤差電力＞高域の誤差電力とならない場合には（ステップＳ４０３，Ｎｏ）、低域補正を行う旨の、制御信号を低域補正部１４０に出力する（ステップＳ４０５）。 When the low-frequency error power> the high-frequency error power does not hold (step S403, No), thedetermination unit 130 outputs a control signal to the low-frequency correction unit 140 to perform low-frequency correction (step S405). ).

上記のように、低域の誤差電力と、高域の誤差電力との比較により、実際にトーンを抑圧した帯域が適切であったか否かをフィードバックすることで、音質を改善するための、トーンを抑圧する帯域を適切に選択することができる。 As described above, by comparing the error power in the low frequency range with the error power in the high frequency range, feedback is given as to whether or not the band in which the tone is actually suppressed is appropriate, so that the tone can be adjusted to improve the sound quality. The band to be suppressed can be appropriately selected.

実施例６の説明を行う前に、上述した実施例１で説明したオーディオ符号化装置１００の問題について説明する。オーディオ符号化装置１００により生成された符号化ストリームを復号装置２０が復号する際、復号装置２０の逆フィルタモードの設定に応じて、図２２で説明するように、復号後の音信号の品質が劣化する場合がある。 Before explaining the sixth embodiment, the problem of theaudio coding apparatus 100 described in the first embodiment described above will be described. When thedecoding device 20 decodes the coded stream generated by theaudio coding device 100, the quality of the sound signal after decoding is determined according to the setting of the inverse filter mode of thedecoding device 20, as described with reference to FIG. It may deteriorate.

図２２は、オーディオ符号化装置の問題を説明するための図である。図２２に示す音信号の周波数スペクトル９０１は、横軸が周波数に対応する軸であり、縦軸が電力（値）に対応する軸である。周波数スペクトル９０１の低域と高域との境界９０２の付近には、トーン９０３が含まれている。 FIG. 22 is a diagram for explaining a problem of the audio coding device. In thefrequency spectrum 901 of the sound signal shown in FIG. 22, the horizontal axis is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value). Thetone 903 is included in the vicinity of theboundary 902 between the low frequency band and the highfrequency frequency spectrum 901.

たとえば、オーディオ符号化装置１００は、境界９０２付近のトーン９０３を検出すると、低域に含まれるトーン９０３を抑圧することで、低域信号を補正し、低域信号を符号化した低域符号を生成する。オーディオ符号化装置１００は、低域符号と、高域情報を符号化した高域符号とを多重化することで、符号化ストリームを生成し、復号装置２０に出力する。 For example, when theaudio coding device 100 detects thetone 903 near theboundary 902, it suppresses thetone 903 included in the low frequency band to correct the low frequency signal and obtain a low frequency code in which the low frequency signal is encoded. Generate. Theaudio coding device 100 generates a coded stream by multiplexing the low-frequency code and the high-frequency code that encodes the high-frequency information, and outputs the coded stream to thedecoding device 20.

復号装置２０は、オーディオ符号化装置１００から受信した符号化ストリームを復号することで、周波数スペクトル９１０を生成することになる。ここで、復号装置２０の処理に応じて、周波数スペクトル９２０が生成される場合がある。周波数スペクトル９１０，９２０に関して、横軸は周波数に対応する軸であり、縦軸は電力（値）に対応する軸である。 Thedecoding device 20 generates thefrequency spectrum 910 by decoding the coded stream received from theaudio coding device 100. Here, thefrequency spectrum 920 may be generated depending on the processing of thedecoding device 20. With respect to thefrequency spectra 910 and 920, the horizontal axis is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value).

周波数スペクトル９１０は、適切に復号された周波数スペクトルであり、境界９１１の付近にトーン９１２が含まれている。一方、周波数スペクトル９２０は、境界９２１の付近にトーンが含まれておらず、音信号の品質が劣化する。 Thefrequency spectrum 910 is an appropriately decoded frequency spectrum, and thetone 912 is included in the vicinity of theboundary 911. On the other hand, thefrequency spectrum 920 does not include a tone in the vicinity of theboundary 921, and the quality of the sound signal deteriorates.

続いて、周波数スペクトル９２０の境界９２１付近において、トーンが生成されない理由について説明する。たとえば、ＳＢＲ技術を利用する復号装置２０には、逆フィルタモードのオン、オフの機能が備わっている。 Next, the reason why the tone is not generated near theboundary 921 of thefrequency spectrum 920 will be described. For example, thedecoding device 20 using the SBR technology has an on / off function of the inverse filter mode.

復号装置２０は、逆フィルタモードが「オフ」となっている場合には、周波数スペクトルの低域をそのまま高域に複製して、音信号を生成する。このように、復号装置２０が、低域の周波数スペクトルをそのまま高域に複製する処理を行う場合には、図２２に示した周波数スペクトル９１０が生成され、音信号の品質が劣化することはない。 When the inverse filter mode is "off", thedecoding device 20 replicates the low frequency band of the frequency spectrum as it is to the high frequency band to generate a sound signal. In this way, when thedecoding device 20 performs the process of duplicating the low frequency spectrum as it is in the high frequency band, thefrequency spectrum 910 shown in FIG. 22 is generated, and the quality of the sound signal is not deteriorated. ..

一方、復号装置２０は、逆フィルタモードが「オン」となっている場合には、周波数スペクトルの低域を非相関化してから高域に複製して、音信号を生成する。このように、復号装置２０が、低域信号を非相関化してから高域を複製すると、高域にトーンが生成されず、図２２に示した周波数スペクトル９２０が生成され、音信号の品質が劣化する。 On the other hand, when the inverse filter mode is "on", thedecoding device 20 uncorrelates the low frequency band of the frequency spectrum and then replicates it to the high frequency band to generate a sound signal. In this way, when thedecoding device 20 uncorrelates the low frequency signal and then duplicates the high frequency band, no tone is generated in the high frequency band, thefrequency spectrum 920 shown in FIG. 22 is generated, and the quality of the sound signal is improved. to degrade.

図２３は、低域の信号を非相関化したことによる問題を説明するための図である。図２３において、各周波数スペクトル９３０〜９３２の横軸は周波数に対応する軸であり、縦軸は電力（値）に対応する軸である。 FIG. 23 is a diagram for explaining the problem caused by uncorrelating the low frequency signal. In FIG. 23, the horizontal axis of each frequency spectrum 930 to 932 is the axis corresponding to the frequency, and the vertical axis is the axis corresponding to the electric power (value).

復号装置２０は、周波数スペクトル９３０の低域を非相関化することで、周波数スペクトル９３１を生成する。復号装置２０は、周波数スペクトル９３１の帯域９３１ａを選択し、選択した帯域９３１ａの周波数スペクトルを高域に複製することで、周波数スペクトル９３２を生成する。復号装置２０は、周波数スペクトル９３２に対して包絡調整を行うことで、最終的な周波数スペクトルを復号する。図２３で説明したように、低域の信号を非相関化してから高域を複製すると、復号された周波数スペクトルには、高域にトーンが生成されない。 Thedecoding device 20 generates the frequency spectrum 931 by uncorrelating the low frequencies of the frequency spectrum 930. Thedecoding device 20 selects the band 931a of the frequency spectrum 931 and duplicates the frequency spectrum of the selected band 931a in a high region to generate the frequency spectrum 932. Thedecoding device 20 decodes the final frequency spectrum by performing envelope adjustment with respect to the frequency spectrum 932. As described with reference to FIG. 23, if the low frequency signal is uncorrelated and then the high frequency is duplicated, no tone is generated in the high frequency spectrum in the decoded frequency spectrum.

図２２および図２３で説明した問題を解消するために、本実施例６に係るオーディオ符号化装置は、逆フィルタモードのオン、オフに応じて、低域信号の補正の有無を制御する。たとえば、オーディオ符号化装置は、逆フィルタモードが「オフ」の場合には、低域信号の補正を行うことで、トーンを抑圧する。一方、オーディオ符号化装置は、逆フィルタモードが「オン」の場合には、低域信号の補正を行わないことで、低域信号のトーンを抑圧しない。このように、逆フィルタモードのオン、オフに応じて、トーンの抑圧の制御を行い、復号装置２０が復号を行う場合における、音信号の品質劣化の問題を解消する。 In order to solve the problems described with reference to FIGS. 22 and 23, the audio coding apparatus according to the sixth embodiment controls the presence / absence of correction of the low frequency signal according to the on / off of the inverse filter mode. For example, the audio coding device suppresses the tone by correcting the low frequency signal when the inverse filter mode is “off”. On the other hand, when the inverse filter mode is "on", the audio coding device does not suppress the tone of the low frequency signal by not correcting the low frequency signal. In this way, the suppression of the tone is controlled according to the on / off of the inverse filter mode, and the problem of quality deterioration of the sound signal when thedecoding device 20 performs decoding is solved.

図２４は、本実施例６に係るシステムの構成を示す図である。図２４に示すように、このシステムは、オーディオ符号化装置６００と、復号装置７００とを有する。オーディオ符号化装置６００は、ネットワーク５０を介して、復号装置７００に接続される。 FIG. 24 is a diagram showing a configuration of the system according to the sixth embodiment. As shown in FIG. 24, this system has anaudio coding device 600 and adecoding device 700. Theaudio coding device 600 is connected to thedecoding device 700 via thenetwork 50.

図２５は、本実施例６に係るオーディオ符号化装置の構成を示す機能ブロック図である。図２５に示すように、このオーディオ符号化装置６００は、符号化部６００ａと、判定部６０４と、多重化部６０９とを有する。符号化部６００ａは、時間周波数変換部６０１と、高域情報抽出部６０２と、高域符号化部６０３と、低域抽出部６０５と、低域補正部６０６と、周波数時間変換部６０７と、低域符号化部６０８とを含む。 FIG. 25 is a functional block diagram showing the configuration of the audio coding device according to the sixth embodiment. As shown in FIG. 25, theaudio coding device 600 includes a coding unit 600a, a determination unit 604, and amultiplexing unit 609. The coding unit 600a includes a time-frequency conversion unit 601, a high-frequencyinformation extraction unit 602, a high-frequency coding unit 603, a low-frequency extraction unit 605, a low-frequency correction unit 606, and a frequency-time conversion unit 607. Includes a low frequency coding unit 608.

時間周波数変換部６０１は、音信号を時間周波数信号に変換する処理部である。時間周波数変換部６０１は、時間周波数信号を、高域情報抽出部６０２、判定部６０４、低域抽出部６０５に出力する。 The time-frequency conversion unit 601 is a processing unit that converts a sound signal into a time-frequency signal. The time frequency conversion unit 601 outputs the time frequency signal to the high frequencyinformation extraction unit 602, the determination unit 604, and the lowfrequency extraction unit 605.

たとえば、時間周波数変換部６０１は、式（３）で定義されるＱＭＦ（Quadrature Mirror Filter）フィルタバンクを用いて、音信号ｓ［ｎ］を、周波数信号Ｓ［ｋ］［ｎ］に変換する。式（３）において、ｎは時間を表す変数であり、ｋは周波数を表す変数である。 For example, the time-frequency conversion unit 601 converts the sound signal s [n] into the frequency signal S [k] [n] by using the QMF (Quadrature Mirror Filter) filter bank defined in the equation (3). In equation (3), n is a variable representing time, and k is a variable representing frequency.

時間周波数変換部６０１は、各時間と各周波数の周波数信号Ｓとを対応づけることで、時間周波数信号Ｌ［ｋ］［ｎ］を生成する。図２６は、時間周波数信号のデータ構造の一例を示す図である。図２６において、横軸は時間に対応する軸であり、縦軸は周波数に対応する軸である。時間周波数信号は、時間毎の周波数スペクトルの情報を含む。たとえば、Ｓ（０，０）、Ｓ（１，０）、・・・Ｓ（６３，０）は、時間ｎ＝０における、周波数と周波数信号Ｓの値（電力値に相当）との関係を示す周波数スペクトルの情報である。 The time-frequency conversion unit 601 generates time-frequency signals L [k] [n] by associating each time with the frequency signal S of each frequency. FIG. 26 is a diagram showing an example of a data structure of a time frequency signal. In FIG. 26, the horizontal axis is the axis corresponding to time, and the vertical axis is the axis corresponding to frequency. The time frequency signal contains information on the frequency spectrum for each hour. For example, S (0,0), S (1,0), ... S (63,0) describes the relationship between the frequency and the value of the frequency signal S (corresponding to the power value) at time n = 0. It is the information of the frequency spectrum shown.

図２５の説明に戻る。高域情報抽出部６０２は、時間周波数信号の高域から、高域情報を抽出する処理部である。高域情報抽出部６０２は、抽出した高域情報を、高域符号化部６０３に出力する。高域情報には、包絡電力、トーン周波数、周波数分解能が含まれる。高域情報を抽出する処理は、実施例１で説明した高域情報抽出部１２０の処理と同様である。 Returning to the description of FIG. The high frequencyinformation extraction unit 602 is a processing unit that extracts high frequency information from the high frequency of the time frequency signal. The high frequencyinformation extraction unit 602 outputs the extracted high frequency information to the highfrequency coding unit 603. High frequency information includes envelope power, tone frequency, and frequency resolution. The process of extracting high-frequency information is the same as the process of the high-frequencyinformation extraction unit 120 described in the first embodiment.

また、高域情報抽出部６０２は、時間周波数信号を基にして、復号装置７００に設定される逆フィルタモードがオンであるかオフであるかを推定する。高域情報抽出部６０２は、推定した逆フィルタモードの情報を、低域補正部６０６に出力する。 Further, the high frequencyinformation extraction unit 602 estimates whether the inverse filter mode set in thedecoding device 700 is on or off based on the time frequency signal. The high frequencyinformation extraction unit 602 outputs the estimated information of the inverse filter mode to the lowfrequency correction unit 606.

高域情報抽出部６０２は、時間周波数信号のトーン成分の平均値を算出する。トーン成分の平均値を「帯域トーン成分」と表記する。高域情報抽出部６０２は、帯域トーン成分を用いて、フレーム内の平均電力を計算する。フレームは、時間周波数信号を所定時間毎に分割したデータに対応する。高域情報抽出部６０２は、前フレームの帯域トーン成分を用いて、現フレームの帯域トーン成分を平滑化する。 The high frequencyinformation extraction unit 602 calculates the average value of the tone components of the time frequency signal. The average value of the tone component is expressed as "band tone component". The high frequencyinformation extraction unit 602 calculates the average power in the frame using the band tone component. The frame corresponds to the data obtained by dividing the time frequency signal into predetermined time intervals. The high frequencyinformation extraction unit 602 smoothes the band tone component of the current frame by using the band tone component of the previous frame.

高域情報抽出部６０２は、平滑化された帯域トーン成分と、平均電力とを基にして、逆フィルタモードのオン、オフを判定する。たとえば、高域情報抽出部６０２は、図２７で説明するような閾値比較を行うことで、逆フィルタレベルを判定する。図２７は、逆フィルタレベルの判定手順を示すフローチャートである。図２７に示す第１〜第４閾値は、事前に設定されているものとする。なお、第１閾値〜第３閾値の大小関係は、第１閾値＜第２閾値＜第３閾値とする。 The high frequencyinformation extraction unit 602 determines whether the inverse filter mode is on or off based on the smoothed band tone component and the average power. For example, the high frequencyinformation extraction unit 602 determines the inverse filter level by performing a threshold comparison as described with reference to FIG. 27. FIG. 27 is a flowchart showing a procedure for determining the inverse filter level. It is assumed that the first to fourth threshold values shown in FIG. 27 are set in advance. The magnitude relationship between the first threshold value and the third threshold value is defined as the first threshold value <second threshold value <third threshold value.

図２７に示すように、高域情報抽出部６０２は、帯域トーン成分が第１閾値未満である場合には（ステップＳ３１，Ｎｏ）、逆フィルタレベル＝０と判定し（ステップＳ３２）、ステップＳ３８に移行する。 As shown in FIG. 27, when the band tone component is less than the first threshold value (step S31, No), the high frequencyinformation extraction unit 602 determines that the inverse filter level = 0 (step S32), and steps S38. Move to.

高域情報抽出部６０２は、帯域トーン成分が第１閾値以上である場合には（ステップＳ３１，Ｙｅｓ）、ステップＳ３３に移行する。高域情報抽出部６０２は、帯域トーン成分が第２閾値未満である場合には（ステップＳ３３，Ｎｏ）、逆フィルタレベル＝１と判定し（ステップＳ３４）、ステップＳ３８に移行する。 When the band tone component is equal to or higher than the first threshold value (step S31, Yes), the high frequencyinformation extraction unit 602 shifts to step S33. When the band tone component is less than the second threshold value (step S33, No), the high frequencyinformation extraction unit 602 determines that the inverse filter level = 1 (step S34), and proceeds to step S38.

高域情報抽出部６０２は、帯域トーン成分が第２閾値以上である場合には（ステップＳ３３，Ｙｅｓ）、ステップＳ３５に移行する。高域情報抽出部６０２は、帯域トーン成分が第３閾値未満である場合には（ステップＳ３５，Ｎｏ）、逆フィルタレベル＝２と判定し（ステップＳ３６）、ステップＳ３８に移行する。 When the band tone component is equal to or higher than the second threshold value (step S33, Yes), the high frequencyinformation extraction unit 602 shifts to step S35. When the band tone component is less than the third threshold value (step S35, No), the high frequencyinformation extraction unit 602 determines that the inverse filter level = 2 (step S36), and proceeds to step S38.

高域情報抽出部６０２は、帯域トーン成分が第３閾値以上である場合には（ステップＳ３５，Ｙｅｓ）、逆フィルタレベル＝３と判定し（ステップＳ３７）、ステップＳ３８に移行する。 When the band tone component is equal to or higher than the third threshold value (step S35, Yes), the high frequencyinformation extraction unit 602 determines that the inverse filter level = 3 (step S37), and proceeds to step S38.

高域情報抽出部６０２は、平均電力が第４閾値未満であるか否かを判定する（ステップＳ３８）。高域情報抽出部６０２は、平均電力が第４閾値未満である場合には（ステップＳ３８，Ｙｅｓ）、逆フィルタレベルを０に更新し（ステップＳ３９）、逆フィルタレベルを判定する処理を終了する。一方、高域情報抽出部６０２は、平均電力が第４閾値以上である場合には（ステップＳ３８，Ｎｏ）、逆フィルタレベルを判定する処理を終了する。 The high frequencyinformation extraction unit 602 determines whether or not the average power is less than the fourth threshold value (step S38). When the average power is less than the fourth threshold value (step S38, Yes), the high frequencyinformation extraction unit 602 updates the inverse filter level to 0 (step S39), and ends the process of determining the inverse filter level. .. On the other hand, when the average power is equal to or higher than the fourth threshold value (steps S38, No), the high frequencyinformation extraction unit 602 ends the process of determining the inverse filter level.

ほとんど無音の信号に対して逆フィルタの処理を行わないようにするため、平均電力が微小な場合には、逆フィルタレベルを「０」に設定する。このため、第４閾値は、微小な値が設定される。 When the average power is very small, the inverse filter level is set to "0" so that the inverse filter processing is not performed on the almost silent signal. Therefore, a minute value is set for the fourth threshold value.

高域情報抽出部６０２は、図２７に示す処理を実行し、逆フィルタレベルが「０」である場合には、逆フィルタモード「オフ」の情報を、低域補正部６０６に出力する。高域情報抽出部６０２は、逆フィルタレベルが「１」以上である場合には、逆フィルタモード「オン」の情報を、低域補正部６０６に出力する。 The high frequencyinformation extraction unit 602 executes the process shown in FIG. 27, and when the inverse filter level is “0”, outputs the information of the inverse filter mode “off” to the lowfrequency correction unit 606. When the inverse filter level is "1" or higher, the high frequencyinformation extraction unit 602 outputs the information of the inverse filter mode "on" to the lowfrequency correction unit 606.

図２５の説明に戻る。高域符号化部６０３は、高域情報を符号化することで、高域符号を生成する処理部である。高域符号化部６０３は、高域符号を、多重化部６０９に出力する。 Returning to the description of FIG. The high-frequency coding unit 603 is a processing unit that generates a high-frequency code by encoding high-frequency information. The highfrequency coding unit 603 outputs the high frequency code to themultiplexing unit 609.

判定部６０４は、時間周波数信号を基にして、音信号の低域と高域との境界にトーンが含まれるか否かを判定する処理部である。判定部６０４は、境界にトーンが含まれると判定した場合には、制御信号を低域補正部６０６に出力する。判定部６０４が、音信号の低域と高域との境界にトーンが含まれるか否かを判定する処理は、判定部１３０の処理と同様である。 The determination unit 604 is a processing unit that determines whether or not a tone is included in the boundary between the low frequency band and the high frequency band of the sound signal based on the time frequency signal. When the determination unit 604 determines that the boundary contains a tone, the determination unit 604 outputs a control signal to the lowfrequency correction unit 606. The process of determining whether or not the tone is included in the boundary between the low frequency band and the high frequency band of the sound signal by the determination unit 604 is the same as the process of thedetermination unit 130.

低域抽出部６０５は、時間周波数信号の低域の情報を抽出する処理部である。低域抽出部６０５は、抽出した低域の情報を、低域補正部６０６に出力する。管理者は、低域の上限周波数を予め設定しておくものとする。 The lowfrequency extraction unit 605 is a processing unit that extracts low frequency information of the time frequency signal. The lowfrequency extraction unit 605 outputs the extracted low frequency information to the lowfrequency correction unit 606. The administrator shall set the upper limit frequency of the low frequency in advance.

低域補正部６０６は、逆フィルタモードの情報および制御信号を基にして、低域補正を行う処理部である。具体的に、低域補正部６０６は、逆フィルタモードが「オフ」であり、かつ、制御信号を受信した場合（トーンが含まれる場合）に、低域補正を行う。低域補正部６０６は、時間周波数信号の低域に対して、低域補正を行う。たとえば、低域補正部６０６は、時間周波数信号の低域に含まれるトーン成分を抑圧することで、低域補正を行う。低域補正部６０６は、低域補正を行った時間周波数信号を、周波数時間変換部６０７に出力する。 The lowfrequency correction unit 606 is a processing unit that performs low frequency correction based on the information of the inverse filter mode and the control signal. Specifically, the low-frequency correction unit 606 performs low-frequency correction when the inverse filter mode is "off" and a control signal is received (when a tone is included). The lowfrequency correction unit 606 corrects the low frequency of the time frequency signal. For example, the lowfrequency correction unit 606 performs low frequency correction by suppressing the tone component included in the low frequency of the time frequency signal. The lowfrequency correction unit 606 outputs the time frequency signal for which low frequency correction has been performed to the frequency time conversion unit 607.

一方、低域補正部６０６は、逆フィルタモードが「オン」の場合、または、制御信号を受信しない場合（トーンが含まれない場合）に、低域補正を行わず、時間周波数信号の低域の情報を、周波数時間変換部６０７に出力する。 On the other hand, when the inverse filter mode is "on" or when the control signal is not received (when the tone is not included), the lowfrequency correction unit 606 does not perform the low frequency correction and the low frequency of the time frequency signal. Is output to the frequency time conversion unit 607.

図２８は、本実施例６に係る低域補正部の処理手順を示すフローチャートである。図２８に示すように、低域補正部６０６は、逆フィルタモードがオンであるか否かを判定する（ステップＳ５０）。低域補正部６０６は、逆フィルタモードがオンである場合には（ステップＳ５０，Ｙｅｓ）、トーンを抑圧していない時間周波数信号の低域の情報を周波数時間変換部６０７に出力する（ステップＳ５１）。 FIG. 28 is a flowchart showing a processing procedure of the low frequency correction unit according to the sixth embodiment. As shown in FIG. 28, the lowfrequency correction unit 606 determines whether or not the inverse filter mode is on (step S50). When the inverse filter mode is on (step S50, Yes), the lowfrequency correction unit 606 outputs low frequency information of the time frequency signal that does not suppress the tone to the frequency time conversion unit 607 (step S51). ).

一方、低域補正部６０６は、逆フィルタモードがオフである場合には（ステップＳ５０，Ｎｏ）、制御信号を受信したか否かを判定する（ステップＳ５２）。低域補正部６０６は、制御信号を受信していない場合には（ステップＳ５２，Ｎｏ）、ステップＳ５１に移行する。 On the other hand, when the inverse filter mode is off (step S50, No), the lowfrequency correction unit 606 determines whether or not a control signal has been received (step S52). When the lowfrequency correction unit 606 has not received the control signal (steps S52, No), the lowfrequency correction unit 606 proceeds to step S51.

低域補正部６０６は、制御信号を受信した場合には（ステップＳ５２，Ｙｅｓ）、時間周波数信号の低域に含まれるトーン成分を抑圧する（ステップＳ５３）。低域補正部６０６は、トーンを抑圧した時間周波数信号の低域の情報を周波数時間変換部６０７に出力する（ステップＳ５４）。 When the lowfrequency correction unit 606 receives the control signal (step S52, Yes), the lowfrequency correction unit 606 suppresses the tone component included in the low frequency of the time frequency signal (step S53). The lowfrequency correction unit 606 outputs low frequency information of the time frequency signal whose tone is suppressed to the frequency time conversion unit 607 (step S54).

図２５の説明に戻る。周波数時間変換部６０７は、時間周波数信号を低域信号に変換する処理部である。周波数時間変換部６０７は、低域信号を、低域符号化部６０８に出力する。 Returning to the description of FIG. The frequency-time conversion unit 607 is a processing unit that converts a time-frequency signal into a low-frequency signal. The frequency-time conversion unit 607 outputs a low-frequency signal to the low-frequency coding unit 608.

たとえば、周波数時間変換部６０７は、式（４）で定義されるフィルタバンクにより、時間周波数信号Ｓ’［ｋ］［ｎ］を、低域信号ｓ_ｌｏｗ（ｎ）に変換する。式（４）において、Ｋ_ｌｏｗ＝３２、Ｎ_ｌｏｗ＝１２８とする。ここで、時間周波数信号Ｓ’［ｋ］［ｎ］は、低域補正部６０６により、低域補正が行われた時間周波数信号または、低域補正が行われていない時間周波数信号に対応する。For example, the frequency-time conversion unit 607 converts the filter bank is defined by equation (4), the time-frequency signal S '[k] [n] , the low frequency signal_{s low} (n). In equation (4), K_low = 32 and N_low = 128. Here, the time frequency signals S'[k] [n] correspond to the time frequency signal in which the low frequency correction is performed by the lowfrequency correction unit 606 or the time frequency signal in which the low frequency correction is not performed.

低域符号化部６０８は、低域信号をビット列に符号化することで、低域符号を生成する処理部である。たとえば、低域符号化部６０８は、ＡＡＣに基づいて、符号化を行う。低域符号化部６０８は、低域符号を、多重化部６０９に出力する。 The low-frequency coding unit 608 is a processing unit that generates a low-frequency code by encoding a low-frequency signal into a bit string. For example, the low frequency coding unit 608 performs coding based on AAC. The low frequency coding unit 608 outputs the low frequency code to themultiplexing unit 609.

多重化部６０９は、低域符号と、高域符号とを多重化することで、符号化ストリームを生成する処理部である。多重化部６０９は、符号化ストリームを、ネットワーク５０を介して、復号装置７００に送信する。 Themultiplexing unit 609 is a processing unit that generates a coded stream by multiplexing the low-frequency code and the high-frequency code. Themultiplexing unit 609 transmits the coded stream to thedecoding device 700 via thenetwork 50.

たとえば、多重化部６０９は、符号化ストリームを、ＭＰＥＧ−４ＡＤＴＳ（Audio Data Transport Stream）形式で出力する。図２９は、符号化ストリームのデータ構造の一例を示す図である。図２９に示すように、符号化ストリーム９５０は、複数のＡＤＴＳフレーム９５１〜９５４を含む。図示を省略するが、符号化ストリーム９５０は、ＡＤＴＳフレーム９５１〜９５４以外のＡＤＴＳフレームを含む。 For example, themultiplexing unit 609 outputs the encoded stream in the MPEG-4 ADTS (Audio Data Transport Stream) format. FIG. 29 is a diagram showing an example of the data structure of the coded stream. As shown in FIG. 29, the codedstream 950 includes a plurality of ADTS frames 951-954. Although not shown, the codedstream 950 includes ADTS frames other than ADTS frames 951-954.

たとえば、ＡＤＴＳフレーム９５２は、ＡＤＴＳヘッダ９６０と、ＲＡＷデータブロック９６１とを含む。ＲＡＷデータブロック９６１には、低域符号９７０と、ＦＩＬＬエレメント９７１が格納される。また、ＦＩＬＬエレメント９７１には、高域符号９７２が格納される。ＡＤＴＳフレーム９５１，９５３，９５４のデータ構造は、ＡＤＴＳフレーム９５２のデータ構造と同様である。 For example, theADTS frame 952 includes anADTS header 960 and a RAW data block 961. The low frequency code 970 and the FILL element 971 are stored in the RAW data block 961. Further, the high frequency code 972 is stored in the FILL element 971. The data structure of theADTS frame 951, 953, 954 is similar to the data structure of theADTS frame 952.

次に、図２４に示した復号装置７００について説明する。図３０は、本実施例６に係る復号装置の構成を示す機能ブロック図である。図３０に示すように、この復号装置７００は、符号分離部７０１と、低域復号部７０２と、分析ＱＭＦ部７０３と、高域逆量子化部７０４と、高域生成部７０５と、包絡調整部７０６と、合成部７０７とを有する。 Next, thedecoding device 700 shown in FIG. 24 will be described. FIG. 30 is a functional block diagram showing the configuration of the decoding device according to the sixth embodiment. As shown in FIG. 30, thedecoding device 700 includes acode separation unit 701, a lowfrequency decoding unit 702, an analysis QMF unit 703, a high frequency inverse quantization unit 704, a highfrequency generation unit 705, and an envelope adjustment. It has a section 706 and a synthesis section 707.

符号分離部７０１は、オーディオ符号化装置６００から符号化ストリームを受信し、符号化ストリームに含まれる低域符号と、高域符号とを分離する処理部である。符号分離部７０１は、低域符号を低域復号部７０２に出力する。符号分離部７０１は、高域符号を高域逆量子化部７０４に出力する。 Thecode separation unit 701 is a processing unit that receives a coded stream from theaudio coding device 600 and separates the low frequency code and the high frequency code included in the coded stream. Thecode separation unit 701 outputs the low frequency code to the lowfrequency decoding unit 702. Thecode separation unit 701 outputs the high frequency code to the high frequency inverse quantization unit 704.

低域復号部７０２は、低域符号を復号することで、低域信号を生成する処理部である。低域復号部７０２は、低域信号を分析ＱＭＦ部７０３に出力する。 The low-frequency decoding unit 702 is a processing unit that generates a low-frequency signal by decoding a low-frequency code. The lowfrequency decoding unit 702 outputs the low frequency signal to the analysis QMF unit 703.

分析ＱＭＦ部７０３は、式（３）で定義されるＱＭＦフィルタバンクを用いて、低域信号を、時間周波数信号に変換する処理部である。この時間周波数信号は、各時間の低域の周波数スペクトルに対応する情報である。以下の説明では、低域信号を変換することで得られる時間周波数信号を「低域周波数信号」と表記する。 The analysis QMF unit 703 is a processing unit that converts a low frequency signal into a time frequency signal by using the QMF filter bank defined by the equation (3). This time frequency signal is information corresponding to the low frequency spectrum of each time. In the following description, the time frequency signal obtained by converting the low frequency signal is referred to as a "low frequency signal".

高域逆量子化部７０４は、高域符号を復号することで、高域情報を抽出する処理部である。高域逆量子化部７０４は、抽出した高域情報を、高域生成部７０５に出力する。高域情報には、包絡電力、トーン周波数、周波数分解能が含まれる。 The high-frequency inverse quantization unit 704 is a processing unit that extracts high-frequency information by decoding a high-frequency code. The high-frequency inverse quantization unit 704 outputs the extracted high-frequency information to the high-frequency generation unit 705. High frequency information includes envelope power, tone frequency, and frequency resolution.

高域生成部７０５は、低域周波数信号を基にして、高域信号を生成する処理部である。高域生成部７０５が生成する高域信号は、時間と周波数との関係を示す高域の周波数スペクトルに対応する情報である。高域生成部７０５は、高域信号と高域情報とを包絡調整部７０６に出力する。 The highfrequency generation unit 705 is a processing unit that generates a high frequency signal based on the low frequency signal. The high-frequency signal generated by the high-frequency generation unit 705 is information corresponding to the high-frequency frequency spectrum showing the relationship between time and frequency. The highfrequency generation unit 705 outputs the high frequency signal and the high frequency information to the envelope adjustment unit 706.

以下では、逆フィルタモードがオフの場合の、高域生成部７０５の処理と、逆フィルタモードがオンの場合の、高域生成部７０５の処理について説明する。逆フィルタモードのオン、オフは、高域生成部７０５に予め設定される。 Hereinafter, the processing of the highfrequency generation unit 705 when the inverse filter mode is off and the processing of the highfrequency generation unit 705 when the inverse filter mode is on will be described. On / off of the inverse filter mode is preset in the highfrequency generation unit 705.

逆フィルタモードが「オフ」の場合の、高域生成部７０５の処理について説明する。高域生成部７０５は、低域周波数信号をそのまま高域側に複製することで、高域信号を生成する。 The processing of the highfrequency generation unit 705 when the inverse filter mode is “off” will be described. The highfrequency generation unit 705 generates a high frequency signal by duplicating the low frequency signal as it is on the high frequency side.

逆フィルタモードが「オン」の場合の、高域生成部７０５の処理について説明する。高域生成部７０５は、逆フィルタモードが「オン」の場合には、低域周波数信号に対して、逆フィルタを実行（非相関化を実行）し、逆フィルタを実行した低域周波数信号を高域側に複製することで、高域信号を生成する。高域生成部７０５が低域周波数信号に対して実行する非相関化は、低域周波数信号に対する補正の一例である。 The processing of the highfrequency generation unit 705 when the inverse filter mode is “ON” will be described. When the inverse filter mode is "on", thehigh frequency generator 705 executes an inverse filter (executes decorrelation) on the low frequency signal, and executes the inverse filter on the low frequency signal. A high frequency signal is generated by duplicating to the high frequency side. The decorrelation performed by thehigh frequency generator 705 on the low frequency signal is an example of correction for the low frequency signal.

包絡調整部７０６は、高域情報に含まれる周波数分解能および包絡電力を基にして、高域信号を調整する処理部である。また、包絡調整部７０６は、トーン周波数を基にして、高域信号にトーン成分を付与する。包絡調整部７０６は、調整済みの高域信号を、合成部７０７に出力する。 The envelope adjustment unit 706 is a processing unit that adjusts the high frequency signal based on the frequency resolution and the envelope power included in the high frequency information. Further, the envelope adjusting unit 706 adds a tone component to the high frequency signal based on the tone frequency. The envelope adjustment unit 706 outputs the adjusted high frequency signal to the synthesis unit 707.

合成部７０７は、分析ＱＭＦ部７０３から出力される低域周波数信号と、包絡調整部７０６から出力される調整済みの高域信号とを合成することで、音信号を復号する処理部である。合成部７０７は、復号した音信号を出力する。 The synthesis unit 707 is a processing unit that decodes the sound signal by synthesizing the low frequency signal output from the analysis QMF unit 703 and the adjusted high frequency signal output from the envelope adjustment unit 706. The synthesis unit 707 outputs the decoded sound signal.

次に、本実施例６に係るオーディオ符号化装置６００の処理手順の一例について説明する。図３１は、本実施例６に係るオーディオ符号化装置の処理手順を示すフローチャートである。図３１に示すように、オーディオ符号化装置６００の時間周波数変換部６０１は、音信号を受け付ける（ステップＳ５０１）。時間周波数変換部６０１は、音信号に対して時間周波数変換を実行する（ステップＳ５０２）。 Next, an example of the processing procedure of theaudio coding apparatus 600 according to the sixth embodiment will be described. FIG. 31 is a flowchart showing a processing procedure of the audio coding apparatus according to the sixth embodiment. As shown in FIG. 31, the time-frequency conversion unit 601 of theaudio coding device 600 receives the sound signal (step S501). The time-frequency conversion unit 601 executes time-frequency conversion on the sound signal (step S502).

オーディオ符号化装置６００の高域情報抽出部６０２は、音信号（時間周波数信号）から高域情報を抽出する（ステップＳ５０３）。オーディオ符号化装置６００の高域符号化部６０３は高域情報を符号化し、高域符号を生成する（ステップＳ５０４）。高域情報抽出部６０２は、逆フィルタモードのオン、オフを推定する（ステップＳ５０５）。 The high frequencyinformation extraction unit 602 of theaudio coding apparatus 600 extracts high frequency information from the sound signal (time frequency signal) (step S503). The high-frequency coding unit 603 of theaudio coding device 600 encodes the high-frequency information and generates a high-frequency code (step S504). The high frequencyinformation extraction unit 602 estimates whether the inverse filter mode is on or off (step S505).

オーディオ符号化装置６００の低域抽出部６０５は、音信号（時間周波数信号）から低域信号を抽出する（ステップＳ５０６）。低域補正部６０６は、補正判定処理を実行する（ステップＳ５０７）。ステップＳ５０７の補正判定処理の処理手順は、図２８で説明した処理手順に対応する。 The lowfrequency extraction unit 605 of theaudio coding apparatus 600 extracts the low frequency signal from the sound signal (time frequency signal) (step S506). The lowfrequency correction unit 606 executes the correction determination process (step S507). The processing procedure of the correction determination processing in step S507 corresponds to the processing procedure described with reference to FIG. 28.

オーディオ符号化装置６００の周波数時間変換部６０７は、低域信号に対して周波数時間変換を実行する（ステップＳ５０８）。低域符号化部６０８は、低域信号を符号化し、低域符号を生成する（ステップＳ５０９）。 The frequency-time conversion unit 607 of theaudio coding device 600 executes frequency-time conversion on the low-frequency signal (step S508). The low-frequency coding unit 608 encodes the low-frequency signal and generates a low-frequency code (step S509).

オーディオ符号化装置６００の多重化部６０９は、低域符号と高域符号とを多重化した符号化ストリームを生成する（ステップＳ５１０）。多重化部６０９は、符号化ストリームを復号装置７００に送信する（ステップＳ５１１）。 Themultiplexing unit 609 of theaudio coding apparatus 600 generates a coded stream in which the low frequency code and the high frequency code are multiplexed (step S510). Themultiplexing unit 609 transmits the coded stream to the decoding device 700 (step S511).

次に、本実施例６に係る復号装置７００の処理手順の一例について説明する。図３２は、本実施例６に係る復号装置の処理手順を示すフローチャートである。図３２に示すように、復号装置７００の符号分離部７０１は符号化ストリームを受信し、低域符号および高域符号を分離する（ステップＳ６０１）。 Next, an example of the processing procedure of thedecoding device 700 according to the sixth embodiment will be described. FIG. 32 is a flowchart showing a processing procedure of the decoding apparatus according to the sixth embodiment. As shown in FIG. 32, thecode separation unit 701 of thedecoding device 700 receives the coded stream and separates the low frequency code and the high frequency code (step S601).

復号装置７００の低域復号部７０２は、低域符号を復号することで低域信号を生成する（ステップＳ６０２）。復号装置７００の分析ＱＭＦ部７０３は、ＱＭＦフィルタバンクを用いて、低域周波数信号を生成する（ステップＳ６０３）。 The low-frequency decoding unit 702 of thedecoding device 700 generates a low-frequency signal by decoding the low-frequency code (step S602). The analysis QMF unit 703 of thedecoding device 700 uses the QMF filter bank to generate a low frequency signal (step S603).

復号装置７００の高域逆量子化部７０４は、高域符号に対して高域逆量子化を行うことで高域情報を生成する（ステップＳ６０４）。復号装置７００の高域生成部７０５は、逆フィルタモードがオンであるか否かを判定する（ステップＳ６０５）。 The high-frequency dequantization unit 704 of thedecoding device 700 generates high-frequency information by performing high-frequency dequantization on the high-frequency code (step S604). The highfrequency generation unit 705 of thedecoding device 700 determines whether or not the inverse filter mode is on (step S605).

高域生成部７０５は、逆フィルタモードがオフである場合には（ステップＳ６０５，Ｎｏ）、ステップＳ６０７に移行する。一方、高域生成部７０５は、逆フィルタモードがオンである場合には（ステップＳ６０５，Ｙｅｓ）、低域周波数信号に対する逆フィルタ処理を実行する（ステップＳ６０６）。 When the inverse filter mode is off (steps S605, No), the highfrequency generation unit 705 shifts to step S607. On the other hand, when the reverse filter mode is on (step S605, Yes), the highfrequency generation unit 705 executes the reverse filter processing on the low frequency signal (step S606).

高域生成部７０５は、低域周波数信号を複製して高域信号を生成する（ステップＳ６０７）。復号装置７００の包絡調整部７０６は、高域情報を基にして高域信号の包絡を調整する（ステップＳ６０８）。 The highfrequency generation unit 705 duplicates the low frequency signal to generate the high frequency signal (step S607). The envelope adjusting unit 706 of thedecoding device 700 adjusts the envelope of the high frequency signal based on the high frequency information (step S608).

復号装置７００の合成部７０７は、低域周波数信号と高域信号とを合成することで、音信号を復号する（ステップＳ６０９）。合成部７０７は、音信号を出力する（ステップＳ６１０）。 The synthesis unit 707 of thedecoding device 700 decodes the sound signal by synthesizing the low frequency signal and the high frequency signal (step S609). The synthesis unit 707 outputs a sound signal (step S610).

次に、本実施例６に係るオーディオ符号化装置６００の効果について説明する。オーディオ符号化装置６００は、逆フィルタモードのオン、オフに応じて、低域信号の補正の有無を制御する。たとえば、オーディオ符号化装置６００は、逆フィルタモードが「オフ」の場合には、低域信号の補正を行うことで、トーンを抑圧する。一方、オーディオ符号化装置６００は、逆フィルタモードが「オン」の場合には、低域信号の補正を行わないことで、低域信号のトーンを抑圧しない。このように、逆フィルタモードのオン、オフに応じて、トーンの抑圧の制御を行い、復号装置７００が復号を行う場合における、音信号の品質劣化の問題を解消する。 Next, the effect of theaudio coding device 600 according to the sixth embodiment will be described. Theaudio coding device 600 controls the presence / absence of correction of the low frequency signal according to the on / off of the inverse filter mode. For example, theaudio coding apparatus 600 suppresses the tone by correcting the low frequency signal when the inverse filter mode is “off”. On the other hand, when the inverse filter mode is "on", theaudio coding apparatus 600 does not suppress the tone of the low frequency signal by not correcting the low frequency signal. In this way, the suppression of the tone is controlled according to the on / off of the inverse filter mode, and the problem of quality deterioration of the sound signal when thedecoding device 700 performs decoding is solved.

オーディオ符号化装置６００は、逆フィルタモードが「オフ」の場合には、低域信号の補正を行うことで、トーンを抑圧するので、低域と高域との境界付近に、トーンが複数生成することにより発生するうねりを抑止し、音信号の品質劣化の問題を解消する。 When the inverse filter mode is "off", theaudio coding apparatus 600 suppresses the tone by correcting the low frequency signal, so that a plurality of tones are generated near the boundary between the low frequency and the high frequency. It suppresses the swell that occurs by doing so, and solves the problem of quality deterioration of the sound signal.

また、オーディオ符号化装置６００は、逆フィルタモードが「オン」の場合には、低域信号の補正を行わないことで、低域と高域との境界付近に、トーンが発生しないことによる音声劣化の問題を解消する。 Further, when the inverse filter mode is "on", theaudio coding apparatus 600 does not correct the low frequency signal, so that no tone is generated near the boundary between the low frequency band and the high frequency band. Eliminate the problem of deterioration.

オーディオ符号化装置６００は、音信号に含まれるトーン成分の平均値および音信号の平均電力を基にして、逆フィルタモードがオンであるかオフであるかを推定する。これにより、復号装置７００側で逆フィルタを実行するか否かを、音信号の特徴に合わせて自動で、推定することができる。 Theaudio coding apparatus 600 estimates whether the inverse filter mode is on or off based on the average value of the tone components contained in the sound signal and the average power of the sound signal. As a result, it is possible to automatically estimate whether or not the inverse filter is executed on thedecoding device 700 side according to the characteristics of the sound signal.

本実施例６に係る復号装置７００は、逆フィルタモードのオン、オフに応じて、低域信号の周波数スペクトルを補正（低域に対する逆フィルタを実行）し、補正した低域信号の周波数スペクトルを用いて、高域信号を復号する。上記のように、オーディオ符号化装置６００は、逆フィルタモードがオンの場合には、低域信号のトーン成分の補正が行われていないので、逆フィルタモードを実行しても、復号した音信号の境界付近に、トーン成分が残り、音質劣化の問題を解消することができる。 Thedecoding device 700 according to the sixth embodiment corrects the frequency spectrum of the low frequency signal (executes an inverse filter for the low frequency band) according to whether the reverse filter mode is turned on or off, and corrects the frequency spectrum of the corrected low frequency signal. It is used to decode a high frequency signal. As described above, when the reverse filter mode is on, theaudio coding apparatus 600 does not correct the tone component of the low frequency signal, so that the decoded sound signal is obtained even if the reverse filter mode is executed. The tone component remains near the boundary of the sound quality, and the problem of sound quality deterioration can be solved.

次に、上記実施例に示したオーディオ符号化装置１００（２００，３００，３０１，４００，６００）と同様の機能を実現するコンピュータのハードウェア構成の一例について説明する。図３３は、オーディオ符号化装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。 Next, an example of a computer hardware configuration that realizes the same functions as the audio coding apparatus 100 (200, 300, 301, 400, 600) shown in the above embodiment will be described. FIG. 33 is a diagram showing an example of a hardware configuration of a computer that realizes a function similar to that of an audio coding device.

図３３に示すように、コンピュータ５００は、各種演算処理を実行するＣＰＵ５０１と、ユーザからのデータの入力を受け付ける入力装置５０２と、ディスプレイ５０３とを有する。また、コンピュータ５００は、記憶媒体からプログラム等を読み取る読み取り装置５０４と、外部装置との間でデータの授受を行うインターフェース装置５０５とを有する。また、コンピュータ５００は、各種情報を一時記憶するＲＡＭ５０６と、ハードディスク装置５０７とを有する。そして、各装置５０１〜５０７は、バス５０８に接続される。 As shown in FIG. 33, thecomputer 500 includes aCPU 501 that executes various arithmetic processes, aninput device 502 that receives data input from a user, and adisplay 503. Further, thecomputer 500 has areading device 504 that reads a program or the like from a storage medium, and aninterface device 505 that exchanges data between the external device and thereading device 504. Further, thecomputer 500 has aRAM 506 for temporarily storing various information and a hard disk device 507. Then, each of thedevices 501 to 507 is connected to thebus 508.

ハードディスク装置５０７は、判定プログラム５０７ａ、符号化プログラム５０７ｂ、多重化プログラム５０７ｃを有する。ＣＰＵ５０１は、判定プログラム５０７ａ、符号化プログラム５０７ｂ、多重化プログラム５０７ｃを読み出してＲＡＭ５０６に展開する。 The hard disk device 507 includes adetermination program 507a, acoding program 507b, and amultiplexing program 507c. TheCPU 501 reads out thedetermination program 507a, thecoding program 507b, and themultiplexing program 507c and deploys them in theRAM 506.

判定プログラム５０７ａは、判定プロセス５０６ａとして機能する。符号化プログラム５０７ｂは、符号化プロセス５０６ｂとして機能する。多重化プログラム５０７ｃは、多重化プロセス５０６ｃとして機能する。 Thedetermination program 507a functions as the determination process 506a. Thecoding program 507b functions as thecoding process 506b. Themultiplexing program 507c functions as the multiplexing process 506c.

判定プロセス５０６ａの処理は、判定部１３０，２１０，６０４の処理に対応する。符号化プロセス５０６ｂの処理は、低域信号抽出部１１０、高域情報抽出部１２０、低域補正部１４０、入力信号補正部２２０、低域符号化部１５０，３２０、高域補正部１６０，４１０、高域符号化部１７０、符号化部６００ａの処理に対応する。多重化プロセス５０６ｃの処理は、多重化部１８０，６０９の処理に対応する。 The processing of the determination process 506a corresponds to the processing of thedetermination units 130, 210, 604. The processing of thecoding process 506b includes a low frequencysignal extraction unit 110, a high frequencyinformation extraction unit 120, a lowfrequency correction unit 140, an inputsignal correction unit 220, a lowfrequency coding unit 150, 320, and a highfrequency correction unit 160, 410. , Corresponds to the processing of the highfrequency coding unit 170 and the coding unit 600a. The processing of the multiplexing process 506c corresponds to the processing of the multiplexingunits 180 and 609.

次に、上記実施例に示した復号装置７００と同様の機能を実現するコンピュータのハードウェア構成の一例について説明する。図３４は、復号装置と同様の機能を実現するコンピュータのハードウェア構成の一例を示す図である。 Next, an example of a computer hardware configuration that realizes the same functions as thedecoding device 700 shown in the above embodiment will be described. FIG. 34 is a diagram showing an example of a hardware configuration of a computer that realizes the same function as the decoding device.

図３４に示すように、コンピュータ５５０は、各種演算処理を実行するＣＰＵ５５１と、ユーザからのデータの入力を受け付ける入力装置５５２と、ディスプレイ５５３とを有する。また、コンピュータ５５０は、記憶媒体からプログラム等を読み取る読み取り装置５５４と、外部装置との間でデータの授受を行うインターフェース装置５５５とを有する。また、コンピュータ５５０は、各種情報を一時記憶するＲＡＭ５５６と、ハードディスク装置５５７とを有する。そして、各装置５５１〜５５７は、バス５５８に接続される。 As shown in FIG. 34, thecomputer 550 includes aCPU 551 that executes various arithmetic processes, aninput device 552 that receives data input from a user, and adisplay 553. Further, thecomputer 550 has areading device 554 that reads a program or the like from a storage medium, and aninterface device 555 that exchanges data with and from an external device. Further, thecomputer 550 has aRAM 556 that temporarily stores various information and ahard disk device 557. Then, eachdevice 551 to 557 is connected to thebus 558.

ハードディスク装置５５７は、分離プログラム５５７ａ、低域復号プログラム５５７ｂ、高域生成プログラム５５７ｃ、合成プログラム５５７ｄを有する。ＣＰＵ５５１は、分離プログラム５５７ａ、低域復号プログラム５５７ｂ、高域生成プログラム５５７ｃ、合成プログラム５５７ｄを読み出してＲＡＭ５５６に展開する。 Thehard disk device 557 includes aseparation program 557a, a lowfrequency decoding program 557b, a highfrequency generation program 557c, and asynthesis program 557d. TheCPU 551 reads out theseparation program 557a, the lowfrequency decoding program 557b, the highfrequency generation program 557c, and thesynthesis program 557d and develops them in theRAM 556.

分離プログラム５５７ａは、分離プロセス５５６ａとして機能する。低域復号プログラム５５７ｂは、低域復号プロセス５５６ｂとして機能する。高域生成プログラム５５７ｃは、高域生成プロセス５５６ｃとして機能する。合成プログラム５５７ｄは、合成プロセス５５６ｄとして機能する。 Theseparation program 557a functions as the separation process 556a. The lowfrequency decoding program 557b functions as the lowfrequency decoding process 556b. The highfrequency generation program 557c functions as the highfrequency generation process 556c. Thesynthesis program 557d functions as asynthesis process 556d.

分離プロセス５５６ａの処理は、符号分離部７０１の処理に対応する。低域復号プロセス５５６ｂの処理は、低域復号部７０２の処理に対応する。高域生成プロセス５５６ｃは、高域生成部７０５の処理に対応する。合成プロセス５５６ｄの処理は、合成部７０７の処理に対応する。 The processing of the separation process 556a corresponds to the processing of thecode separation unit 701. The processing of the low-frequency decoding process 556b corresponds to the processing of the low-frequency decoding unit 702. The highfrequency generation process 556c corresponds to the processing of the highfrequency generation unit 705. The processing of thesynthesis process 556d corresponds to the processing of the synthesis unit 707.

なお、各プログラム５０７ａ〜５０７ｃ，５５７ａ〜５５７ｄについては、必ずしも最初からハードディスク装置５０７，５５７に記憶させておかなくても良い。例えば、コンピュータ５００，５５０に挿入されるフレキシブルディスク（ＦＤ）、ＣＤ−ＲＯＭ、ＤＶＤディスク、光磁気ディスク、ＩＣカードなどの「可搬用の物理媒体」に各プログラムを記憶させておく。そして、コンピュータ５００，５５０が各プログラム５０７ａ〜５０７ｃ，５５７ａ〜５５７ｄを読み出して実行するようにしても良い。 Theprograms 507a to 507c and 557a to 557d do not necessarily have to be stored in thehard disk devices 507 and 557 from the beginning. For example, each program is stored in a "portable physical medium" such as a flexible disk (FD), a CD-ROM, a DVD disk, a magneto-optical disk, or an IC card inserted intocomputers 500 and 550. Then, thecomputers 500 and 550 may read and execute theprograms 507a to 507c and 557a to 557d.

以上の各実施例を含む実施形態に関し、さらに以下の付記を開示する。 The following additional notes will be further disclosed with respect to the embodiments including each of the above embodiments.

（付記１）入力信号の低域と高域との境界にトーンが含まれるか否かを判定する判定部と、
前記境界にトーンが含まれている場合に、前記低域、又は、前記高域の一方のトーンを抑圧し、前記入力信号の前記低域を符号化することで、低域符号を生成し、前記入力信号の前記高域を符号化することで、高域符号を生成する符号化部と、
低域符号と高域符号を多重化することで、符号化ストリームを生成する多重化部と
を有することを特徴とするオーディオ符号化装置。(Appendix 1) A determination unit that determines whether or not a tone is included in the boundary between the low and high frequencies of the input signal, and
When a tone is included in the boundary, one tone of the low frequency band or the high frequency band is suppressed, and the low frequency band of the input signal is encoded to generate a low frequency code. A coding unit that generates a high-frequency code by encoding the high-frequency code of the input signal,
An audio coding device characterized by having a multiplexing unit that generates a coded stream by multiplexing a low-frequency code and a high-frequency code.

（付記２）前記符号化部は、前記高域の入力信号の周波数スペクトルから包絡情報を抽出し、前記包絡情報を含む高域情報を符号化することで、前記高域符号を生成し、前記高域のトーンを抑圧する場合には、前記境界付近における前記包絡情報の値を抑制することを特徴とする付記１に記載のオーディオ符号化装置。(Appendix 2) The coding unit generates the high frequency code by extracting the inclusion information from the frequency spectrum of the high frequency input signal and encoding the high frequency information including the inclusion information. The audio coding apparatus according toAppendix 1, wherein when the high frequency tone is suppressed, the value of the enveloping information in the vicinity of the boundary is suppressed.

（付記３）前記判定部は、符号化するビットレートと閾値との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする付記１または２に記載のオーディオ符号化装置。(Appendix 3) The determination unit is characterized in determining whether to suppress the low-frequency tone or the high-frequency tone based on the comparison result between the encoded bit rate and the threshold value. The audio coding device according toAppendix 1 or 2.

（付記４）前記判定部は、前記低域の入力信号と前記低域符号を復号した入力信号との第１誤差、および、前記高域の入力信号と前記高域符号を復号した入力信号との第２誤差を算出し、前記第１誤差と第２誤差との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする付記１または２に記載のオーディオ符号化装置。(Appendix 4) The determination unit includes a first error between the low-frequency input signal and the input signal obtained by decoding the low-frequency code, and the high-frequency input signal and the high-frequency code-decoded input signal. The second error is calculated, and based on the comparison result between the first error and the second error, it is determined whether to suppress the low frequency tone or the high frequency tone. The audio coding device according toAppendix 1 or 2.

（付記５）前記符号化部は、トーンを抑圧する場合に、段階的にトーンの大きさを小さくしていくことを特徴とする付記１〜４のいずれか一つに記載のオーディオ符号化装置。(Supplementary Note 5) The audio coding apparatus according to any one ofSupplementary note 1 to 4, wherein the coding unit gradually reduces the size of the tone when the tone is suppressed. ..

（付記６）前記符号化部は、前記境界にトーンが含まれている場合に、前記高域の下限を、所定周波数分だけ高域側にずらすことで、高域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、高域符号を生成することを特徴とする付記１に記載のオーディオ符号化装置。(Appendix 6) When the boundary includes a tone, the coding unit shifts the lower limit of the high frequency to the high frequency side by a predetermined frequency to encode the high frequency band. The audio coding apparatus according toAppendix 1, wherein the input signal corresponding to the corrected band is encoded to generate a high frequency code.

（付記７）前記符号化部は、前記境界にトーンが含まれている場合に、前記低域の上限を、所定周波数分だけ低域側にずらすことで、低域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、低域符号を生成することを特徴とする付記１に記載のオーディオ符号化装置。(Appendix 7) When the boundary includes a tone, the coding unit shifts the upper limit of the low frequency to the low frequency side by a predetermined frequency to encode the low frequency band. The audio coding apparatus according toAppendix 1, wherein the input signal corresponding to the corrected band is encoded to generate a low frequency code.

（付記８）前記高域情報は、高域を所定幅で分割した帯域毎にトーンの有無を示すトーン周波数の情報を更に含み、前記符号化部は、前記境界に対応する帯域のトーンの有無を、無に設定することを特徴とする付記２に記載のオーディオ符号化装置。(Appendix 8) The high frequency information further includes information on the tone frequency indicating the presence or absence of a tone for each band in which the high frequency is divided by a predetermined width, and the coding unit includes the presence or absence of a tone in the band corresponding to the boundary. The audio coding device according to Appendix 2, wherein is set to none.

（付記９）前記符号化部は、前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域をそのまま複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧した後に、前記低域符号を生成し、
前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域を非相関化した後に複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧することなく、前記低域符号を生成することを特徴とする付記１に記載のオーディオ符号化装置。(Appendix 9) When the decoding device that decodes the coded stream performs a process of generating the high frequency band of the input signal by duplicating the low frequency band of the input signal as it is. After suppressing the tone contained in the low frequency band, the low frequency code is generated.
When the decoding device that decodes the coded stream performs a process of generating the high frequency of the input signal by decorrelating the low frequency of the input signal and then duplicating the low frequency, the low frequency is included in the low frequency. The audio coding apparatus according toAppendix 1, wherein the low-frequency code is generated without suppressing the tone.

（付記１０）前記符号化部は、前記入力信号に含まれるトーン成分の平均値および前記入力信号の平均値を基にして、前記復号装置が、前記低域を非相関化した後に、前記低域符号を生成するか否かを判定することを特徴とする付記９に記載のオーディオ符号化装置。(Appendix 10) The coding unit is based on the average value of the tone components contained in the input signal and the average value of the input signal, and after the decoding device uncorrelates the low frequency band, the low frequency band is described. The audio coding apparatus according to Appendix 9, wherein it determines whether or not to generate a region code.

（付記１１）コンピュータが実行するオーディオ符号化方法であって、
入力信号の低域と高域との境界にトーンが含まれるか否かを判定し、
前記境界にトーンが含まれている場合に、前記低域、又は、前記高域の一方のトーンを抑圧し、
前記入力信号の前記低域を符号化することで低域符号を生成し、
前記入力信号の前記高域を符号化することで、高域符号を生成し、
低域符号と高域符号を多重化することで、符号化ストリームを生成する
処理を実行することを特徴とするオーディオ符号化方法。(Appendix 11) An audio coding method executed by a computer.
Determines whether the boundary between the low and high frequencies of the input signal contains a tone, and determines whether or not the tone is included.
When the boundary contains a tone, one tone of the low frequency band or the high frequency band is suppressed.
A low frequency code is generated by encoding the low frequency band of the input signal.
By encoding the high frequency band of the input signal, a high frequency code is generated.
An audio coding method characterized by executing a process of generating a coded stream by multiplexing a low-frequency code and a high-frequency code.

（付記１２）前記高域の入力信号の周波数スペクトルから包絡情報を抽出し、前記包絡情報を含む高域情報を符号化することで、前記高域符号を生成する処理を更に実行し、前記トーンを抑圧する処理は、前記高域のトーンを抑圧する場合には、前記境界付近における前記包絡情報の値を抑制することを特徴とする付記１１に記載のオーディオ符号化方法。(Appendix 12) By extracting the envelope information from the frequency spectrum of the high frequency input signal and encoding the high frequency information including the envelope information, the process of generating the high frequency code is further executed, and the tone is further executed. The audio coding method according to Appendix 11, wherein the process of suppressing is suppressing the value of the envelope information in the vicinity of the boundary when the high frequency tone is suppressed.

（付記１３）前記トーンを抑圧する処理は、符号化するビットレートと閾値との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする付記１１または１２に記載のオーディオ符号化方法。(Appendix 13) The process of suppressing the tone determines whether to suppress the low-frequency tone or the high-frequency tone based on the comparison result between the encoded bit rate and the threshold value. The audio coding method according toAppendix 11 or 12, characterized in that.

（付記１４）前記トーンを抑圧する処理は、前記低域の入力信号と前記低域符号を復号した入力信号との第１誤差、および、前記高域の入力信号と前記高域符号を復号した入力信号との第２誤差を算出し、前記第１誤差と第２誤差との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする付記１１または１２に記載のオーディオ符号化方法。(Appendix 14) In the process of suppressing the tone, the first error between the low-frequency input signal and the input signal obtained by decoding the low-frequency code, and the high-frequency input signal and the high-frequency code are decoded. The second error with the input signal is calculated, and based on the comparison result between the first error and the second error, it is determined whether to suppress the low frequency tone or the high frequency tone. The audio coding method according toAppendix 11 or 12, characterized in that.

（付記１５）前記トーンを抑圧する処理は、トーンを抑圧する場合に、段階的にトーンの大きさを小さくしていくことを特徴とする付記１１〜１４のいずれか一つに記載のオーディオ符号化方法。(Supplementary Note 15) The audio code according to any one of Supplementary note 11 to 14, wherein the process of suppressing the tone gradually reduces the size of the tone when the tone is suppressed. How to make it.

（付記１６）前記高域符号を生成する処理は、前記境界にトーンが含まれている場合に、前記高域の下限を、所定周波数分だけ高域側にずらすことで、高域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、高域符号を生成することを特徴とする付記１１に記載のオーディオ符号化方法。(Appendix 16) In the process of generating the high frequency code, when a tone is included in the boundary, the high frequency is encoded by shifting the lower limit of the high frequency to the high frequency side by a predetermined frequency. The audio coding method according to Appendix 11, wherein the band is corrected, the input signal corresponding to the corrected band is encoded, and a high frequency code is generated.

（付記１７）前記低域符号を生成する処理は、前記境界にトーンが含まれている場合に、前記低域の上限を、所定周波数分だけ低域側にずらすことで、低域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、高域符号を生成することを特徴とする付記１１に記載のオーディオ符号化方法。(Appendix 17) In the process of generating the low frequency code, when a tone is included in the boundary, the low frequency is encoded by shifting the upper limit of the low frequency to the low frequency side by a predetermined frequency. The audio coding method according to Appendix 11, wherein the band is corrected, the input signal corresponding to the corrected band is encoded, and a high frequency code is generated.

（付記１８）前記高域情報は、高域を所定幅で分割した帯域毎にトーンの有無を示すトーン周波数の情報を更に含み、前記高域符号を生成する処理は、前記境界に対応する帯域のトーンの有無を、無に設定することを特徴とする付記１２に記載のオーディオ符号化方法。(Appendix 18) The high frequency information further includes information on the tone frequency indicating the presence or absence of a tone for each band in which the high frequency is divided by a predetermined width, and the process of generating the high frequency code is a band corresponding to the boundary. The audio coding method according toAppendix 12, wherein the presence or absence of the tone of is set to none.

（付記１９）前記低域符号を生成する処理は、前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域をそのまま複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧した後に、前記低域符号を生成し、
前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域を非相関化した後に複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧することなく、前記低域符号を生成することを特徴とする付記１１に記載のオーディオ符号化方法。(Appendix 19) The process of generating the low frequency code is a process in which a decoding device that decodes the coded stream duplicates the low frequency band of the input signal as it is to generate the high frequency band of the input signal. After suppressing the tone contained in the low frequency band, the low frequency code is generated.
When the decoding device that decodes the coded stream performs a process of generating the high frequency of the input signal by decorrelating the low frequency of the input signal and then duplicating the low frequency, the low frequency is included in the low frequency. The audio coding method according to Appendix 11, wherein the low-frequency code is generated without suppressing the tone.

（付記２０）前記低域符号を生成する処理は、前記入力信号に含まれるトーン成分の平均値および前記入力信号の平均値を基にして、前記復号装置が、前記低域を非相関化した後に、前記低域符号を生成するか否かを判定することを特徴とする付記１９に記載のオーディオ符号化方法。(Appendix 20) In the process of generating the low frequency code, the decoding device uncorrelates the low frequency band based on the average value of the tone components contained in the input signal and the average value of the input signal. The audio coding method according to Appendix 19, wherein it is later determined whether or not to generate the low frequency code.

（付記２１）コンピュータに、
入力信号の低域と高域との境界にトーンが含まれるか否かを判定し、
前記境界にトーンが含まれている場合に、前記低域、又は、前記高域の一方のトーンを抑圧し、
前記入力信号の前記低域を符号化することで低域符号を生成し、
前記入力信号の前記高域を符号化することで、高域符号を生成し、
低域符号と高域符号を多重化することで、符号化ストリームを生成する
処理を実行させることを特徴とするオーディオ符号化プログラム。(Appendix 21) To the computer
Determines whether the boundary between the low and high frequencies of the input signal contains a tone, and determines whether or not the tone is included.
When the boundary contains a tone, one tone of the low frequency band or the high frequency band is suppressed.
A low frequency code is generated by encoding the low frequency band of the input signal.
By encoding the high frequency band of the input signal, a high frequency code is generated.
An audio coding program characterized in that a process of generating a coded stream is executed by multiplexing a low-frequency code and a high-frequency code.

（付記２２）前記高域の入力信号の周波数スペクトルから包絡情報を抽出し、前記包絡情報を含む高域情報を符号化することで、前記高域符号を生成する処理を更に実行し、前記トーンを抑圧する処理は、前記高域のトーンを抑圧する場合には、前記境界付近における前記包絡情報の値を抑制することを特徴とする付記２１に記載のオーディオ符号化プログラム。(Appendix 22) By extracting the wrapping information from the frequency spectrum of the high-frequency input signal and encoding the high-frequency information including the wrapping information, a process of generating the high-frequency code is further executed, and the tone is further executed. 21. The audio coding program according toAppendix 21, wherein the process of suppressing is suppressing the value of the enveloping information in the vicinity of the boundary when the high frequency tone is suppressed.

（付記２３）前記トーンを抑圧する処理は、符号化するビットレートと閾値との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする付記２１または２２に記載のオーディオ符号化プログラム。(Appendix 23) The process of suppressing the tone determines whether to suppress the low-frequency tone or the high-frequency tone based on the comparison result between the encoded bit rate and the threshold value. The audio coding program according toAppendix 21 or 22, characterized in that.

（付記２４）前記トーンを抑圧する処理は、前記低域の入力信号と前記低域符号を復号した入力信号との第１誤差、および、前記高域の入力信号と前記高域符号を復号した入力信号との第２誤差を算出し、前記第１誤差と第２誤差との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする付記２１または２２に記載のオーディオ符号化プログラム。(Appendix 24) In the process of suppressing the tone, the first error between the low-frequency input signal and the input signal obtained by decoding the low-frequency code, and the high-frequency input signal and the high-frequency code are decoded. The second error with the input signal is calculated, and based on the comparison result between the first error and the second error, it is determined whether to suppress the low frequency tone or the high frequency tone. The audio coding program according toAppendix 21 or 22, characterized in that.

（付記２５）前記トーンを抑圧する処理は、トーンを抑圧する場合に、段階的にトーンの大きさを小さくしていくことを特徴とする付記２１〜２４のいずれか一つに記載のオーディオ符号化プログラム。(Supplementary Note 25) The audio code according to any one ofSupplementary note 21 to 24, wherein the process of suppressing the tone gradually reduces the size of the tone when the tone is suppressed. Program.

（付記２６）前記高域符号を生成する処理は、前記境界にトーンが含まれている場合に、前記高域の下限を、所定周波数分だけ高域側にずらすことで、高域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、高域符号を生成することを特徴とする付記２１に記載のオーディオ符号化プログラム。(Appendix 26) In the process of generating the high frequency code, when a tone is included in the boundary, the high frequency is encoded by shifting the lower limit of the high frequency to the high frequency side by a predetermined frequency. 21. The audio coding program according toAppendix 21, wherein the band is corrected, the input signal corresponding to the corrected band is encoded, and a high frequency code is generated.

（付記２７）前記低域符号を生成する処理は、前記境界にトーンが含まれている場合に、前記低域の上限を、所定周波数分だけ低域側にずらすことで、低域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、低域符号を生成することを特徴とする付記２１に記載のオーディオ符号化プログラム。(Appendix 27) In the process of generating the low frequency code, when a tone is included in the boundary, the low frequency is encoded by shifting the upper limit of the low frequency to the low frequency side by a predetermined frequency. 21. The audio coding program according toAppendix 21, wherein the band is corrected, the input signal corresponding to the corrected band is encoded, and a low frequency code is generated.

（付記２８）前記高域情報は、高域を所定幅で分割した帯域毎にトーンの有無を示すトーン周波数の情報を更に含み、前記高域符号を生成する処理は、前記境界に対応する帯域のトーンの有無を、無に設定することを特徴とする付記２２に記載のオーディオ符号化プログラム。(Appendix 28) The high frequency information further includes information on the tone frequency indicating the presence or absence of a tone for each band in which the high frequency is divided by a predetermined width, and the process of generating the high frequency code is a band corresponding to the boundary. The audio coding program according toAppendix 22, wherein the presence or absence of the tone of the above is set to none.

（付記２９）前記低域符号を生成する処理は、前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域をそのまま複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧した後に、前記低域符号を生成し、
前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域を非相関化した後に複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧することなく、前記低域符号を生成することを特徴とする付記２１に記載のオーディオ符号化プログラム。(Appendix 29) The process of generating the low frequency code is a process in which a decoding device that decodes the coded stream duplicates the low frequency band of the input signal as it is to generate the high frequency band of the input signal. After suppressing the tone contained in the low frequency band, the low frequency code is generated.
When the decoding device that decodes the coded stream performs a process of generating the high frequency of the input signal by decorrelating the low frequency of the input signal and then duplicating the low frequency, the low frequency is included in the low frequency. 21. The audio coding program according toAppendix 21, wherein the low frequency code is generated without suppressing the tone.

（付記３０）前記低域符号を生成する処理は、前記入力信号に含まれるトーン成分の平均値および前記入力信号の平均値を基にして、前記復号装置が、前記低域を非相関化した後に、前記低域符号を生成するか否かを判定することを特徴とする付記２１に記載のオーディオ符号化プログラム。(Appendix 30) In the process of generating the low frequency code, the decoding device uncorrelates the low frequency band based on the average value of the tone components included in the input signal and the average value of the input signal. The audio coding program according toAppendix 21, wherein later, it is determined whether or not to generate the low frequency code.

（付記３１）符号化ストリームを低域符号と高域符号に分離する符号分離部と、
前記低域符号を復号することで低域信号を生成する低域復号部と、
前記低域信号の周波数スペクトルを非相関化するか否かの制御情報を基にして、前記低域信号の周波数スペクトルを非相関化し、
前記高域符号を復号することで、音信号の周波数スペクトルの包絡情報を含む高域情報を生成し、前記低域信号の周波数スペクトルと、前記高域情報とを基にして、高域信号を生成する高域生成部と、
前記低域信号と前記高域信号とを合成することで、前記音信号を生成する合成部と
を有することを特徴とする復号装置。(Appendix 31) A code separator that separates the coded stream into a low-frequency code and a high-frequency code,
A low-frequency decoding unit that generates a low-frequency signal by decoding the low-frequency code,
Based on the control information of whether or not to uncorrelate the frequency spectrum of the low frequency signal, the frequency spectrum of the low frequency signal is uncorrelated.
By decoding the high frequency code, high frequency information including the wrapping information of the frequency spectrum of the sound signal is generated, and the high frequency signal is generated based on the frequency spectrum of the low frequency signal and the high frequency information. The high frequency generator to be generated and
A decoding device characterized by having a compositing unit that generates the sound signal by synthesizing the low-frequency signal and the high-frequency signal.

（付記３２）コンピュータが実行する復号方法であって、
符号化ストリームを低域符号と高域符号に分離し、
前記低域符号を復号することで低域信号を生成し、
前記低域信号の周波数スペクトルを非相関化するか否かの制御情報を基にして、前記低域信号の周波数スペクトルを非相関化し、
前記高域符号を復号することで、音信号の周波数スペクトルの包絡情報を含む高域情報を生成し、前記低域信号の周波数スペクトルと、前記高域情報とを基にして、高域信号を生成し、
前記低域信号と前記高域信号とを合成することで、前記音信号を生成する
処理を実行することを特徴とする復号方法。(Appendix 32) A decryption method executed by a computer.
Separate the coded stream into low-frequency code and high-frequency code,
A low-frequency signal is generated by decoding the low-frequency code.
Based on the control information of whether or not to uncorrelate the frequency spectrum of the low frequency signal, the frequency spectrum of the low frequency signal is uncorrelated.
By decoding the high frequency code, high frequency information including the wrapping information of the frequency spectrum of the sound signal is generated, and the high frequency signal is generated based on the frequency spectrum of the low frequency signal and the high frequency information. Generate and
A decoding method characterized by executing a process of generating the sound signal by synthesizing the low-frequency signal and the high-frequency signal.

（付記３３）コンピュータに、
符号化ストリームを低域符号と高域符号に分離し、
前記低域符号を復号することで低域信号を生成し、
前記低域信号の周波数スペクトルを非相関化するか否かの制御情報を基にして、前記低域信号の周波数スペクトルを非相関化し、
前記高域符号を復号することで、音信号の周波数スペクトルの包絡情報を含む高域情報を生成し、前記低域信号の周波数スペクトルと、前記高域情報とを基にして、高域信号を生成し、
前記低域信号と前記高域信号とを合成することで、前記音信号を生成する
処理を実行させることを特徴とする復号プログラム。(Appendix 33) To the computer
Separate the coded stream into low-frequency code and high-frequency code,
A low-frequency signal is generated by decoding the low-frequency code.
Based on the control information of whether or not to uncorrelate the frequency spectrum of the low frequency signal, the frequency spectrum of the low frequency signal is uncorrelated.
By decoding the high frequency code, high frequency information including the wrapping information of the frequency spectrum of the sound signal is generated, and the high frequency signal is generated based on the frequency spectrum of the low frequency signal and the high frequency information. Generate and
A decoding program characterized in that a process of generating the sound signal is executed by synthesizing the low-frequency signal and the high-frequency signal.

５０ネットワーク
１００，２００，３００，３０１，４００，６００オーディオ符号化装置
１１０低域信号抽出部
１２０，６０２高域情報抽出部
１３０，２１０，６０４判定部
１４０，６０６低域補正部
１５０，３２０，６０８低域符号化部
１６０，４１０高域補正部
１７０高域符号化部
１８０，６０９多重化部
２２０入力信号補正部
３１０補正制御部
６０１時間周波数変換部
６０３高域符号化部
６０５低域抽出部
６０７周波数時間変換部
７００復号装置
７０１符号分離部
７０２低域復号部
７０３分析ＱＭＦ部
７０４高域逆量子化部
７０５高域生成部
７０６包絡調整部
７０７合成部50 Network 100,200,300,301,400,600Audio coding device 110 Low frequency signal extraction unit 120,602 High frequency information extraction unit 130,210,604 Judgment unit 140,606 Low frequency correction unit 150,320,608 Low frequency coding unit 160,410 Highfrequency correction unit 170 High frequency coding unit 180,609Multiplexing unit 220 Inputsignal correction unit 310 Correction control unit 601 Timefrequency conversion unit 603 Highfrequency coding unit 605 Low frequency extraction unit 607 Frequencytime conversion unit 700Decoding device 701Code separation unit 702 Low frequency decoding unit 703 Analysis QMF unit 704 High frequencyinverse quantization unit 705 High frequency generation unit 706 Entrainment adjustment unit 707 Synthesis unit

Claims

Translated fromJapanese

入力信号の低域と高域との境界にトーンが含まれるか否かを判定する判定部と、
前記境界にトーンが含まれている場合に、前記低域、又は、前記高域の一方のトーンを抑圧し、前記入力信号の前記低域を符号化することで、低域符号を生成し、前記入力信号の前記高域を符号化することで、高域符号を生成する符号化部と、
低域符号と高域符号を多重化することで、符号化ストリームを生成する多重化部と
を有することを特徴とするオーディオ符号化装置。A determination unit that determines whether or not a tone is included in the boundary between the low and high frequencies of the input signal,
When a tone is included in the boundary, one tone of the low frequency band or the high frequency band is suppressed, and the low frequency band of the input signal is encoded to generate a low frequency code. A coding unit that generates a high-frequency code by encoding the high-frequency code of the input signal,
An audio coding device characterized by having a multiplexing unit that generates a coded stream by multiplexing a low-frequency code and a high-frequency code.

前記符号化部は、前記高域の入力信号の周波数スペクトルから包絡情報を抽出し、前記包絡情報を含む高域情報を符号化することで、前記高域符号を生成し、前記高域のトーンを抑圧する場合には、前記境界付近における前記包絡情報の値を抑制することを特徴とする請求項１に記載のオーディオ符号化装置。 The coding unit extracts the wrapping information from the frequency spectrum of the high-frequency input signal and encodes the high-frequency information including the wrapping information to generate the high-frequency code and generate the high-frequency tone. The audio coding apparatus according to claim 1, wherein the value of the enveloping information in the vicinity of the boundary is suppressed when the information is suppressed.

前記判定部は、符号化するビットレートと閾値との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする請求項１または２に記載のオーディオ符号化装置。 The determination unit is characterized in that it determines whether to suppress the low frequency tone or the high frequency tone based on a comparison result between a bit rate to be encoded and a threshold value. The audio coding apparatus according to 1 or 2.

前記判定部は、前記低域の入力信号と前記低域符号を復号した入力信号との第１誤差、および、前記高域の入力信号と前記高域符号を復号した入力信号との第２誤差を算出し、前記第１誤差と第２誤差との比較結果を基にして、前記低域のトーンを抑圧するのか、前記高域のトーンを抑圧するのかを判定することを特徴とする請求項１または２に記載のオーディオ符号化装置。 The determination unit has a first error between the low-frequency input signal and the input signal obtained by decoding the low-frequency code, and a second error between the high-frequency input signal and the input signal obtained by decoding the high-frequency code. Is calculated, and based on the result of comparison between the first error and the second error, it is determined whether to suppress the low frequency tone or the high frequency tone. The audio encoding device according to 1 or 2.

前記符号化部は、前記境界にトーンが含まれている場合に、前記高域の下限を、所定周波数分だけ高域側にずらすことで、高域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、高域符号を生成することを特徴とする請求項１に記載のオーディオ符号化装置。 When the boundary contains a tone, the coding unit corrects the band when coding the high frequency band by shifting the lower limit of the high frequency band to the high frequency band side by a predetermined frequency. The audio coding apparatus according to claim 1, wherein the input signal corresponding to the corrected band is encoded to generate a high frequency code.

前記符号化部は、前記境界にトーンが含まれている場合に、前記低域の上限を、所定周波数分だけ低域側にずらすことで、低域を符号化する際の帯域を補正し、補正した帯域に対応する入力信号を符号化して、低域符号を生成することを特徴とする請求項１に記載のオーディオ符号化装置。 When the boundary contains a tone, the coding unit corrects the band when coding the low frequency band by shifting the upper limit of the low frequency band to the low frequency band side by a predetermined frequency. The audio coding apparatus according to claim 1, wherein the input signal corresponding to the corrected band is encoded to generate a low frequency code.

前記高域情報は、高域を所定幅で分割した帯域毎にトーンの有無を示すトーン周波数の情報を更に含み、前記符号化部は、前記境界に対応する帯域のトーンの有無を、無に設定することを特徴とする請求項２に記載のオーディオ符号化装置。 The high frequency information further includes information on the tone frequency indicating the presence or absence of a tone for each band in which the high frequency is divided by a predetermined width, and the coding unit makes the presence or absence of a tone in the band corresponding to the boundary absent. The audio coding apparatus according to claim 2, wherein the audio coding apparatus is set.

前記符号化部は、前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域をそのまま複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧した後に、前記低域符号を生成し、
前記符号化ストリームを復号する復号装置が、前記入力信号の前記低域を非相関化した後に複製することで、前記入力信号の前記高域を生成する処理を行う場合に、前記低域に含まれる前記トーンを抑圧することなく、前記低域符号を生成することを特徴とする請求項１に記載のオーディオ符号化装置。When the decoding device that decodes the coded stream performs a process of generating the high range of the input signal by duplicating the low range of the input signal as it is, the coding unit performs the low range. After suppressing the tone contained in, the low frequency code is generated.
When the decoding device that decodes the coded stream performs a process of generating the high frequency of the input signal by decorrelating the low frequency of the input signal and then duplicating the low frequency, the low frequency is included in the low frequency. The audio coding apparatus according to claim 1, wherein the low frequency code is generated without suppressing the tone.

前記符号化部は、前記入力信号に含まれるトーン成分の平均値および前記入力信号の平均値を基にして、前記復号装置が、前記低域を非相関化した後に、前記低域符号を生成するか否かを判定することを特徴とする請求項８に記載のオーディオ符号化装置。 The coding unit generates the low frequency code after the decoding device uncorrelates the low frequency band based on the average value of the tone components contained in the input signal and the average value of the input signal. The audio coding apparatus according to claim 8, wherein it is determined whether or not to perform.

コンピュータが実行するオーディオ符号化方法であって、
入力信号の低域と高域との境界にトーンが含まれるか否かを判定し、
前記境界にトーンが含まれている場合に、前記低域、又は、前記高域の一方のトーンを抑圧し、
前記入力信号の前記低域を符号化することで低域符号を生成し、
前記入力信号の前記高域を符号化することで、高域符号を生成し、
低域符号と高域符号を多重化することで、符号化ストリームを生成する
処理を実行することを特徴とするオーディオ符号化方法。An audio coding method performed by a computer
Determines whether the boundary between the low and high frequencies of the input signal contains a tone, and determines whether or not the tone is included.
When the boundary contains a tone, one tone of the low frequency band or the high frequency band is suppressed.
A low frequency code is generated by encoding the low frequency band of the input signal.
By encoding the high frequency band of the input signal, a high frequency code is generated.
An audio coding method characterized by executing a process of generating a coded stream by multiplexing a low-frequency code and a high-frequency code.

コンピュータに、
入力信号の低域と高域との境界にトーンが含まれるか否かを判定し、
前記境界にトーンが含まれている場合に、前記低域、又は、前記高域の一方のトーンを抑圧し、
前記入力信号の前記低域を符号化することで低域符号を生成し、
前記入力信号の前記高域を符号化することで、高域符号を生成し、
低域符号と高域符号を多重化することで、符号化ストリームを生成する
処理を実行させることを特徴とするオーディオ符号化プログラム。On the computer
Determines whether the boundary between the low and high frequencies of the input signal contains a tone, and determines whether or not the tone is included.
When the boundary contains a tone, one tone of the low frequency band or the high frequency band is suppressed.
A low frequency code is generated by encoding the low frequency band of the input signal.
By encoding the high frequency band of the input signal, a high frequency code is generated.
An audio coding program characterized in that a process of generating a coded stream is executed by multiplexing a low-frequency code and a high-frequency code.