JP2000206996A

Movatterモバイル変換

Info

Publication number: JP2000206996A
Application number: JP11007000A
Authority: JP
Inventors: Takahiro Mine; 貴宏嶺; Takashi Araki; 貴志荒木; Shiro Omori; 士郎大森
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1999-01-13
Filing date: 1999-01-13
Publication date: 2000-07-28

Abstract

PROBLEM TO BE SOLVED: To provide a receiver and receiving method, communication equipment and communicating method by which received voice having an improved listening quality is obtained. SOLUTION: A signal switching section 32 is provided with a switch 150. Based on user's desire, the switch 150 selects the voice signals of a first band B1 (300 Hz to 3400 Hz) of a first sampling frequency fs1 (=8 kHz), in which a spectrum forming and a listening quality improvement are conducted in a first post filter process (a) 47, the voice signals of the band B1 of a second sampling frequency fs2 (=16 kHz) in which a spectrum forming and a listening quality improvement are conducted by a second post filter process (b) 48, and the voice signals of a wide band Bw (300 Hz to 6000 Hz) of the frequency fs2 in which a spectrum forming and a listening quality improvement are conducted by a third post filter process (b) 49.

Description

Translated fromJapanese

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、通信や放送によっ
て伝えられた、音声信号の音声パラメータ符号を使って
音声信号を合成する受信装置及び方法、通信装置及び方
法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a receiving apparatus and method for synthesizing an audio signal using an audio parameter code of an audio signal transmitted by communication or broadcasting, and a communication apparatus and method.

【０００２】[0002]

【従来の技術】従来の通信装置では、受話側における入
力音声と出力音声のサンプリング周波数が同一であると
共に、音声周波数帯域も同一であった。これは、電話回
線の伝送帯域が例えば３００〜３４００Ｈｚと狭く、電
話回線を介して送られてくる音声信号の周波数帯域が制
限されてしまうためである。2. Description of the Related Art In a conventional communication device, a sampling frequency of an input voice and an output voice on a receiving side are the same, and a voice frequency band is also the same. This is because the transmission band of the telephone line is narrow, for example, 300 to 3400 Hz, and the frequency band of the audio signal transmitted via the telephone line is limited.

【０００３】[0003]

【発明が解決しようとする課題】ところで、上記伝送帯
域が制限された、入力音声と同様の音声周波数帯域で出
力される音声では音質はあまり良好とは言えない。つま
り、聴覚的品質が劣る。また、ディジタル携帯電話の音
質についても不満がある。By the way, the sound quality of the sound output in the same sound frequency band as the input sound whose transmission band is limited is not so good. That is, the auditory quality is inferior. They also complain about the sound quality of digital mobile phones.

【０００４】本発明は、上記実情に鑑みてなされたもの
であり、聴覚的品質を向上させた受話音声を得ることの
できる受信装置及び方法、通信装置及び方法の提供を目
的とする。[0004] The present invention has been made in view of the above circumstances, and has as its object to provide a receiving apparatus and method, a communication apparatus, and a method capable of obtaining a received voice with improved auditory quality.

【０００５】[0005]

【課題を解決するための手段】本発明に係る受信装置
は、上記課題を解決するために、第１のサンプリング周
波数ｆ_s1の音声信号を生成するために送信装置から伝送
されてきた伝送信号に基づく音声パラメータ符号を使っ
て生成した第１の帯域Ｂ₁の音声信号のサンプリング周
波数を第２のサンプリング周波数ｆ_s2（ｆ_s2＞ｆ_s1）に
変換するサンプリングレート変換手段と、上記音声パラ
メータ符号を使って生成した第１のサンプリング周波数
ｆ_s1の第１の帯域Ｂ₁の音声信号に第１のポストフィル
タ処理を施すと共に、上記サンプリングレート変換手段
からの第２のサンプリング周波数ｆ_s2の第１の帯域Ｂ₁
の音声信号に第２のポストフィルタ処理を施すポストフ
ィルタ手段と、このポストフィルタ手段からの第１のフ
ィルタ処理出力と第２のフィルタ処理出力とを切り換え
る切り換え手段とを備える。A receiving apparatus according to the present invention.
In order to solve the above-mentioned problem,
Wave number f_s1Transmitted from transmitter to generate audio signal
Using speech parameter codes based on the transmitted signal
Generated first band B₁Sampling frequency of the audio signal
Change the wave number to the second sampling frequency f_s2(F_s2> F_s1)
Sampling rate conversion means for converting
First sampling frequency generated using meter code
f_s1First band B of₁1st post-fill to the audio signal
The sampling rate conversion means.
From the second sampling frequency f_s2First band B of₁
Post-filtering the audio signal of
Filter means and a first filter from the post-filter means.
Switching between filter processing output and second filter processing output
Switching means.

【０００６】ここで、上記ポストフィルタ手段は、上記
第１のサンプリング周波数ｆ_s1の音声信号を生成するた
めに送信装置から伝送されてきた伝送信号に基づく音声
パラメータ符号に応じたポストフィルタ処理を上記第１
のポストフィルタ処理として上記第１のサンプリング周
波数ｆ_s1の第１の帯域Ｂ₁の音声信号に施す。Here, the post-filter means performs post-filter processing according to a voice parameter code based on a transmission signal transmitted from a transmitting device to generate a voice signal of the first sampling frequency f_s1. First
As the post-filter process performed on the first band B₁ of the audio signal of the first sampling frequency f_s1.

【０００７】また、上記ポストフィルタ手段は、復号化
された信号が入力され、フィルタ係数が第１の周期で更
新されるスペクトル整形フィルタ手段と、このスペクト
ル整形フィルタ手段からの出力が入力され、ゲインが上
記第１の周期とは異なる第２の周期で更新されるゲイン
調整手段とを有する。Further, the post-filter means receives the decoded signal as input and updates the filter coefficient in the first cycle. The spectrum-shaping filter means receives the output from the spectral-shaping filter means as input, Has gain adjustment means updated in a second cycle different from the first cycle.

【０００８】本発明に係る受信方法は、上記課題を解決
するために、第１のサンプリング周波数ｆ_s1の音声信号
を生成するために送信装置から伝送されてきた伝送信号
に基づく音声パラメータ符号を使って生成した第１のサ
ンプリング周波数ｆ_s1の第１の帯域Ｂ₁の音声信号に第
１のポストフィルタ処理を施して得られる第１の処理出
力と、上記音声パラメータ符号を使って生成した第１の
帯域Ｂ₁の音声信号のサンプリング周波数を第２のサン
プリング周波数ｆ_s2（ｆ_s2＞ｆ_s1）に変換して得た第２
のサンプリング周波数ｆ_s2の第１の帯域Ｂ₁の音声信号
に第２のポストフィルタ処理を施して得られた第２の処
理出力とを、切り換える。[0008] In order to solve the above-mentioned problems, a receiving method according to the present invention uses an audio parameter code based on a transmission signal transmitted from a transmitting apparatus to generate an audio signal having a first sampling frequency f_s1. first first first the process output obtained in the audio signal band B₁ subjected to the first post-filter processing of the sampling frequency f_s1 generated Te, first produced by using the speech parameters reference numeral 1 the sampling frequency of the band B₁ of the audio signal a second sampling frequency f_s2 second obtained by converting the (f_s2> f_s1)
And a second processed output obtained by subjecting the audio signal of the_first band_B1 of the sampling frequency f_s2 to the second post-filter processing.

【０００９】上記第１のポストフィルタ処理は、上記第
１のサンプリング周波数ｆ_s1の音声信号を生成するため
に送信装置から伝送されてきた伝送信号に基づく音声パ
ラメータ符号に応じたポストフィルタ処理を上記第１の
サンプリング周波数ｆ_s1の第１の帯域Ｂ₁の音声信号に
施す。The first post-filter processing includes a post-filter processing corresponding to a voice parameter code based on a transmission signal transmitted from a transmitting device to generate a voice signal of the first sampling frequency f_s1. It performed to the first first band B₁ of the audio signal having the sampling frequency f_s1.

【００１０】また、上記第２のポストフィルタ処理は、
上記第１のポストフィルタ処理を、上記サンプリングレ
ート変換された第２のサンプリング周波数ｆ_s2の第１の
帯域Ｂ₁の音声信号におけるｆ_s2／ｆ_s1倍のサンプルに
対して施す。[0010] Further, the second post-filter processing includes:
The first post-filter processing, performed on the f_s2 / f_s1 times of samples in the first band B₁ of the audio signal of the second sampling frequency f_s2, which has been converted the sampling rate.

【００１１】本発明に係る通信装置は、上記課題を解決
するために、入力音声信号に第１のサンプリング周波数
ｆ_s1による符号化処理を施して伝送信号を生成する送信
手段と、上記第１のサンプリング周波数ｆ_s1の音声信号
を生成するために送信手段から伝送されてきた伝送信号
に基づく音声パラメータ符号を使って生成した第１のサ
ンプリング周波数ｆ_s1の第１の帯域Ｂ₁の音声信号に第
１のポストフィルタ処理を施して得られる第１の処理出
力と、上記音声パラメータ符号を使って生成した第１の
帯域Ｂ₁の音声信号のサンプリング周波数を第２のサン
プリング周波数ｆ_s2（ｆ_s2＞ｆ_s1）に変換して得た第２
のサンプリング周波数ｆ_s2の第１の帯域Ｂ₁の音声信号
に第２のポストフィルタ処理を施して得られた第２の処
理出力とを、切り換えて出力する受信手段とを備える。[0011] In order to solve the above-mentioned problems, a communication device according to the present invention includes: a transmitting unit that performs an encoding process on an input audio signal at a first sampling frequency f_s1 to generate a transmission signal; first the first first band B₁ of the audio signal having the sampling frequency f_s1 generated using the speech parameters code based on the transmission signal transmitted from the transmitting means to generate an audio signal having the sampling frequency f_s1 the first and the process output obtained by performing the first post-filter processing, the speech parameter codes to use the sampling frequency of the first band B₁ of the audio signal generated by a second sampling frequency f_{_s2} (f_s2> f_s1 )
Receiving means for switching and outputting a second processed output obtained by subjecting the audio signal of the_first band_B1 of the sampling frequency f_s2 to the second post-filter processing.

【００１２】ここで、上記受信手段は、上記第１のサン
プリング周波数ｆ_s1の音声信号を生成するために送信装
置から伝送されてきた伝送信号に基づく音声パラメータ
符号を使って生成した第１の帯域Ｂ₁の音声信号のサン
プリング周波数を第２のサンプリング周波数ｆ_s2（ｆ_s2
＞ｆ_s1）に変換するサンプリングレート変換手段と、上
記音声パラメータ符号を使って生成した第１のサンプリ
ング周波数ｆ_s1の第１の帯域Ｂ₁の音声信号に第１のポ
ストフィルタ処理を施すと共に、上記サンプリングレー
ト変換手段からの第２のサンプリング周波数ｆ_s2の第１
の帯域Ｂ₁の音声信号に第２のポストフィルタ処理を施
すポストフィルタ手段と、このポストフィルタ手段から
の第１のフィルタ処理出力と第２のフィルタ処理出力と
を切り換える切り換え手段とを備える。Here, the receiving means includes a first band generated by using a voice parameter code based on a transmission signal transmitted from a transmitting device to generate a voice signal of the first sampling frequency f_s1. the sampling frequency of the B₁ of the audio signal a second sampling frequency f_s2 (f_s2
> F_s1 ), and a first post-filter process on the audio signal of the first band B₁ of the first sampling frequency f_s1 generated using the audio parameter code, The first sampling frequency f_s2 from the sampling rate converter
Comprising of a post-filter means for performing a second post-filter processing to the audio signal of the band B_1, and a switching means for switching a first filtered output and the second filtered output from the post-filter unit.

【００１３】本発明に係る通信方法は、上記課題を解決
するために、入力音声信号に第１のサンプリング周波数
ｆ_s1による符号化処理を施して伝送信号を生成すると共
に、上記第１のサンプリング周波数ｆ_s1の音声信号を生
成するために送信装置から伝送されてきた伝送信号に基
づく音声パラメータ符号を使って生成した第１のサンプ
リング周波数ｆ_s1の第１の帯域Ｂ₁の音声信号に第１の
ポストフィルタ処理を施して得られる第１の処理出力
と、上記音声パラメータ符号を使って生成した第１の帯
域Ｂ₁の音声信号のサンプリング周波数を第２のサンプ
リング周波数ｆ_s2（ｆ_s2＞ｆ_s1）に変換して得た第２の
サンプリング周波数ｆ_s2の第１の帯域Ｂ₁の音声信号に
第２のポストフィルタ処理を施して得られた第２の処理
出力とを、切り換えて出力する。In order to solve the above-mentioned problems, a communication method according to the present invention performs an encoding process on an input audio signal at a first sampling frequency f_s1 to generate a transmission signal, and generates a transmission signal. f to_s1 first first band B₁ of the audio signal having the sampling frequency f_s1 generated using the speech parameters code based on the transmission signal transmitted from the transmitting apparatus to generate an audio signal of the first postfiltering first and processing output obtained by performing the above sound parameter codes to use the sampling frequency of the first band B₁ of the audio signal generated by a second sampling frequency f_{_s2} (f_s2> f_s1 and a second processing output obtained in the first band B₁ of the audio signal subjected to the second post-filter processing of the second sampling frequency f_s2 obtained by converting the) switching To output.

【００１４】[0014]

【発明の実施の形態】以下、本発明の実施の形態につい
て図面を参照しながら説明する。この実施の形態は、本
発明に係る受信装置の具体例となる、図１に示す受信装
置１であり、本発明に係る受信方法を適用している。こ
の受信装置１は、パーソナルディジタルセルラー（Pers
onal Digital Cellular，ＰＤＣ）として、現在広く使
用されている、ディジタル携帯電話の受話側として用い
ることができる。Embodiments of the present invention will be described below with reference to the drawings. This embodiment is a receiving apparatus 1 shown in FIG. 1 which is a specific example of a receiving apparatus according to the present invention, to which a receiving method according to the present invention is applied. This receiving device 1 is a personal digital cellular (Pers
onal Digital Cellular (PDC), which can be used as a receiving side of a digital mobile phone that is currently widely used.

【００１５】受信装置１は、第１のサンプリング周波数
ｆ_s1の音声信号を生成するために後述する送信装置から
基地局を介して伝送されてきた音声パラメータ符号か
ら、第１のサンプリング周波数ｆ_s1の第１の帯域Ｂ₁の
音声信号と、第２のサンプリング周波数ｆ_s2（ｆ_s2＞ｆ
_s1）の第１の帯域Ｂ₁の音声信号と、第２のサンプリン
グ周波数ｆ_s2（ｆ_s2＞ｆ_s1）の広帯域Ｂ_w（第１の帯域
Ｂ₁＋第２の帯域Ｂ₂）の音声信号を生成し、これら３種
類の音声信号を切り換えて出力する。第１のサンプリン
グ周波数ｆ_s1としては８ＫＨｚを、第２のサンプリング
周波数ｆ_s2としては１６ＫＨｚを用いる。また、第１の
帯域Ｂ₁としては３００Ｈｚ〜３４００Ｈｚを、第２の
帯域Ｂ₂としては３４００Ｈｚ〜６０００Ｈｚを用い
る。したがって、広帯域Ｂ_Wとしては３００Ｈｚ〜６０
００Ｈｚを用いる。The receiving apparatus 1, from the speech parameter code transmitted via a base station from the transmitting apparatus to be described later to generate an audio signal of a first sampling frequency f_s1, the first sampling frequency f_s1 The audio signal of the first band B₁ and the second sampling frequency f_s2 (f_s2 > f
a first audio signal having a bandwidth B₁_s1), a second sampling frequency_{_{_{f s2 (f s2> f s1}}} ) wideband B_w (first band B₁ + second band B₂₎ of the speech signal Is generated, and these three types of audio signals are switched and output. 8 KHz is used as the first sampling frequency f_{s1 and} 16 KHz is used as the second sampling frequency f_s2 . As the first band B₁ of 300Hz～3400Hz, using 3400Hz~6000Hz the second as band B_2. Therefore, the wide-band B_W 300Hz~60
00 Hz is used.

【００１６】図１において受信装置１がアンテナ２を介
して基地局から受信した音声パラメータ符号は、ＲＦ受
信部３、制御部４を経由して信号処理装置５のメモリ５
ａに格納される。In FIG. 1, the speech parameter code received by the receiving apparatus 1 from the base station via the antenna 2 is transmitted to the memory 5 of the signal processing apparatus 5 via the RF receiving section 3 and the control section 4.
a.

【００１７】信号処理装置５のメモリ５ａに格納された
音声パラメータ符号は、信号処理装置５の復号部で復号
処理された後、所定の信号処理が施されて出力される。The speech parameter code stored in the memory 5a of the signal processing device 5 is decoded by the decoding unit of the signal processing device 5, and then subjected to predetermined signal processing and output.

【００１８】信号処理装置５からの出力信号は、Ｄ／Ａ
変換器６でアナログ信号とされた後、アンチエイリアシ
ングフィルタ７、ボリューム８及びアンプ９を経由して
スピーカ１０から出力される。なお、制御部４には例え
ばキー操作部１１とＬＣＤ表示部１２が接続されてい
る。The output signal from the signal processing device 5 is D / A
After being converted into an analog signal by the converter 6, the analog signal is output from the speaker 10 via the anti-aliasing filter 7, the volume 8 and the amplifier 9. The control unit 4 is connected to, for example, a key operation unit 11 and an LCD display unit 12.

【００１９】図２には、上記音声パラメータ符号を例え
ば無線伝送路、及び基地局を介して送信する、送信装置
１５の構成を示す。この送信装置１５もＰＤＣとして、
現在広く使用されている、ディジタル携帯電話の送話側
として使うことができる。FIG. 2 shows a configuration of a transmitting device 15 that transmits the above-mentioned voice parameter code via, for example, a radio transmission path and a base station. This transmitting device 15 is also a PDC,
It can be used as a transmitting side of a digital mobile phone which is widely used at present.

【００２０】マイクロホン１６から入力された音声信号
は、アンプ１７，ボリューム１８，アンチエイリアシン
グフィルタ１９及びＡ／Ｄ変換器２０を経由して信号処
理装置２１のメモリ２１ａに格納される。The audio signal input from the microphone 16 is stored in the memory 21a of the signal processing device 21 via the amplifier 17, the volume 18, the anti-aliasing filter 19, and the A / D converter 20.

【００２１】メモリ２１ａに格納された音声信号は、信
号処理装置２１内部の音声符号化部で符号処理され、音
声パラメータ符号として出力される。この音声パラメー
タ符号は、制御部２２及びＲＦ送信部２３及びアンテナ
２４を経由して基地局へ送信される。なお、制御部２２
にはキー操作部２５とＬＣＤ表示部２６が接続されてい
る。The audio signal stored in the memory 21a is subjected to code processing in an audio encoding unit inside the signal processing device 21, and is output as an audio parameter code. This voice parameter code is transmitted to the base station via the control unit 22, the RF transmission unit 23, and the antenna 24. The control unit 22
Is connected to a key operation unit 25 and an LCD display unit 26.

【００２２】ここで、信号処理装置２１内部の音声符号
化部は、無線伝送路により制限される狭帯域化を考慮し
た音声パラメータ符号を生成する。一般的には、３００
Ｈｚ〜３４００Ｈｚの伝送帯域を考慮している。上記伝
送信号に基づく音声パラメータ符号は、制御部２２を介
してＲＦ送信部２３に供給される。例えば、音声パラメ
ータ符号としては、励振源に関する線形予測（ＬＰＣ）
残差や、線形予測係数αがある。他には、ピッチ周波数
に関するラグＬＡＧや、例えば２０msecのフレームにお
けるフレームパワーＲ０等がある。Here, the speech coding unit in the signal processing device 21 generates a speech parameter code in consideration of the narrow band limited by the radio transmission path. Generally, 300
A transmission band of 3 Hz to 3400 Hz is considered. The voice parameter code based on the transmission signal is supplied to the RF transmission unit 23 via the control unit 22. For example, as the voice parameter code, linear prediction (LPC) for the excitation source
There are residuals and linear prediction coefficients α. Other examples include a lag LAG related to the pitch frequency and a frame power R0 in a frame of, for example, 20 msec.

【００２３】図１の受信装置１内部の信号処理装置５
は、図３に示すデコーダ２７と、図４に示す信号切換部
３２とを備えてなる。The signal processing device 5 inside the receiving device 1 of FIG.
Comprises a decoder 27 shown in FIG. 3 and a signal switching unit 32 shown in FIG.

【００２４】上記図２に示した送信装置１５の信号処理
装置２１における音声符号部での符号化方法がＰＳＩ−
ＣＥＬＰ（Pitch Synchronus Innovation - CELP：ピッ
チ同期雑音励振源−ＣＥＬＰ）符号化方式によるもので
あるとすれば、デコーダ２７は、ＰＳＩ−ＣＥＬＰ符号
化による伝送信号を用いて音声をデコードし、出力端子
２８にデコード音声Ｓｎｄ_Nを、出力端子２９に線形予
測係数α_Nを、出力端子３０に励振源ＮＥｘｃ_Nを供給す
る。ＰＳＩ−ＣＥＬＰ符号化による伝送信号は、第１の
サンプリング周波数ｆ_s1＝８ＫＨｚの第１の帯域Ｂ₁＝
３００〜３４００Ｈｚの音声信号を生成するために伝送
されてきたものである。The encoding method in the speech encoding unit in the signal processing device 21 of the transmitting device 15 shown in FIG.
If it is based on the CELP (Pitch Synchronous Innovation-CELP) coding method, the decoder 27 decodes the sound using the transmission signal based on the PSI-CELP coding, and outputs an output terminal 28. supplying an excitation source NExc_N decoded audio Snd_N, the linear prediction coefficient alpha_N to the output terminal 29, an output terminal 30 to. The transmission signal by the PSI-CELP coding has a first sampling frequency f_s1 = first band B₁ of 8 kHz = B₁ =
It has been transmitted to generate an audio signal of 300 to 3400 Hz.

【００２５】信号切換部３２は、第１のサンプリング周
波数ｆ_s1（＝８ＫＨｚ）の音声信号を生成するために送
信装置から伝送されてきたＰＳＩ−ＣＥＬＰ符号による
伝送信号を使ってデコーダ２７が復号した第１の帯域Ｂ
₁（３００Ｈｚ〜３４００Ｈｚ）のデコード音声Ｓｎｄ_N
のサンプリングレートを第２のサンプリング周波数ｆ_s2
（＝１６ＫＨｚ）に変換するサンプリグレート変換手段
と、上記音声パラメータ符号を使って生成した第１のサ
ンプリング周波数ｆ_s1の第１の帯域Ｂ₁の音声信号に第
１のポストフィルタ処理を施すと共に、上記サンプリン
グレート変換手段からの第２のサンプリング周波数ｆ_s2
の第１の帯域Ｂ₁の音声信号に第２のポストフィルタ処
理を施すポストフィルタ手段と、このポストフィルタ手
段からの第１のフィルタ処理出力と第２のフィルタ処理
出力とを切り換える切り換え手段とを備える。The signal switching unit 32 uses the PSI-CELP code transmitted from the transmitting apparatus to generate a speech signal of the first sampling frequency f_s1 (= 8 KHz), and the decoder 27 decodes the signal. First band B
₁ (300Hz-3400Hz) decoded sound Snd_N
Of the second sampling frequency f_s2
(= 16 KHz) and a first post-filter process on the audio signal of the first band B₁ of the first sampling frequency f_s1 generated using the audio parameter code, The second sampling frequency f_s2 from the sampling rate conversion means
And post-filtering means for performing a second post-filtering process on the audio signal of the_first band B₁ , and switching means for switching between the first filter-processed output and the second filter-processed output from the post-filtering device. Prepare.

【００２６】さらに、この受信装置は、上記デコーダ２
７が上記ＰＳＩ−ＣＥＬＰ符号をデコードして得た線形
予測係数α_Nと、励振源ＮＥｘｃ_Nとを使って第２のサン
プリング周波数ｆ_s2（＝１６ＫＨｚ）の第２の帯域Ｂ₂
（３４００Ｈｚ〜６０００Ｈｚ）の信号を推測する帯域
外成分推測手段と、上記サンプリングレート変換手段か
らの第２のサンプリング周波数ｆ_s2の第１の帯域Ｂ₁の
音声信号に上記帯域外成分推測手段で推測された第２の
サンプリング周波数ｆ_s2の第２の帯域Ｂ₂の音声信号を
加算する加算手段とを備え、上記ポストフィルタ手段は
上記加算手段からの加算出力に第３のポストフィルタ処
理を施し、上記切り換え手段は上記第３のフィルタ処理
出力も上記第１及び第２のフィルタ処理出力とともに切
り換える。Further, the receiving apparatus is provided with the decoder 2
7 uses a linear prediction coefficient α_N obtained by decoding the PSI-CELP code and an excitation source NExc_N to obtain a second band B_{2 of a} second sampling frequency f_s2 (= 16 KHz).
And out-of-band component predicting unit to estimate signal (3400Hz~6000Hz), guess second first band B₁ of the audio signal to the out-of-band component predicting unit of the sampling frequency f_s2 from the sampling rate converting means It is second and an adding means for adding a second audio signal having a bandwidth B₂ of the sampling frequency f_s2 was, the post-filtering means performs a third post filter processing to the addition output from said adding means, The switching means switches the third filtered output together with the first and second filtered outputs.

【００２７】ここで、上記ポストフィルタ手段は図４に
示す第１のポストフィルタ処理（ａ）４７、第２のポス
トフィルタ処理（ｂ）４８、第３のポストフィルタ処理
（ｂ）４９を行う。これら各ポストフィルタ処理４７〜
４９は、上記ポストフィルタ手段の行う、ポストフィル
タ処理をブロックとして示したものである。第２又は第
３のポストフィルタ処理（ｂ）４８又は４９は、第１の
ポストフィルタ処理（ａ）４７を第２のサンプリング周
波数ｆ_s2の第１の帯域Ｂ₁又は広帯域Ｂ_Wの音声信号にお
けるｆ_s2／ｆ_s1倍のサンプルに対して施す。なお、上記
ポストフィルタ処理に付加している（ａ），（ｂ）は、
サンプリング周波数ｆ_s1で動作する処理と、ｆ_s2で動作
する処理を区別する記号である。Here, the post-filter means performs the first post-filter processing (a) 47, the second post-filter processing (b) 48, and the third post-filter processing (b) 49 shown in FIG. Each of these post-filter processes 47 to
Reference numeral 49 denotes the post-filter processing performed by the post-filter means as a block. The second or third post-filter processing (b) 48 or 49 performs the first post-filter processing (a) 47 on the audio signal of the first band B₁ or the wide band B_W at the second sampling frequency f_s2 . It is applied to a sample of_fs2 /_fs1 times. (A) and (b) added to the post-filter processing are
A process that operates at the sampling frequency f_s1, a processing to distinguish symbols operating at f_s2.

【００２８】上記サンプリングレート変換手段は図４に
おけるアップサンプル部４５である。上記切り換え手段
は切り換えスイッチ部１５０である。上記加算手段は加
算部４６である。そして、上記帯域外成分推測手段は、
アップサンプル部４５とポストフィルタ処理（ａ）４
７，（ｂ）４８及び（ｂ）４９と切り換えスイッチ部１
５０と加算部４６を除いた部分である。The sampling rate conversion means is the up-sampling section 45 in FIG. The switching means is the switching unit 150. The adding means is the adding unit 46. Then, the out-of-band component estimating means includes:
Up-sampling unit 45 and post-filter processing (a) 4
7, (b) 48 and (b) 49 and changeover switch unit 1
This is a part excluding 50 and the adder 46.

【００２９】以下、信号切換部３２の構成を詳細に説明
する。Hereinafter, the configuration of the signal switching section 32 will be described in detail.

【００３０】先ず、上記帯域外成分推測手段は、線形予
測係数→自己相関（α_N→ｒ_N）変換部３６と、自己相関
（ｒ）広帯域化部３７と、広帯域コードブック（ｒ_wＣ
Ｂ）３８と、自己相関→線形予測係数（ｒ_w→α_w）変換
部３９と、ＬＰＣ合成部４０と、励振源拡張部４１と、
高域抽出＆抑圧フィルタ４２と、乗算部４３とからな
る。First, the out-of-band component estimating means includes a linear prediction coefficient → autocorrelation (α_N → r_N ) conversion section 36, an autocorrelation (r) widening section 37, and a wide band codebook (r_w C
B) 38, an autocorrelation → linear prediction coefficient (r_w → α_w ) conversion unit 39, an LPC synthesis unit 40, an excitation source extension unit 41,
It comprises a high-frequency extraction & suppression filter 42 and a multiplier 43.

【００３１】入力端子３４から供給された線形予測係数
α_Nは、線形予測係数→自己相関（α_N→ｒ_N）変換部３
６に供給される。このα_N→ｒ_N変換部３６は、線形予測
係数α_Nを自己相関ｒ_Nに変換し、自己相関（ｒ）広帯域
化部３７に供給する。自己相関（ｒ）広帯域化部３７は
広帯域コードブック（ｒ_wＣＢ）３８を用いて自己相関
ｒを広帯域化（拡張化）する。広帯域コードブック（ｒ
_wＣＢ）３８は広帯域音から抽出した自己相関パラメー
タｒ_wを用いて予め作成されている。The linear prediction coefficient α_N supplied from the input terminal 34 is converted into a linear prediction coefficient → autocorrelation (α_N → r_N ) conversion unit 3
6. The α_N → r_N conversion unit 36 converts the linear prediction coefficient α_N into an autocorrelation r_N , and supplies the auto correlation r_N to the autocorrelation (r) widening unit 37. The autocorrelation (r) widening unit 37 widens (extends) the autocorrelation r using the wideband codebook (r_w CB) 38. Wideband codebook (r
_w CB) 38 is created in advance using the autocorrelation parameter r_w extracted from the wideband sound.

【００３２】広帯域コードブック（ｒ_wＣＢ）３８を用
い、自己相関（ｒ）広帯域化部３７が拡張した拡張自己
相関ｒ_wは自己相関→線形予測係数（ｒ_w→α_w）変換部
３９に供給される。ｒ_w→α_w変換部３９は拡張自己相関
ｒ_wを拡張線形予測係数α_wに再度変換してからＬＰＣ合
成部４０に供給する。[0032] Using the wide band code book (r_w CB) 38, extended autocorrelation r_w autocorrelation (r) broadband portion 37 is expanded in the autocorrelation → linear prediction coefficients (r_{_w} → α_w) conversion unit 39 Supplied. The r_w → α_w conversion unit 39 converts the extended auto-correlation r_w into the extended linear prediction coefficient α_w again, and supplies it to the LPC synthesis unit 40.

【００３３】ＬＰＣ合成部４０はｒ_w→α_w変換部３９か
らの広帯域線形予測係数α_wと後述する励振源拡張部４
１からの拡張励振源に基づいて広帯域音声を合成する。The LPC synthesizing section 40 receives the wideband linear prediction coefficient α_w from the r_w → α_w converting section 39 and the excitation source expanding section 4 described later.
A wideband speech is synthesized based on the extended excitation source from No. 1.

【００３４】ＬＰＣ合成部４０の合成出力は、高域抽出
＆抑圧フィルタ４２に供給される。高域抽出＆抑圧フィ
ルタ４２は、周波数帯域３００Ｈｚ〜３４００Ｈｚの信
号成分を除去し、第２の帯域Ｂ₂＝３４００Ｈｚ〜６０
００Ｈｚの信号成分を抽出するように、高い周波数成分
を抑圧する。このフィルタ４２からのフィルタ出力に
は、端子４４から供給されるゲインが乗算部４３で乗算
される。乗算部４３でゲインが乗算された出力（第２の
帯域Ｂ₂＝３４００Ｈｚ〜６０００Ｈｚ）は、加算部４
６に供給される。The combined output of the LPC combining section 40 is supplied to a high-frequency extraction and suppression filter 42. The high-frequency extraction & suppression filter 42 removes signal components in the frequency band of 300 Hz to 3400 Hz, and the second band B₂ = 3400 Hz to 60
High frequency components are suppressed so that 00 Hz signal components are extracted. The filter output from the filter 42 is multiplied by a gain supplied from a terminal 44 by a multiplier 43. The output (second band B₂ = 3400 Hz to 6000 Hz) multiplied by the gain in the multiplier 43 is output to the adder 4
6.

【００３５】上記ＬＰＣ合成部４０には、励振源拡張部
４１からの拡張励振源も供給される。励振源拡張部４１
は、入力端子３５から供給された励振源に関するパラメ
ータとしてのＬＰＣ残差（このＬＰＣ残差を励振源ＮＥ
ｘｃ_Nと記す。）を拡張する。励振源拡張部４１の詳細
な構成を図５に示す。The extended excitation source from the excitation source extension unit 41 is also supplied to the LPC synthesis unit 40. Excitation source expansion unit 41
Is the LPC residual as a parameter relating to the excitation source supplied from the input terminal 35 (this LPC residual is referred to as the excitation source NE
xc_N. ) To expand. FIG. 5 shows a detailed configuration of the excitation source extension unit 41.

【００３６】先ず、入力端子３５を介して供給された励
振源ＮＥｘｃ_Nは、アップサンプル部５０によりアップ
サンプルされる。アップサンプル部５０の出力は、ＬＰ
Ｆ５１、ブースト部５２を介して出力端子５５からＬＰ
Ｃ合成部４０に送られる。すなわち、励振源ＮＥｘｃ_N
をアップサンプルした信号は、音声信号を合成する際の
上記拡張励振源として用いられる。ブースト部５２は、
破擦音や摩擦音が検出された場合に、上記拡張励振源を
ブーストするためのもので、そのブースト量は破擦音検
出部５４の出力により制御される。破擦音検出部５４
は、入力端子５３を介して上記α_N→ｒ_N変換部３６から
の自己相関ｒ_Nを受け取り、破擦音や摩擦音を検出す
る。First, the excitation source NExc_N supplied via the input terminal 35 is up-sampled by the up-sampling section 50. The output of the up-sampling unit 50 is LP
F51, LP from output terminal 55 via boost section 52
It is sent to the C synthesizing unit 40. That is, the excitation source NExc_N
Is used as the above-mentioned extended excitation source when synthesizing the audio signal. The boost unit 52
This is for boosting the extended excitation source when an affricate or fricative is detected, and the boost amount is controlled by an output of the affricate detector 54. Affricate detector 54
Receives the autocorrelation r_N from the α_N → r_N conversion unit 36 via the input terminal 53 and detects affricate and fricative.

【００３７】このような構成の励振源拡張部４１からの
励振源が上記ＬＰＣ合成部４０に供給される。そして、
ＬＰＣ合成部４０は、ｒ_w→α_w変換部３９からの広帯域
線形予測係数α_wと上記拡張励振源に基づいて広帯域音
声を合成する。ここまでの構成が上記帯域外成分推測手
段である。The excitation source from the excitation source extension unit 41 having such a configuration is supplied to the LPC synthesis unit 40. And
LPC synthesis section 40 synthesizes a wideband speech based on the wideband linear prediction coefficient alpha_w and the extended excitation source from r_{_w} →_{α w} conversion unit 39. The configuration up to here is the out-of-band component estimating means.

【００３８】次に、入力端子３３を介して上記図３のデ
コーダ２７から供給されるデコード音声Ｓｎｄ_Nにポス
トフィルタ処理を施すポストフィルタについて説明す
る。Next, a description will be given of a post-filter for performing post-filter processing on the decoded sound Snd_N supplied from the decoder 27 of FIG. 3 through the input terminal 33.

【００３９】このポストフィルタは、本件出願人が既に
出願した、特開平９−１２７９９６号公報に開示されて
いる、音声復号化方法及び装置で適用している技術によ
り、上記デコード音声信号Ｓｎｄ_Nのスペクトル整形及
び聴感上の品質向上を実現する。[0039] The post-filter is present applicant has already filed by disclosed in Japanese Patent Laid-Open No. 9-127996, is applied in the speech decoding method and apparatus technology, the decoded audio signal Snd_N Realizes spectral shaping and quality improvement in audibility.

【００４０】図６には上記音声復号化方法及び装置を適
用したポストフィルタの詳細な構成を示す。ポストフィ
ルタの要部となるスペクトル整形フィルタ１３１は、ホ
ルマント強調フィルタ１３２と高域強調フィルタ１３３
とからなっている。このスペクトル整形フィルタ１３１
からの出力は、スペクトル整形によるゲイン変化を補正
するためのゲイン調整器１３４に送られており、このゲ
イン調整器１３４のゲインＧは、ゲイン制御部１３６に
より決定される。ゲイン制御部１３６は、スペクトル整
形フィルタ１３１の入力と出力とを比較してゲイン変化
を計算し、ゲイン調整器１３４のゲインＧの補正値を算
出する。ここで、スペクトル整形フィルタ１３１の上記
入力とは端子１３５を介して供給される、上記デコード
音声信号Ｓｎｄ_Nであり、上記出力とは端子１３７を介
してこのポストフィルタから導出されるフィルタ出力で
ある。このような構成のポストフィルタの詳細な動作に
ついては後述する。FIG. 6 shows a detailed configuration of a post filter to which the above-described speech decoding method and apparatus are applied. A spectrum shaping filter 131 which is a main part of the post filter includes a formant emphasis filter 132 and a high-frequency emphasis filter 133.
It consists of This spectrum shaping filter 131
Are sent to a gain adjuster 134 for correcting a gain change due to spectrum shaping. The gain G of the gain adjuster 134 is determined by the gain control unit 136. The gain control section 136 compares the input and output of the spectrum shaping filter 131 to calculate a gain change, and calculates a correction value of the gain G of the gain adjuster 134. Here, the input of the spectrum shaping filter 131 is the decoded audio signal Snd_N supplied via a terminal 135, and the output is a filter output derived from the post filter via a terminal 137. . The detailed operation of the post filter having such a configuration will be described later.

【００４１】次に、上記サンプリング周波数変換手段と
してのアップサンプル部４５は、サンプリング周波数が
第１のサンプリング周波数ｆ_s1＝８ｋＨｚの第１の帯域
Ｂ₁＝３００Ｈｚ〜３４００Ｈｚの音声信号のサンプリ
ング周波数を第２のサンプリング周波数ｆ_s2＝１６ｋＨ
ｚに変換する。このアップサンプル部４５からの、サン
プリング周波数が第２のサンプリング周波数ｆ_s2＝１６
ｋＨｚに変換された第１の帯域Ｂ₁＝３００Ｈｚ〜３４
００Ｈｚの音声信号成分は、加算部４６及び第２のポス
トフィルタ処理（ｂ）４８に供給される。Next, the up-sampling unit 45 as the sampling frequency conversion means converts the sampling frequency of the audio signal having the_first band B₁ = 300 Hz to 3400 Hz with the first sampling frequency f_s1 = 8 kHz. 2 sampling frequency f_s2 = 16 kHz
Convert to z. The sampling frequency from this up-sampling unit 45 is the second sampling frequency f_s2 = 16.
first band converted into the kHz B₁ = 300Hz~34
The 00 Hz audio signal component is supplied to the adder 46 and the second post-filter processing (b) 48.

【００４２】また、加算部４６が乗算部４３からの乗算
出力である、第２のサンプリング周波数ｆ_s2＝１６ｋＨ
ｚの第２の帯域Ｂ₂＝３４００Ｈｚ〜６０００Ｈｚの音
声信号成分に、アップサンプル部４５からの上記音声信
号成分を加算することによって得られた加算出力は第３
のポストフィルタ処理（ｂ）４９に供給される。The second sampling frequency f_s2 = 16 kHz, which is a multiplication output from the multiplication unit 43 by the addition unit 46.
the sound signal component of the second band B₂ = 3400Hz~6000Hz of z, the added output is the third obtained by adding the audio signal component from the up-sampling unit 45
(B) 49 of the post-filter processing.

【００４３】また、信号切換部３２は、上述したよう
に、上記切換手段として切換スイッチ１５０を備え、第
１のポストフィルタ処理（ａ）４７でスペクトル整形及
び聴感上の品質が向上された上記第１のサンプリング周
波数ｆ_s1（＝８ＫＨｚ）の第１の帯域Ｂ₁（３００Ｈｚ
〜３４００Ｈｚ）の音声信号と、第２のポストフィルタ
処理（ｂ）４８でスペクトル整形及び聴感上の品質が向
上された第２のサンプリング周波数ｆ_s2（＝１６ＫＨ
ｚ）の第１の帯域Ｂ₁（３００Ｈｚ〜３４００Ｈｚ）の
音声信号と、第３のポストフィルタ処理（ｂ）４９でス
ペクトル整形及び聴感上の品質が向上された第２のサン
プリング周波数ｆ_s2（＝１６ＫＨｚ）の広帯域Ｂ_w（３
００Ｈｚ〜６０００Ｈｚ）の音声信号を切り換える。Further, as described above, the signal switching section 32 includes the changeover switch 150 as the above-mentioned switching means, and the first post-filter processing (a) 47 improves the spectral shaping and the quality of the audibility by the first post-filter processing (a) 47. 1 sampling frequency f_s1 (= 8 KHz) in a first band B₁ (300 Hz
３3400 Hz) and a second sampling frequency f_s2 (= 16 KH) whose spectral shaping and audible quality have been improved by the second post-filter processing (b) 48
z) the audio signal of the first band B₁ (300 Hz to 3400 Hz) and the second sampling frequency f_s2 (= 16 KHz) broadband B_w (3
(00 Hz to 6000 Hz).

【００４４】切り換えスイッチ１５０は、上記第１のサ
ンプリング周波数ｆ_s1（＝８ＫＨｚ）の第１の帯域Ｂ₁
（３００Ｈｚ〜３４００Ｈｚ）の音声信号を被選択端子
ａで受け、第２のサンプリング周波数ｆ_s2（＝１６ＫＨ
ｚ）の第１の帯域Ｂ₁（３００Ｈｚ〜３４００Ｈｚ）の
音声信号を被選択端子ｂで受け、第２のサンプリング周
波数ｆ_s2（＝１６ＫＨｚ）の広帯域Ｂ_w（３００Ｈｚ〜
６０００Ｈｚ）の音声信号を被選択端子ｃで受ける。そ
して、切り換え制御信号端子１５１からの切り換え制御
信号に基づいて選択片ｄを切り換えることにより、いず
れか一の音声信号をＤ／Ａ変換器６に供給する。The changeover switch 150 is connected to the first band B₁ of the first sampling frequency f_s1 (= 8 KHz).
(300 Hz to 3400 Hz) is received at the selected terminal a, and the second sampling frequency f_s2 (= 16 KH)
z) receives an audio signal in the first band B₁ (300 Hz to 3400 Hz) at the selected terminal b, and outputs a wide band B_w (300 Hz to 300 kHz) with the second sampling frequency f_s2 (= 16 KHz).
6000 Hz) at the selected terminal c. Then, by switching the selection piece d based on the switching control signal from the switching control signal terminal 151, one of the audio signals is supplied to the D / A converter 6.

【００４５】以上の構成の信号切換部３２における、主
要な動作原理について以下に説明する。信号切換部３２
は、３００Ｈｚ〜３４００Ｈｚの第１の帯域Ｂ₁の音声
信号を生成するための音声パラメータ符号から３４００
Ｈｚ〜６０００Ｈｚという第２の帯域Ｂ₂用の音声符号
化パラメータを生成し、広帯域ＬＰＣ合成を行う。その
後、原音声の周波数帯域である低域（３００Ｈｚ〜３４
００Ｈｚ）側を、原音声を１６ＫＨｚにアップサンプル
したものに置換する。すなわち、高域通過フィルタを施
し高域（３４００Ｈｚ〜６０００Ｈｚ）のみを残し、こ
の高域成分の中でも高い周波数成分を抑圧し、さらにゲ
インを調整し、その後、原音声（３００Ｈｚ〜３４００
Ｈｚ）をアップサンプル（第２のサンプリング周波数ｆ
_s2）したものに加算して、第２のサンプリング周波数ｆ
_s2（＝１６ＫＨｚ）の広帯域Ｂ_w（３００Ｈｚ〜６００
０Ｈｚ）の音声信号を得る。The main operation principle of the signal switching section 32 having the above configuration will be described below. Signal switching unit 32
Is 3400 from a speech parameter code for generating a speech signal of a_first band B1 of 300 Hz to 3400 Hz.
A speech coding parameter for a_second band B2 of₂ Hz to 6000 Hz is generated, and wideband LPC synthesis is performed. Then, the low frequency band (300 Hz to 34 Hz)
00 Hz) side is replaced with the original sound up-sampled to 16 KHz. That is, a high-pass filter is applied to leave only the high band (3400 Hz to 6000 Hz), high frequency components among these high band components are suppressed, and the gain is further adjusted.
Hz) is up-sampled (second sampling frequency f
_s2 ) and add the second sampling frequency f
_s2 (= 16KHz) of broadband B_w (300Hz~600
0 Hz).

【００４６】ここで、音声パラメータ符号の広帯域化
（或いは拡張化）は、線形予測係数αの広帯域化、励振
源ＮＥｘｃ_Nの広帯域化の二つが必要である。また、α
の広帯域化には、αと相互に変換可能なパラメータであ
る自己相関ｒによるコードブックを予め作成しておく必
要がある。このコードブックによる量子化、逆量子化に
よって自己相関ｒが広帯域化される。Here, to widen (or expand) the speech parameter code, it is necessary to widen the linear prediction coefficient α and widen the excitation source NExc_N. Also, α
In order to widen the bandwidth, it is necessary to previously create a codebook based on autocorrelation r, which is a parameter that can be mutually converted with α. The band of the autocorrelation r is widened by quantization and inverse quantization by the codebook.

【００４７】先ず、線形予測係数αの広帯域化について
説明する。αはスペクトル包絡を表すフィルタ係数であ
ることに着目し、高域側を推定しやすい別のスペクトル
包絡を表すパラメータである自己相関ｒに一旦変換し、
これを広帯域化し、その後で広帯域（或いは拡張）自己
相関ｒ_wから広帯域（或いは拡張）線形予測係数α_wに逆
変換する。拡張にはベクトル量子化を用いる。狭帯域自
己相関ｒ_nをベクトル量子化し、そのインデックスから
対応するｒ_wを求めればよい。First, the widening of the linear prediction coefficient α will be described. Note that α is a filter coefficient representing a spectral envelope, and is temporarily converted into an autocorrelation r, which is a parameter representing another spectral envelope that makes it easy to estimate the high frequency side,
This is widened, and then inversely converted from the wideband (or extended) autocorrelation r_w to the wideband (or extended) linear prediction coefficient α_w . Vector quantization is used for extension. Narrowband autocorrelation r_n to vector quantization, may be obtained the corresponding r_w from that index.

【００４８】狭帯域自己相関と広帯域自己相関には、後
述するように一定の関係が成り立つため、広帯域自己相
関によるコードブックのみを用意すればよく、狭帯域自
己相関をこれによりベクトル量子化でき、また逆量子化
により広帯域自己相関が求まる。Since a certain relationship is established between the narrowband autocorrelation and the wideband autocorrelation as described later, only the codebook based on the wideband autocorrelation needs to be prepared, and the narrowband autocorrelation can be vector-quantized by this. Wideband autocorrelation is obtained by inverse quantization.

【００４９】狭帯域信号を、広帯域信号を帯域制限した
ものとすれば、広帯域自己相関と狭帯域自己相関には以
下の（１）式に示す関係がある。Assuming that the narrow-band signal is obtained by band-limiting the wide-band signal, the wide-band auto-correlation and the narrow-band auto-correlation have a relationship represented by the following equation (1).

【００５０】[0050]

【数１】(Equation 1)

【００５１】ここで、φは自己相関、ｘ_nは狭帯域信
号、ｘ_wは広帯域信号、ｈは帯域制限フィルタのインパ
ルス応答である。Here, φ is an autocorrelation,_xn is a narrow band signal,_xw is a wide band signal, and h is an impulse response of a band limiting filter.

【００５２】さらに、自己相関とパワースペクトルの関
係から、次の（２）式が得られる。Further, the following equation (2) is obtained from the relationship between the autocorrelation and the power spectrum.

【００５３】[0053]

【数２】(Equation 2)

【００５４】この帯域制限フィルタのパワー特性と等し
い周波数特性を持つ、もう一つの帯域制限フィルタを考
え、これをＨ’とすれば、上記（２）式は、次の（３）
式のようになる。If another band-limiting filter having a frequency characteristic equal to the power characteristic of this band-limiting filter is considered, and this is set to H ′, the above equation (2) becomes the following equation (3)
It looks like an expression.

【００５５】[0055]

【数３】(Equation 3)

【００５６】この新たなフィルタの通過域、阻止域は当
初の帯域制限フィルタと同等であり、減衰特性が２乗と
なる。したがって、この新たなフィルタもまた、帯域制
限フィルタといえる。これを考慮すると、狭帯域自己相
関は、広帯域自己相関と帯域制限のフィルタのインパル
ス応答との畳み込み、すなわち広帯域自己相関を帯域制
限したものと単純化される。すなわち、次の（４）式と
なる。The pass band and the stop band of this new filter are the same as those of the original band limiting filter, and the attenuation characteristic is squared. Therefore, this new filter can also be said to be a band limiting filter. With this in mind, narrowband autocorrelation is simplified to the convolution of broadband autocorrelation with the impulse response of a band-limited filter, ie, band-limited wideband autocorrelation. That is, the following equation (4) is obtained.

【００５７】[0057]

【数４】(Equation 4)

【００５８】以上より、狭帯域自己相関をベクトル量子
化するにあたっては、広帯域コードブックのみを用意す
れば、量子化時に必要な狭帯域ベクトルは演算により作
成が可能であり、狭帯域自己相関から予めコードブック
を用意しておく必要がないことが分かる。As described above, when performing vector quantization of the narrow-band autocorrelation, if only a wide-band codebook is prepared, the narrow-band vector required at the time of quantization can be created by calculation. It turns out that there is no need to prepare a codebook.

【００５９】さらに、各広帯域自己相関のｒ_wコードベ
クタは単調減少もしくはなだらかに増減するカーブを持
つために、上記Ｈ’により低域通過させても大きな変化
がなく、ｒ_n量子化は、直接ｒ_wコードブックで行える。
ただし、サンプリング周波数が１／２のため、１次おき
に比較する必要がある。[0059] Further, in order to have a curve r_w code vector of each wide-band autocorrelation increase or decrease monotonically decreasing or gradually, the H 'by no significant change be passed through a low-pass, r_n quantization directly It can be carried out in the r_w code book.
However, since the sampling frequency is 1/2, it is necessary to compare every other order.

【００６０】線形予測係数αの拡張は有声音（Ｖ）と無
声音（ＵＶ）に分けることによって、さらに精度良い拡
張が可能であるため、これも行っている。これに伴いコ
ードブックもＶ用、ＵＶ用の二つを用いている。The linear prediction coefficient α is expanded because it can be more accurately expanded by dividing it into voiced sound (V) and unvoiced sound (UV). Accordingly, two codebooks for V and UV are used.

【００６１】次に、励振源の拡張について説明する。Ｐ
ＳＩ−ＣＥＬＰにおいては狭帯域での励振源を、図５の
アップサンプル部５０でゼロ値を挿入することでアップ
サンプルし、エイリアシング歪みを発生させたものを用
いる。この方法は非常に単純であるが、元の音声のパワ
ーや調波構造の差分が保存されるので、励振源としては
十分な品質であるといえる。Next, expansion of the excitation source will be described. P
In the SI-CELP, an excitation source in a narrow band is upsampled by inserting a zero value in an upsampling unit 50 in FIG. 5 to generate an aliasing distortion. Although this method is very simple, it can be said that the quality is sufficient as an excitation source because the difference between the power and the harmonic structure of the original voice is preserved.

【００６２】そして、以上で得られた広帯域αと広帯域
励振源によりＬＰＣ合成部４０でＬＰＣ合成を行う。Then, LPC combining is performed by the LPC combining section 40 using the broadband α and the broadband excitation source obtained as described above.

【００６３】また、広帯域ＬＰＣ合成された音声は、こ
のままでは品質が悪いので、低域側はコーデック出力の
オリジナル音声Ｓｎｄ_Nで置換する。このために、合成
音のうち３４００Ｈｚ以上を抽出し、一方でコーデック
出力をｆs＝１６ＫＨｚにアップサンプルし、これらを
加算する。Since the sound obtained by wideband LPC synthesis is inferior in quality as it is, the original sound Snd_N of the codec output is replaced on the low frequency side. For this purpose, 3400 Hz or more is extracted from the synthesized sound, while the codec output is up-sampled to fs = 16 KHz, and these are added.

【００６４】このとき、乗算部４３で高域側に乗算する
ゲインをユーザの好みに応じてゲイン調整器で調整可能
としている。ユーザ毎の個人差が大きいため、この値を
可変にしている。高域側ゲインの値をユーザからの入力
により予め設定しておき、この値を参照し、乗算を行
う。At this time, the gain by which the multiplier 43 multiplies the high frequency side can be adjusted by the gain adjuster according to the user's preference. This value is variable because individual differences between users are large. The value of the high-frequency gain is set in advance by an input from the user, and multiplication is performed with reference to this value.

【００６５】また、加算前に高域側に対し、高域抽出＆
抑圧フィルタ４２で約６ＫＨｚ以上の成分を若干抑圧す
るフィルタリングを施すことで、聴きやすい音にしてい
る。このフィルタ係数を選択可能とし、予め選択された
フィルタにより処理を行うことで、好みに応じ高域側の
周波数帯域を選択可能とした。このフィルタの選択もユ
ーザの入力により設定する。Before addition, high-frequency extraction &
By applying a filtering that slightly suppresses a component of about 6 KHz or more by the suppression filter 42, a sound that is easy to hear is obtained. This filter coefficient is selectable, and processing is performed using a filter selected in advance, so that a higher frequency band can be selected as desired. The selection of this filter is also set by the user's input.

【００６６】なお、このフィルタ４２を用いての処理
は、低域側のパワー特性に影響を与えないため、加算後
に行っても良い。あるいは、あえて低域側にも影響のあ
るフィルタを加算後に施す事も可能である。以上により
広帯域音声が得られる。The processing using the filter 42 does not affect the power characteristics on the low frequency side, and may be performed after the addition. Alternatively, it is also possible to apply a filter that also affects the low-frequency side after addition. As described above, a wideband sound can be obtained.

【００６７】次に、以上の動作原理に基づいて、信号切
換部３２が広帯域音声信号を生成する動作について図７
のフローチャートを用いて説明する。Next, the operation of the signal switching section 32 for generating a wideband audio signal based on the above-described operation principle will be described with reference to FIG.
This will be described with reference to the flowchart of FIG.

【００６８】ステップＳ１で図４に示したα_N→ｒ_N変換
部３６は、図３に示したデコーダ２７によりデコードさ
れた線形予測係数α_Nを自己相関ｒ_Nに変換する。また、
デコーダ２７でデコードされた音声信号Ｓｎｄ_Nはステ
ップＳ２でＶ／ＵＶ判定される。In step S1, the α_N → r_N conversion section 36 shown in FIG. 4 converts the linear prediction coefficient α_N decoded by the decoder 27 shown in FIG. 3 into an autocorrelation r_N. Also,
The audio signal Snd_N decoded by the decoder 27 is subjected to V / UV determination in step S2.

【００６９】このステップＳ２での判定結果がＶである
と、ステップＳ４では有声音用自己相関ｒ_Nを量子化す
る。この量子化は、ステップＳ３で求めた狭帯域Ｖ用パ
ラメータを用いる。すなわち、広帯域Ｖのコードブック
３８から、１次おきに比較して求めた狭帯域Ｖ用パラメ
ータを用いる。If the result of the determination in step S2 is V, in step S4 the voiced autocorrelation r_N is quantized. This quantization uses the narrowband V parameter obtained in step S3. That is, the parameters for the narrow band V obtained by comparing every other order from the code book 38 of the wide band V are used.

【００７０】一方、ステップＳ２での判定結果がＵＶで
あるときには、ステップＳ４ではステップＳ３で求めた
狭帯域ＵＶ用パラメータを用いて無声音用自己相関ｒを
量子化する。On the other hand, when the result of the determination in step S2 is UV, in step S4 the autocorrelation r for unvoiced sound is quantized using the narrow-band UV parameters obtained in step S3.

【００７１】そして、ステップＳ５でそれぞれ広帯域Ｖ
コードブック又は広帯域ＵＶコードブックを用いて逆量
子化し、これにより広帯域自己相関ｒ_Wが得られる。広
帯域自己相関ｒ_WはステップＳ６でｒ_W→α_W変換部３９
によりα_Wに変換される。Then, in step S5, the wide band V
Inverse quantization using a codebook or a wideband UV codebook, which results in a wideband autocorrelation r_W. The broadband autocorrelation r_W is calculated in step S 6 by r_W → α_W converter 39
To α_W.

【００７２】一方、デコーダ２７からの励振源は、ステ
ップＳ７で図５に示したアップサンプル部５０によりサ
ンプル間にゼロが詰められることでアップサンプルさ
れ、エイリアシングにより広帯域化される。これが広帯
域励振源として、ＬＰＣ合成部４０に供給される。On the other hand, the excitation source from the decoder 27 is up-sampled by padding zeros between the samples by the up-sampling unit 50 shown in FIG. 5 in step S7, and is widened by aliasing. This is supplied to the LPC synthesis section 40 as a broadband excitation source.

【００７３】そして、ステップＳ８で、ＬＰＣ合成部４
０が広帯域α_Wと広帯域励振源とを、ＬＰＣ合成し、広
帯域の音声信号が得られる。Then, in step S8, the LPC synthesizing unit 4
0 indicates that the wideband α_W and the wideband excitation source are LPC-combined to obtain a wideband audio signal.

【００７４】しかし、このままでは予測によって求めら
れた広帯域信号にすぎず、予測による誤差が含まれてい
るので品質が悪い。特に入力狭帯域音声の周波数範囲
（３００Ｈｚ〜３４００Ｈｚ）に関しては、コーデック
出力のオリジナル音声Ｓnd_N（入力音声）をそのまま利
用したほうが良い。However, if this is the case, it is merely a wideband signal obtained by prediction, and the quality is poor because it contains errors due to prediction. In particular, regarding the frequency range of the input narrowband audio (300 Hz to 3400 Hz), it is better to use the original audio Snd_N (input audio) output from the codec as it is.

【００７５】したがって、ＬＰＣ合成部４０からの合成
音のうち、入力狭帯域音声の周波数範囲３００〜３４０
０ＨｚをステップＳ９でバンドストップフィルタ（ＢＳ
Ｐ）を用いたフィルタリングにより除去する。Therefore, of the synthesized sounds from the LPC synthesizing section 40, the frequency range of the input narrowband sound is 300 to 340.
0 Hz is set to the band stop filter (BS
It is removed by filtering using P).

【００７６】そして、ステップＳ１０でアップサンプル
部４５により上記オリジナル音声Ｓｎｄ_Nをアップサン
プルしたものと、ステップＳ１３で加算部４６により加
算する。このとき、ステップＳ１１で高域側に対し、約
６ＫＨｚ以上の成分を若干抑圧する高域抽出＆抑圧フィ
ルタ４２によりフィルタリングすることで、聴きやすい
音にしている。このフィルタ係数は上述したように選択
可能とされている。Then, in step S10, the up-sampler 45 up-samples the original sound Snd_N and adds it in step S13 by the adder 46. At this time, by filtering the high-frequency side with a high-frequency extraction and suppression filter 42 that slightly suppresses a component of about 6 KHz or more in step S11, the sound is easy to hear. This filter coefficient can be selected as described above.

【００７７】さらに、ステップＳ１２では、乗算部４３
を用いてユーザの好みに応じて高域側ゲインを調整可能
としている。Further, in step S12, the multiplication section 43
To adjust the high-frequency gain according to the user's preference.

【００７８】なお、ここで、信号切換部３２で用いる、
コードブックの作成について説明する。コードブックの
作成は一般によく知られたＧＬＡ(Generalized Lloyd A
lgorithm)による方法である。広帯域音声を一定時間、
例えば２０msecごとのフレームに区切り、そのフレーム
毎に、一定次例えば６次までの自己相関を求めておく。
このフレーム毎の自己相関をトレーニングデータとし、
６次元のコードブックを作成する。このとき、有声音、
無声音の区別を行い、有声音の自己相関、無声音の自己
相関を別々に集め、それぞれのコードブックを作成して
もよい。この場合、帯域拡張処理中αの拡張時、コード
ブックを参照するが、このときにも有声音、無声音の判
別を行い、対応するコードブックを利用する。Here, the signal switching unit 32 uses
The creation of a codebook will be described. The creation of the codebook is generally well-known by GLA (Generalized Lloyd A
lgorithm). Broadband audio for a certain time,
For example, the frame is divided into frames every 20 msec, and the autocorrelation of a certain order, for example, the sixth order is obtained for each frame.
The autocorrelation for each frame is used as training data,
Create a 6-dimensional codebook. At this time, voiced sound,
Unvoiced sounds may be distinguished, and the autocorrelation of voiced sounds and the autocorrelation of unvoiced sounds may be separately collected to create respective codebooks. In this case, the code book is referred to when α is expanded during the band expansion processing. At this time, a voiced sound or an unvoiced sound is determined, and the corresponding code book is used.

【００７９】信号切換部３２では、広帯域有声音用コー
ドブックと広帯域無声音用コードブックを用いている。
この広帯域有声音用コードブックの作成については図８
を、広帯域無声音用コードブックの作成については図９
を参照しながら説明する。The signal switching section 32 uses a codebook for wideband voiced sound and a codebook for wideband unvoiced sound.
For the creation of the codebook for the wideband voiced sound, see FIG.
Figure 9 shows how to create a codebook for wideband unvoiced sound.
This will be described with reference to FIG.

【００８０】先ず、広帯域音声信号を学習用に用意し、
図８のステップＳ３１で１フレーム２０msecにフレーミ
ングする。次に、ステップＳ３２で各フレームにおい
て、例えばフレームエネルギーやゼロクロスの値等を調
べることによって有声音（Ｖ）か無声音（ＵＶ）かの分
類を行う。First, a wideband audio signal is prepared for learning,
In step S31 in FIG. 8, framing is performed for 20 msec per frame. Next, in step S32, for each frame, classification is performed as to whether it is a voiced sound (V) or an unvoiced sound (UV) by examining, for example, a frame energy, a value of zero crossing, and the like.

【００８１】そして、ステップＳ３３で広帯域有声音フ
レームにおいて、例えば６次までの自己相関パラメータ
ｒを計算する。また、ステップＳ３４では広帯域無声音
フレームにおける、例えば６次までの自己相関パラメー
タｒを求める。Then, in step S33, for example, the autocorrelation parameter r up to the sixth order is calculated in the wideband voiced sound frame. In step S34, for example, the autocorrelation parameter r up to the sixth order in the wideband unvoiced sound frame is obtained.

【００８２】この各フレームの６次の自己相関パラメー
タから、図９のステップＳ４１で広帯域パラメータを抽
出し、ＧＬＡにより次元６の広帯域Ｖ（ＵＶ）コードブ
ックをステップＳ４２で作成する。A wideband parameter is extracted from the sixth-order autocorrelation parameters of each frame in step S41 of FIG. 9 and a wideband V (UV) codebook of dimension 6 is created in step S42 by GLA.

【００８３】以上のようにして広帯域有声音用及び広帯
域無声音用コードブックを作成できる。As described above, a codebook for a wideband voiced sound and a wideband unvoiced sound can be created.

【００８４】次に、上記図６に示したポストフィルタの
動作について詳細に説明する。Next, the operation of the post filter shown in FIG. 6 will be described in detail.

【００８５】図６のスペクトル整形フィルタ１３１の特
性ＰＦ(Ｚ)は、線形予測係数αiを用いると、次の
（５）式のように表せる。The characteristic PF (Z) of the spectrum shaping filter 131 shown in FIG. 6 can be expressed by the following equation (5) using the linear prediction coefficient αi.

【００８６】[0086]

【数５】(Equation 5)

【００８７】この（５）式の分数部分がホルマント強調
フィルタ特性を、（１−ｋｚ^-1）の部分が高域強調フィ
ルタ特性をそれぞれ表す。また、β，γ，ｋは定数であ
り、一例としてβ＝0.6，γ＝0.8，ｋ＝0.3を挙げるこ
とができる。The fractional part of the equation (5) represents the formant enhancement filter characteristic, and the part (1-kz⁻¹ ) represents the high-frequency enhancement filter characteristic. Β, γ, and k are constants, for example, β = 0.6, γ = 0.8, and k = 0.3.

【００８８】また、ゲイン調整部１３４のゲインＧは、
次の（６）式のように表せる。The gain G of the gain adjusting unit 134 is
It can be expressed as the following equation (6).

【００８９】[0089]

【数６】(Equation 6)

【００９０】この式中のｘ（ｉ）はスペクトル整形フィ
ルタ１３１の入力、すなわち上記広帯域音声信号Ｓｎｄ
_wであり、ｙ（ｉ）はスペクトル整形フィルタの出力で
ある。X (i) in this equation is the input of the spectrum shaping filter 131, that is, the wideband audio signal Snd
_w and y (i) is the output of the spectral shaping filter.

【００９１】ここで、上記スペクトル整形フィルタ１３
１の係数の更新周期は、図１０に示すように、ＬＰＣ合
成部４０の係数であるα_wの更新周期と同じく、２０サ
ンプル、２．５ｍｓｅｃであるのに対し、ゲイン調整部
１３４のゲインＧの更新周期は、１６０サンプル、２０
ｍｓｅｃである。Here, the spectrum shaping filter 13
As shown in FIG. 10, the update cycle of the coefficient of 1 is 20 samples and 2.5 msec, similarly to the update cycle of the coefficient α_w of the LPC synthesis unit 40, whereas the gain G of the gain adjustment unit 134 is Update cycle is 160 samples, 20
msec.

【００９２】このように、ポストフィルタのスペクトル
整形フィルタ１３１の係数の更新周期に比較して、ゲイ
ン調整部１３４のゲインＧの更新周期を長くとることに
より、ゲイン調整の変動による悪影響を防止している。As described above, the update cycle of the gain G of the gain adjustment unit 134 is made longer than the update cycle of the coefficient of the spectrum shaping filter 131 of the post filter, thereby preventing adverse effects due to fluctuations in gain adjustment. I have.

【００９３】すなわち、一般のポストフィルタにおいて
は、スペクトル整形フィルタの係数の更新周期とゲイン
の更新周期とを同じにしており、このとき、ゲインの更
新周期を２０サンプル、２．５ｍｓｅｃとすると、図１
０からも明らかなように、１ピッチ周期の中で変動する
ことにより、クリックノイズを生じる原因となる。そこ
で、ポストフィルタでは、ゲインの切換周期をより長
く、例えば１フレーム分の１６０サンプル、２０ｍｓｅ
ｃとすることにより、ゲインの変動を防止することがで
きる。また逆に、スペクトル整形フィルタ１３１の係数
の更新周期を１６０サンプル、２０ｍｓｅｃと長くする
ときには、短時間の音声スペクトルの変化にポストフィ
ルタ特性が追従できず、良好な聴感上の品質改善が行え
ないが、このフィルタ係数の更新周期を２０サンプル、
２．５ｍｓｅｃと短くすることにより、効果的なポスト
フィルタ処理が可能となる。That is, in a general post-filter, the update cycle of the coefficient of the spectrum shaping filter and the update cycle of the gain are set to be the same. At this time, if the update cycle of the gain is 20 samples and 2.5 msec, FIG. 1
As is clear from 0, the fluctuation within one pitch period causes click noise. Therefore, in the post filter, the switching period of the gain is made longer, for example, 160 samples for one frame, 20 msec.
By setting c, it is possible to prevent a change in gain. Conversely, when the update cycle of the coefficient of the spectrum shaping filter 131 is increased to 160 samples and 20 msec, the post-filter characteristic cannot follow a short-time change in the audio spectrum, and good audibility quality cannot be improved. , The update cycle of this filter coefficient is 20 samples,
By making the length as short as 2.5 msec, effective post-filter processing can be performed.

【００９４】ところで、このポストフィルタは、上記第
１のサンプリング周波数ｆ_s1（８ＫＨｚ）の音声信号を
生成するために送信装置から伝送されてきた伝送信号に
基づく音声パラメータ符号（例えばα）を用いて上記デ
コード音声信号に第１のポストフィルタ処理（ａ）４７
を施しているが、上記第２のポストフィルタ処理（ｂ）
４８及び第３のポストフィルタ処理（ｂ）４９が実際に
ポストフィルタ処理を施すのは、第２のサンプリング周
波数ｆ_s2（１６ＫＨｚ）とされた音声信号に対してであ
る。このため、第２のポストフィルタ処理（ｂ）４８及
び第３のポストフィルタ処理（ｂ）４９は、上記第１の
ポストフィルタ処理（ａ）４７をサンプリング周波数が
１６ＫＨｚの音声信号における２（＝ｆ_s2／ｆ_s1）倍の
サンプルに対して施す。By the way, this post-filter uses an audio parameter code (for example, α) based on a transmission signal transmitted from a transmission device to generate an audio signal of the first sampling frequency f_s1 (8 KHz). First post-filter processing (a) 47 on the decoded audio signal
But the second post-filter processing (b)
48 and the third post-filter processing (b) 49 actually perform post-filter processing on an audio signal having the second sampling frequency f_s2 (16 KHz). For this reason, the second post-filter processing (b) 48 and the third post-filter processing (b) 49 perform the first post-filter processing (a) 47 on the basis of 2 (= f_s2 /_fs1 ) times as many samples.

【００９５】このようにして、第１のポストフィルタ処
理（ａ）４７は上記デコード音声信号のスペクトル整形
及び聴感上の品質を効果的に向上できる。また、第２の
ポストフィルタ処理（ｂ）４８及び第３のポストフィル
タ処理（ｂ）４９は第２のサンプリング周波数ｆ_s2（１
６ＫＨｚ）とされた第１の帯域Ｂ₁及び広帯域Ｂ_Wの音声
信号のスペクトル整形及び聴感上の品質を効果的に向上
できる。In this manner, the first post-filter processing (a) 47 can effectively improve the spectral shaping of the decoded audio signal and the quality of the audibility. The second post-filter processing (b) 48 and the third post-filter processing (b) 49_perform the second sampling frequency f_s2 (1
The spectrum shaping and audible quality of the audio signal of the first band B₁ and the wide band B_W of 6 KHz) can be effectively improved.

【００９６】そして、図４に示した信号切換部３２は、
切り換えスイッチ１５０により、第１のポストフィルタ
処理（ａ）４７，第２のポストフィルタ（ｂ）４８及び
第３のポストフィルタ（ｂ）４９でスペクトル整形及び
聴感上の品質が効果的に向上された音声信号、つまりサ
ンプリング周波数が８ＫＨｚの第１の帯域Ｂ₁（３００
〜３４００Ｈｚ）の音声信号と、サンプリング周波数が
１６ＫＨｚの第１の帯域Ｂ₁（３００〜３４００Ｈｚ）
の音声信号と、サンプリング周波数が１６ＫＨｚの広帯
域Ｂ_W（３００〜６０００Ｈｚ）の広帯域音声信号とを
切り換えてＤ／Ａ変換器６に送ることができる。The signal switching section 32 shown in FIG.
With the changeover switch 150, the first post-filter processing (a) 47, the second post-filter (b) 48, and the third post-filter (b) 49 effectively improve the spectral shaping and audibility. An audio signal, that is, a first band B₁ (300
３3400 Hz) and a first band B₁ (300 to 3400 Hz) with a sampling frequency of 16 KHz.
And audio signal can be sent to the D / A converter 6 sampling frequency by switching between wideband speech signal of the wide band B_W (300~6000Hz) of 16 KHz.

【００９７】このため、上記図１に示した受信装置１
は、サンプリング周波数が８ＫＨｚ，１６ＫＨｚと異な
る、第１の帯域Ｂ₁（３００Ｈｚ〜３４００Ｈｚ）のＰ
ＳＩ−ＣＥＬＰによる受話音声信号や、サンプリング周
波数が１６ＫＨｚの広帯域（３００Ｈｚ〜６０００Ｈ
ｚ）のＰＳＩ−ＣＥＬＰによる受話音声信号にポストフ
ィルタ処理を施した上で、ユーザに選択させることがで
きる。ユーザ側では選択肢が広がる。また、状況に応じ
て受話音声を帯域拡張するだけでなく、入力時の帯域と
同様にすることができるので、内蔵のバッテリーの減り
を抑えることもできる。Therefore, the receiving apparatus 1 shown in FIG.
Is the P of the_first band B₁ (300 Hz to 3400 Hz) whose sampling frequency is different from 8 kHz and 16 kHz.
The received voice signal by SI-CELP and a wide band (300 Hz to 6000H) with a sampling frequency of 16 KHz
After subjecting the received voice signal by the PSI-CELP of z) to post-filter processing, the user can make a selection. The user has more options. In addition, not only the band of the received voice can be extended according to the situation, but also the band can be made the same as the band at the time of input, so that the built-in battery can be reduced.

【００９８】なお、Ｄ／Ａ変換器６でのサンプリング周
波数を１６ＫＨｚに固定して、１６ＫＨｚ固定での第１
の帯域Ｂ₁の音声信号と、広帯域Ｂ_Wの音声信号を切り換
えるようにしてもよい。Ｄ／Ａ変換器６で用いるクロッ
クを８Ｋｚ／１６ＫＨｚと切り換えなくて済むため、ハ
ードウェア負担を減らすことができる。The sampling frequency of the D / A converter 6 is fixed at 16 KHz, and the first frequency is fixed at 16 KHz.
And the audio signal having a bandwidth B₁ in, may be switched to the audio signal of the wide band B_W. Since the clock used in the D / A converter 6 does not need to be switched to 8 kHz / 16 kHz, the hardware load can be reduced.

【００９９】また、アップサンプル部４５では、切り換
えスイッチ１５０における、サンプリング周波数の８Ｋ
Ｈｚ／１６ＫＨｚ切り換え持に、フィルタ出力をクリア
しておく。ノイズ発生を防ぐためである。In the up-sampling section 45, the changeover switch 150 sets the sampling frequency to 8K.
The filter output is cleared before switching between Hz and 16 kHz. This is to prevent generation of noise.

【０１００】次に、図１の受信装置１内部の信号処理装
置５の他の具体例について図１１〜図１３を用いて説明
する。この他の具体例は、図１１に示すデコーダ５８
と、図１２に示す信号切換部６５とを備えてなる。Next, another specific example of the signal processing device 5 inside the receiving device 1 of FIG. 1 will be described with reference to FIGS. Another specific example is the decoder 58 shown in FIG.
And a signal switching unit 65 shown in FIG.

【０１０１】上記図２に示した送信装置１５の信号処理
装置２１における音声符号器での符号化方法がＶＳＥＬ
Ｐ（Vector Sum Excited Linear Prediction：ベクトル
和励起線形予測）符号化方式によるものであるとすれ
ば、デコーダ５８はＶＳＥＬＰ符号化による伝送信号を
デコードして出力端子５９にデコード音声Ｓｎｄ_Nを、
出力端子６０に線形予測係数α_Nを、出力端子６１に励
振源１Ｅｘｃ_N1を、出力端子６２に励振源２Ｅｘｃ_N2を
供給する。The encoding method in the speech encoder in the signal processing device 21 of the transmitting device 15 shown in FIG.
If it is based on the P (Vector Sum Excited Linear Prediction) encoding method, the decoder 58 decodes the transmission signal by VSELP encoding and outputs the decoded audio Snd_N to the output terminal 59.
The linear prediction coefficient α_N is supplied to the output terminal 60, the excitation source 1Exc_N1 is supplied to the output terminal 61, and the excitation source 2Exc_N2 is supplied to the output terminal 62.

【０１０２】信号切換部６５は、図１２に示すような構
成であり、上記図４に示した信号切換部３２と異なるの
は励振源切換＆拡張部６８を設けている点である。The signal switching unit 65 has a configuration as shown in FIG. 12, and is different from the signal switching unit 32 shown in FIG. 4 in that an excitation source switching and extension unit 68 is provided.

【０１０３】ＰＳＩ−ＣＥＬＰは、コーデック自体、特
に有声音Ｖを聴感上滑らかに聞こえるような処理を行っ
ているが、ＶＳＥＬＰにはこれがなく、このために帯域
幅拡張したときに若干雑音が混入したように聞こえる。
そこで、広帯域励振源を作成する際に、励振源を切り換
える部を内部に備えた励振源切換＆拡張部６８を用い、
図１２に示すような処理を施す。この図１２に示す処理
は、上記図７に示した励振源処理をステップＳ８７〜ス
テップＳ８９のように変えたものである。The PSI-CELP performs a process for allowing the codec itself, particularly the voiced sound V, to be heard audibly smoothly. However, the VSELP does not have this, and therefore, when the bandwidth is expanded, noise is mixed slightly. Sounds like.
Therefore, when creating a broadband excitation source, an excitation source switching & extension unit 68 having a unit for switching the excitation source is used.
The processing shown in FIG. 12 is performed. The processing shown in FIG. 12 is obtained by changing the excitation source processing shown in FIG. 7 to steps S87 to S89.

【０１０４】ＶＳＥＬＰの励振源は、コーデックに利用
されるパラメータβ(長期予測係数), bL[i](長期フィル
タ状態),γ(利得), c1[i](励起コードベクタ)により、 β * bL[i] + γ * c1[i] として作成されるが、このうち前者がピッチ成分、後者
がノイズ成分を表すので、これをβ * bL[i]とγ * c1
[i]に分け、ステップＳ８７で、一定の時間範囲におい
て、前者のエネルギーが大きい場合にはピッチが強い有
声音と考えられるため、ステップＳ８８でＹＥＳに進
み、励振源をパルス列とし、ピッチ成分のない部分では
ＮＯに進み０に抑圧した。また、ステップＳ８７でエネ
ルギーが大きくない場合には従来どおりとし、こうして
作成された狭帯域励振源にステップＳ８９でゼロ詰め処
理によりPSI-CELP同様０を詰めアップサンプルすること
で広帯域励振源とした。これにより、ＶＳＥＬＰにおけ
る有声音の聴感上の品質が向上する。The excitation source of VSELP is represented by β * using parameters β (long-term prediction coefficient), bL [i] (long-term filter state), γ (gain), and c1 [i] (excitation code vector) used for the codec. bL [i] + γ * c1 [i], the former of which represents the pitch component and the latter of which represents the noise component, which are represented by β * bL [i] and γ * c1
[i], and if the former energy is large in a certain time range in step S87, the voice is considered to be a voiced sound having a strong pitch. Therefore, the process proceeds to YES in step S88, the excitation source is set to a pulse train, and the pitch component When there was no part, the process proceeded to NO and suppressed to zero. If the energy is not large in step S87, the conventional narrow band excitation source is filled up with zero by PZ-CELP in step S89 by zero padding in step S89 to obtain a wide band excitation source. As a result, the auditory quality of voiced sound in VSELP is improved.

【０１０５】そして、ステップＳ９２でアップサンプル
部４５により上記オリジナル音声Ｓｎｄ_Nをアップサン
プルしたものと、ステップＳ９５で加算部４６により加
算する。このとき、ステップＳ９１で高域側に対し、約
６ＫＨｚ以上の成分を若干抑圧する高域抽出＆抑圧フィ
ルタ４２によりフィルタリングを施すことで、聴きやす
い音にしている。このフィルタ係数は上述したように選
択可能としている。Then, in step S92, the upsampling section 45 upsamples the original sound Snd_N and adds it in step S95 with the adding section 46. At this time, the high-frequency side is filtered by a high-frequency extraction and suppression filter 42 that slightly suppresses a component of about 6 KHz or more in step S91, so that the sound is easy to hear. This filter coefficient is selectable as described above.

【０１０６】さらに、ステップＳ９３では、乗算部４３
を用いてユーザの好みに応じて高域側ゲインを調整可能
としている。Further, in step S93, the multiplication section 43
To adjust the high-frequency gain according to the user's preference.

【０１０７】この信号切換部６５でも第１のポストフィ
ルタ処理（ａ）４７，第２のポストフィルタ処理（ｂ）
４８及び第３のポストフィルタ処理（ｂ）４９を行うポ
ストフィルタを備えている。第１のポストフィルタ処理
（ａ）４７は上記デコード音声信号のスペクトル整形及
び聴感上の品質を効果的に向上でき、第２のポストフィ
ルタ処理（ｂ）４８及び第３のポストフィルタ処理
（ｂ）４９は第２のサンプリング周波数ｆ_s2（１６ＫＨ
ｚ）とされた第１の帯域Ｂ₁及び広帯域Ｂ_Wの音声信号の
スペクトル整形及び聴感上の品質を効果的に向上でき
る。The signal switching section 65 also performs the first post-filter processing (a) 47 and the second post-filter processing (b).
48 and a post filter for performing the third post filter processing (b) 49. The first post-filter processing (a) 47 can effectively improve the spectral shaping and audibility of the decoded audio signal, and the second post-filter processing (b) 48 and the third post-filter processing (b) 49 is the second sampling frequency f_s2 (16 KH
z) It is possible to effectively improve the spectral shaping and audible quality of the audio signals of the first band B₁ and the wide band B_{W set} as z).

【０１０８】したがって、ＶＳＥＬＰによる復号化方法
を用いた信号切換部６５でも、ユーザの好みに基づい
て、サンプリング周波数が８ＫＨｚの第１の帯域Ｂ
₁（３００〜３４００Ｈｚ）の音声信号，サンプリング
周波数が１６ＫＨｚの第１の帯域Ｂ₁の音声信号又はサ
ンプリング周波数が１６ＫＨｚの広帯域Ｂ_Wの音声信号
のスペクトル整形及び聴感上の品質を効果的に向上した
上で切り換えてＤ／Ａ変換器６に送ることができる。Therefore, even in the signal switching section 65 using the decoding method based on VSELP, the first band B having a sampling frequency of 8 KHz is used based on the user's preference.
Audio signal₁ (ranging from 300 to 3400 Hz), the sampling frequency is first audio signal or the sampling frequency of the band B₁ of 16KHz has improved the quality of the spectral shaping and audibility of the audio signal of the wide band B_W of 16KHz effectively It can be switched above and sent to the D / A converter 6.

【０１０９】このため、上記図１に示した受信装置１
は、サンプリング周波数が８ＫＨｚ，１６ＫＨｚと異な
る、第１の帯域Ｂ₁（３００Ｈｚ〜３４００Ｈｚ）のＶ
ＳＥＬＰによる受話音声信号や、サンプリング周波数が
１６ＫＨｚの広帯域（３００Ｈｚ〜６０００Ｈｚ）のＶ
ＳＥＬＰによる受話音声信号にポストフィルタ処理を施
した上で、ユーザに選択させることができる。ユーザ側
では選択肢を広げることができる。状況に応じてＶＳＥ
ＬＰによる受話音声を帯域拡張するだけでなく、入力時
の帯域と同様にすることができるので、内蔵のバッテリ
ーの減りを抑えることもできる。For this reason, the receiving apparatus 1 shown in FIG.
Is the V of the_first band B₁ (300 Hz to 3400 Hz) whose sampling frequency is different from 8 kHz and 16 kHz.
Received voice signal by SELP and V of wide band (300Hz-6000Hz) with sampling frequency of 16KHz
After performing post-filter processing on the received voice signal by the SELP, the user can make a selection. The user has more options. VSE depending on the situation
In addition to extending the band of the voice received by the LP, the band can be made the same as the band at the time of input, so that the built-in battery can be reduced.

【０１１０】さらに、図１の受信装置１内部の信号処理
装置５としては、図１４に示す信号切換部７０とその前
段の、図１５に示すデコード部とからなる信号処理装置
を他の具体例としてもよい。Further, as the signal processing device 5 in the receiving device 1 of FIG. 1, a signal processing device comprising a signal switching unit 70 shown in FIG. 14 and a decoding unit shown in FIG. It may be.

【０１１１】図１５に示したデコード部は、ＶＳＥＬＰ
デコーダ７７とＰＳＩ−ＣＥＬＰデコーダ８１とを備
え、送信装置側から伝送されてくる、音声パラメータ符
号の符号化方式に応じて、デコーダ７７又は８１への音
声パラメータ符号の入力を切り換える。つまり、入力端
子７５を介して受け取った上記音声パラメータ符号を切
換スイッチ７６で、上記符号化方式の種類、つまりＶＳ
ＥＬＰ又はＰＳＩ-ＣＥＬＰに応じて切り換えている。The decoding section shown in FIG.
It includes a decoder 77 and a PSI-CELP decoder 81, and switches the input of the speech parameter code to the decoder 77 or 81 according to the encoding method of the speech parameter code transmitted from the transmitting device side. That is, the voice parameter code received via the input terminal 75 is switched by the changeover switch 76 to the type of the coding method, that is, VS.
Switching is performed according to ELP or PSI-CELP.

【０１１２】ＶＳＥＬＰデコーダ７７からの二つの励振
源１Ｅｘｃ_N1及び励振源２Ｅｘｃ_N2は出力端子７８及び
７９を介して図１４の入力端子６６及び６７に供給され
る。また、ＰＳＩ-ＣＥＬＰデコーダ８１からの励振源
ＮＥｘｃ_Nは出力端子８２を介して図１４の入力端子３
５に供給される。The two excitation sources 1Exc_N1 and 2Exc_N2 from the VSELP decoder 77 are supplied via the output terminals 78 and 79 to the input terminals 66 and 67 of FIG. Moreover, the excitation source NExc_N from PSI-CELP decoder 81 input terminal 3 of FIG. 14 through the output terminal 82
5 is supplied.

【０１１３】また、ＶＳＥＬＰデコーダ７７又はＰＳＩ
−ＣＥＬＰデコーダ８１からの線形予測係数α_V又はα_p
は上記符号化方式の種類に応じて切換スイッチ８０によ
り選択されてから出力端子８３を介して図１４の入力端
子３４に供給される。The VSELP decoder 77 or PSI
-Linear prediction coefficient α_V or α_p from CELP decoder 81
Is selected by the changeover switch 80 in accordance with the type of the encoding method, and is supplied to the input terminal 34 of FIG.

【０１１４】同様に、ＶＳＥＬＰデコーダ７７又はＰＳ
Ｉ−ＣＥＬＰデコーダ８１からのデコード音声も上記符
号化方式の種類に応じて切換スイッチ８４により選択さ
れてから出力端子８５を介して図１４の入力端子３３に
供給される。Similarly, the VSELP decoder 77 or PS
The decoded audio from the I-CELP decoder 81 is also selected by the changeover switch 84 in accordance with the type of the above-mentioned encoding method, and is then supplied to the input terminal 33 of FIG.

【０１１５】また、図１４に示す、信号切換部７０側で
は、上記符号化方式の種類に応じて切り換わる切換スイ
ッチ７１により、励振源切換＆拡張部６８又は励振源拡
張部４１からの励振源出力を切り換えて、ＬＰＣ合成部
４０に供給する。Further, on the signal switching section 70 side shown in FIG. 14, an excitation source switching & extension section 68 or an excitation source from the excitation source extension section 41 is switched by a changeover switch 71 which switches according to the type of the above-mentioned encoding method. The output is switched and supplied to the LPC synthesis unit 40.

【０１１６】この信号切換部７０でも第１のポストフィ
ルタ処理（ａ）４７，第２のポストフィルタ処理（ｂ）
４８及び第３のポストフィルタ処理（ｂ）４９を行うポ
ストフィルタを備えている。第１のポストフィルタ処理
（ａ）４７は上記デコード音声信号のスペクトル整形及
び聴感上の品質を効果的に向上でき、第２のポストフィ
ルタ処理（ｂ）４８及び第３のポストフィルタ処理
（ｂ）４９は第２のサンプリング周波数ｆ_s2（１６ＫＨ
ｚ）とされた第１の帯域Ｂ₁及び広帯域Ｂ_Wの音声信号の
スペクトル整形及び聴感上の品質を効果的に向上でき
る。The signal switching section 70 also performs the first post-filter processing (a) 47 and the second post-filter processing (b).
48 and a post filter for performing the third post filter processing (b) 49. The first post-filter processing (a) 47 can effectively improve the spectral shaping and audibility of the decoded audio signal, and the second post-filter processing (b) 48 and the third post-filter processing (b) 49 is the second sampling frequency f_s2 (16 KH
z) It is possible to effectively improve the spectral shaping and audible quality of the audio signals of the first band B₁ and the wide band B_{W set} as z).

【０１１７】したがって、この信号切換部７０によれ
ば、送信装置側から伝送されてくる伝送信号の符号化方
式の種類に応じ、サンプリング周波数が８ＫＨｚ，１６
ＫＨｚと異なる、第１の帯域Ｂ₁（３００Ｈｚ〜３４０
０Ｈｚ）の受話音声信号や、サンプリング周波数が１６
ＫＨｚの広帯域（３００Ｈｚ〜６０００Ｈｚ）の受話音
声信号にポストフィルタ処理を施した上で、ユーザに選
択させることができる。ユーザ側では選択肢を広げるこ
とができる。状況に応じて受話音声を帯域拡張するだけ
でなく、入力時の帯域と同様にすることができるので、
内蔵のバッテリーの減りを抑えることもできる。Therefore, according to the signal switching section 70, the sampling frequency is 8 KHz, 16 KHz in accordance with the type of the encoding system of the transmission signal transmitted from the transmitting device side.
KHz, the first band B₁ (300 Hz to 340
0 Hz) and a sampling frequency of 16
A post-filtering process can be performed on a received voice signal of a wide band (300 Hz to 6000 Hz) of KHz, and the user can make a selection. The user has more options. Depending on the situation, it is possible to not only extend the received voice band but also make it the same as the input band,
The built-in battery can be reduced.

【０１１８】さらに、上記図１の受信装置１内部の信号
処理装置５は、図１６に示すような信号切換部９０を備
えてもよい。Further, the signal processing device 5 inside the receiving device 1 of FIG. 1 may include a signal switching unit 90 as shown in FIG.

【０１１９】信号切換部９０の入力端子９１には、上記
音声パラメータ符号の内、ＬＰＣ残差である励振源が供
給される。また、入力端子９２には線形予測係数αが供
給される。入力端子９１からの励振源は、ＬＰＣ合成フ
ィルタ９３に送られると共に、アップサンプル部１００
に送られる。入力端子９２からの線形予測係数はＬＰＣ
合成フィルタ９３に送られる。An input terminal 91 of the signal switching section 90 is supplied with an excitation source which is an LPC residual among the above speech parameter codes. The input terminal 92 is supplied with a linear prediction coefficient α. The excitation source from the input terminal 91 is sent to the LPC synthesis filter 93 and the up-sampler 100
Sent to The linear prediction coefficient from the input terminal 92 is LPC
The signal is sent to the synthesis filter 93.

【０１２０】ＬＰＣ合成フィルタ９３は、入力端子９１
からの励振源を基に、入力端子９２からの線形予測係数
を用いて音声信号を合成する。ＬＰＣ合成フィルタ９３
で合成された音声信号は、第１のポストフィルタ処理
（ａ）１６１及びアップサンプル部９４に供給される。The LPC synthesis filter 93 has an input terminal 91
The speech signal is synthesized using the linear prediction coefficient from the input terminal 92 on the basis of the excitation source from. LPC synthesis filter 93
Are supplied to the first post-filter processing (a) 161 and the up-sampling unit 94.

【０１２１】第１のポストフィルタ処理（ａ）１６１
は、上記第１のポストフィルタ処理（ａ）４７と同様に
動作する。ここでは説明を省略する。First post-filter processing (a) 161
Operates similarly to the first post-filter processing (a) 47 described above. Here, the description is omitted.

【０１２２】アップサンプル部９４は、ＬＰＣ合成フィ
ルタ９３で合成された音声信号のサンプリング周波数ｆ
_s1をアップサンプルする。アップサンプルされた上記音
声信号は、第２のポストフィルタ処理（ｂ）１６２及び
バンドバスフィルタ（ＢＰＦ）９５に供給される。The up-sampling section 94 outputs the sampling frequency f of the audio signal synthesized by the LPC synthesis filter 93.
_{Upsample s1} . The upsampled audio signal is supplied to the second post-filter processing (b) 162 and the band-pass filter (BPF) 95.

【０１２３】第２のポストフィルタ処理（ｂ）１６２は
上記第２のポストフィルタ処理（ｂ）４８と同様に動作
する。すなわち、上記第１のポストフィルタ処理（ａ）
１６１をサンプリング周波数が１６ＫＨｚの音声信号に
おける２（＝ｆ_s2／ｆ_s1）倍のサンプルに対して施す。The second post-filter processing (b) 162 operates in the same manner as the second post-filter processing (b) 48. That is, the first post-filter processing (a)
161 is applied to 2 (=_fs2 /_fs1 ) times the sample of the audio signal whose sampling frequency is 16 kHz.

【０１２４】バンドパスフィルタ９５はアップサンプル
部９４からの出力のうち所定の帯域のみを通過させ、加
算部９６に供給する。このアップサンプル部９４、バン
ドパスフィルタ９５、加算部９６に通じる経路は、元の
周波数帯域の成分の信号を合成された音声信号に付加す
るための経路である。The band-pass filter 95 allows only a predetermined band of the output from the up-sampling section 94 to pass therethrough and supplies it to the adding section 96. The path leading to the up-sampling section 94, the band-pass filter 95, and the adding section 96 is a path for adding the signal of the component of the original frequency band to the synthesized audio signal.

【０１２５】また、ＬＰＣ合成フィルタ９３から線形予
測係数−自己相関変換部９７に線形予測係数が送られ
る。線形予測係数−自己相関変換部９７は、線形予測係
数を自己相関に変換するものである。この自己相関は狭
帯域コードブック９８に送られると共に、破擦音検出部
９９に送られる。The LPC synthesis filter 93 sends the linear prediction coefficient to the linear prediction coefficient-autocorrelation conversion section 97. The linear prediction coefficient-autocorrelation converter 97 converts the linear prediction coefficient into autocorrelation. This autocorrelation is sent to the narrowband codebook 98 and also to the affricate detector 99.

【０１２６】また、入力端子９１からの励振源は、アッ
プサンプル部１００でアップサンプルされ、ローパスフ
ィルタ１０１、ブースト部１０２を介して、ＬＰＣ合成
フィルタ１０３に送られる。ブースト部１０２は、破擦
音や摩擦音が検出された場合に励振源をブーストするた
めのもので、ブースト部１０２のブースト量は、破擦音
検出部９９の出力により制御される。The excitation source from the input terminal 91 is up-sampled by the up-sampling unit 100 and sent to the LPC synthesis filter 103 via the low-pass filter 101 and the boost unit 102. The boost unit 102 boosts the excitation source when an affricate or a fricative is detected, and the boost amount of the boost unit 102 is controlled by an output of the affricate detector 99.

【０１２７】狭帯域コードブック９８には、予め複数の
音声信号のパターンから得られた狭帯域音声信号の自己
相関情報がコードベクタとして格納されている。狭帯域
コードブック９８で、線形予測係数−自己相関変換部９
７からの自己相関と、狭帯域コードブック９８に格納さ
れている自己相関情報とが比較され、マッチング処理が
行われる。そして、最もマッチしている自己相関情報の
インデックスが広帯域コードブック１０４に送られる。In the narrow band code book 98, autocorrelation information of narrow band audio signals obtained from a plurality of audio signal patterns is stored in advance as code vectors. In the narrowband codebook 98, the linear prediction coefficient-autocorrelation conversion unit 9
7 and the autocorrelation information stored in the narrowband codebook 98, and a matching process is performed. Then, the index of the best matching autocorrelation information is sent to wideband codebook 104.

【０１２８】広帯域コードブック１０４には、狭帯域コ
ードブック９８と対応して、狭帯域コードブック９８を
作成したときと同一のパターンの音声信号から得られる
広帯域音声信号の自己相関情報がコードベクタとして格
納されている。狭帯域コードブック９８で最もマッチし
ている自己相関情報が判断されると、このインデックス
が広帯域コードブック１０４に送られ、広帯域コードブ
ック１０４により、最もマッチしていると判断された狭
帯域の自己相関情報に対応する広帯域の自己相関情報が
読み出される。In the wideband codebook 104, corresponding to the narrowband codebook 98, autocorrelation information of a wideband audio signal obtained from an audio signal of the same pattern as when the narrowband codebook 98 was created is used as a code vector. Is stored. When the best matching autocorrelation information is determined in the narrowband codebook 98, this index is sent to the wideband codebook 104, and the wideband codebook 104 determines the narrowband autocorrelation information determined to be the best match. Broadband autocorrelation information corresponding to the correlation information is read.

【０１２９】広帯域コードブック１０４から読み出され
た広帯域の自己相関情報は、自己相関−線形予測係数変
換部１０５に送られる。自己相関−線形予測係数変換部
１０５により、自己相関から線形予測係数への変換が行
われる。この線形予測係数がＬＰＣ合成フィルタ１０３
に送られる。The wideband autocorrelation information read from wideband codebook 104 is sent to autocorrelation / linear prediction coefficient conversion section 105. The autocorrelation-to-linear prediction coefficient conversion unit 105 converts the autocorrelation to a linear prediction coefficient. The LPC synthesis filter 103
Sent to

【０１３０】ＬＰＣ合成フィルタ１０３ではＬＰＣ合成
が行われ、これにより、広帯域音声信号が合成される。
ＬＰＣ合成フィルタ１０３で合成された音声信号は、高
域抽出＆抑圧フィルタ１０６及び乗算部１０７に供給さ
れる。The LPC synthesis filter 103 performs the LPC synthesis, thereby synthesizing a wideband audio signal.
The audio signal synthesized by the LPC synthesis filter 103 is supplied to a high-frequency extraction and suppression filter 106 and a multiplication unit 107.

【０１３１】高域抽出＆抑圧フィルタ１０６は、ＬＰＣ
合成フィルタ１０３からの合成出力から入力狭帯域音声
信号の周波数帯域３００Ｈｚ〜３４００Ｈｚの信号成分
を除去し、３４００Ｈｚ以上の信号成分を抽出すると共
に、ユーザの好みに応じて高い周波数成分を抑圧する。
乗算部１０７は、高域抽出＆抑圧フィルタ１０６からの
フィルタ出力に端子１０８から調整されたゲインを乗算
する。The high-frequency extraction & suppression filter 106 is an LPC
A signal component in the frequency band of 300 Hz to 3400 Hz of the input narrow band audio signal is removed from the combined output from the combining filter 103 to extract a signal component of 3400 Hz or more, and suppresses a high frequency component according to the user's preference.
The multiplication unit 107 multiplies the filter output from the high-frequency extraction & suppression filter 106 by the gain adjusted from the terminal 108.

【０１３２】そして、加算部９６は、乗算部１０７から
の乗算出力に、ＢＰＦ９５を介した元の狭帯域音声信号
成分を加算し、広帯域の音声信号を出力する。この広帯
域の音声信号は第３のポストフィルタ処理（ｂ）１６３
に供給される。The adding section 96 adds the original narrowband audio signal component via the BPF 95 to the multiplied output from the multiplying section 107, and outputs a wideband audio signal. This wideband audio signal is subjected to third post-filter processing (b) 163.
Supplied to

【０１３３】第３のポストフィルタ処理（ｂ）１６３は
上記第３のポストフィルタ処理（ｂ）４９と同様に動作
する。すなわち、上記第２のポストフィルタ処理（ｂ）
１６２と同様に、上記第１のポストフィルタ処理（ａ）
１６１をサンプリング周波数が１６Ｈｚの音声信号にお
ける２（＝ｆ_s2／ｆ_s1）倍のサンプルに対して施す。The third post-filter processing (b) 163 operates similarly to the above-mentioned third post-filter processing (b) 49. That is, the second post-filter processing (b)
162, the first post-filter processing (a)
161 is applied to 2 (=_fs2 /_fs1 ) times the sample of the audio signal whose sampling frequency is 16 Hz.

【０１３４】第１のポストフィルタ処理（ａ）１６１か
らの第１のフィルタ処理出力と、第２のポストフィルタ
処理（ｂ）１６２からの第２のフィルタ処理出力と、第
３のポストフィルタ処理（ｂ）１６３からの第３のフィ
ルタ処理出力は切り換えスイッチ１０９の被選択端子
ａ，ｂ，ｃに供給される。The first filter processing output from the first post-filter processing (a) 161, the second filter processing output from the second post-filter processing (b) 162, and the third post-filter processing ( b) The third filtered output from 163 is supplied to the selected terminals a, b, c of the changeover switch 109.

【０１３５】すなわち、切り換えスイッチ１０９は、上
記第１のサンプリング周波数ｆ_s1（＝８ＫＨｚ）の第１
の帯域Ｂ₁（３００Ｈｚ〜３４００Ｈｚ）のポストフィ
ルタ処理が施された音声信号を被選択端子ａで受け、第
２のサンプリング周波数ｆ_s2（＝１６ＫＨｚ）の第１の
帯域Ｂ₁（３００Ｈｚ〜３４００Ｈｚ）のポストフィル
タ処理が施された音声信号を被選択端子ｂで受け、第２
のサンプリング周波数ｆ_s2（＝１６ＫＨｚ）の広帯域Ｂ
_w（３００Ｈｚ〜６０００Ｈｚ）のポストフィルタ処理
が施された音声信号を被選択端子ｃで受ける。そして、
切り換え制御信号端子１２９からの切り換え制御信号に
基づいて選択片ｄを切り換えることにより、いずれか一
のポストフィルタ処理が施された音声信号をＤ／Ａ変換
器６に供給する。That is, the changeover switch 109 is connected to the first sampling frequency f_s1 (= 8 KHz).
The post-filter-processed audio signal of the band B₁ (300 Hz to 3400 Hz) is received at the selected terminal a, and the first band B₁ (300 Hz to 3400 Hz) of the second sampling frequency f_s2 (= 16 KHz) is received. The post-filtered audio signal is received at the selected terminal b,
Broadband B with sampling frequency f_s2 (= 16 KHz)
_The audio signal subjected to_w (300 Hz to 6000 Hz) post-filter processing is received at the selected terminal c. And
By switching the selection piece d based on the switching control signal from the switching control signal terminal 129, any one of the post-filtered audio signals is supplied to the D / A converter 6.

【０１３６】以上より、この図１６に示した信号切換部
９０を備える受信装置でも、サンプリング周波数が８Ｋ
Ｈｚ，１６ＫＨｚと異なる、第１の帯域Ｂ₁（３００Ｈ
ｚ〜３４００Ｈｚ）の受話音声信号や、サンプリング周
波数が１６ＫＨｚの広帯域（３００Ｈｚ〜６０００Ｈ
ｚ）の受話音声信号を、ポストフィルタ処理を施した上
でユーザに選択させることができる。As described above, even in the receiving apparatus provided with signal switching section 90 shown in FIG.
Hz, 16 KHz, the first band B₁ (300H
z to 3400 Hz) or a wide band (300 Hz to 6000 H) with a sampling frequency of 16 KHz.
The user can select the received voice signal of z) after the post-filter processing.

【０１３７】なお、上記受信装置１内部の信号処理装置
５は、各信号切換部３２，６５，７０及び９０内に、各
ポストフィルタ処理の後又は前で第１〜第３の雑音低減
処理を行う雑音低減処理部を備えても良い。The signal processing device 5 in the receiving device 1 performs the first to third noise reduction processes in each of the signal switching units 32, 65, 70, and 90 before or after each post-filter process. A noise reduction processing unit may be provided.

【０１３８】この雑音低減処理部は、本件出願人が既に
出願した、特開平７−１９３５４８号公報に開示されて
いる、雑音低減処理方法を用いて、背景雑音を検出し、
抑圧する。この雑音低減処理方法は、上記第１のサンプ
リング周波数ｆ_s1の音声信号を生成するために送信装置
から伝送されてきた伝送信号に基づく音声パラメータ符
号から検出された背景雑音区間の雑音レベルに応じて制
御信号を形成し、この制御信号に基づいて雑音低減処理
の内容を変化させる。This noise reduction processing unit detects background noise by using a noise reduction processing method disclosed in Japanese Patent Application Laid-Open No. 7-193548, which has already been filed by the present applicant.
Oppress. This noise reduction processing method is based on a noise level of a background noise section detected from a voice parameter code based on a transmission signal transmitted from a transmission device to generate a voice signal of the first sampling frequency f_s1. A control signal is formed, and the content of the noise reduction processing is changed based on the control signal.

【０１３９】図１７には、上記雑音低減処理方法を適用
した雑音低減処理部の第１の雑音低減処理（ａ）１７
１，第２の雑音低減処理（ｂ）１７２，第３の雑音低減
処理（ｂ）１７３を、第１のポストフィルタ処理（ａ）
４７，第２のポストフィルタ処理（ｂ）４８，第３のポ
ストフィルタ処理（ｂ）４９の後段で行う信号切換部３
２を示す。また、図１８には、上記雑音低減処理部の詳
細な構成を示す。上記加算部４６からの加算出力とな
る、帯域３００〜６０００Ｈｚ、サンプリング周波数が
１６ｋＨｚの広帯域音声信号Ｓｎｄ_wは入力端子１４１
を介して、フレームパワー計算部１４２に供給される。
フレームパワー計算部１４２は、例えば周期２０ｍｓｅ
ｃのフレーム毎のパワーとして、例えば自乗平均の平方
根、いわゆるｒｍｓ値を計算する。このフレームパワー
計算部１４２で計算されたフレーム平均パワー値は、抑
圧比計算部１４３に供給される。抑圧比計算部１４３
は、上記フレームパワー計算部１４２で計算されたフレ
ーム平均パワーを用いて、雑音を抑圧するための係数で
ある抑圧比を計算する。抑圧比計算部１４３で計算され
た抑圧比は、スムージング部１４４に送られる。スムー
ジング部１４４は、抑圧比計算部１４３で計算された抑
圧比にスムージング処理を施す。このスムージング処理
とは、例えば２０ｍｓｅｃで１６０サンプルのフレーム
単位で分割された入力音声信号のつながりの不連続性を
避けるための処理である。このスムージング処理が施さ
れた抑圧比は、ノイズリデュース部１４５に送られ、こ
のノイズリデュース部１４５において上記広帯域音声信
号Ｓｎｄ_wの雑音を除去するために用いられる。FIG. 17 shows the first noise reduction processing (a) 17 of the noise reduction processing section to which the above-described noise reduction processing method is applied.
1, a second noise reduction process (b) 172, a third noise reduction process (b) 173, a first post-filter process (a)
47, a second post-filtering process (b) 48, and a signal switching unit 3 to be performed in a subsequent stage of the third post-filtering process (b) 49
2 is shown. FIG. 18 shows a detailed configuration of the noise reduction processing unit. A wideband audio signal Snd_w having a band of 300 to 6000 Hz and a sampling frequency of 16 kHz, which is an addition output from the adder 46, is input to an input terminal 141
Is supplied to the frame power calculation unit 142 via the.
The frame power calculation unit 142 has, for example, a cycle of 20 msec.
For example, a so-called rms value, which is the root of the root mean square, is calculated as the power of each frame of c. The frame average power value calculated by the frame power calculation unit 142 is supplied to the suppression ratio calculation unit 143. Suppression ratio calculator 143
Calculates a suppression ratio, which is a coefficient for suppressing noise, using the frame average power calculated by the frame power calculation unit 142. The suppression ratio calculated by the suppression ratio calculation unit 143 is sent to the smoothing unit 144. The smoothing unit 144 performs a smoothing process on the suppression ratio calculated by the suppression ratio calculation unit 143. The smoothing process is a process for avoiding discontinuity of connection between input audio signals divided in units of 160 samples in, for example, 20 msec. The suppression ratio that has been subjected to the smoothing processing is sent to the noise reducer 145, and is used by the noise reducer 145 to remove noise from the wideband audio signal Snd_w .

【０１４０】抑圧比計算部１４３には、端子１４８を介
して入力された雑音レベル検出信号をレベル弁別部１４
７で弁別して得られた制御信号が供給されており、この
制御信号に応じて、例えば上記抑圧比計算のしきい値が
切換制御されるようになっている。The noise-ratio detection signal input via the terminal 148 is supplied to the suppression-ratio calculator 143.
A control signal obtained by discrimination in step 7 is supplied, and in response to this control signal, for example, a threshold for the above-described suppression ratio calculation is switched and controlled.

【０１４１】次に、この雑音低減処理部の動作について
詳細に説明する。図１８のフレームパワー計算部１４２
は、上記フレーム当たりの上記デコード音声信号Ｓｎｄ
_Nの平均パワーｒｍｓを計算する。この平均パワーｒｍ
ｓは抑圧比計算部１４３に供給される。Next, the operation of the noise reduction processing section will be described in detail. The frame power calculation unit 142 in FIG.
Is the decoded audio signal Snd per frame
Calculate the average power rms of_N. This average power rm
s is supplied to the suppression ratio calculation unit 143.

【０１４２】抑圧比計算部１４３は、平均パワーｒｍｓ
と、あるしきい値ｎｒ１とを比較し、その比較結果によ
り、抑圧比scaleを計算する。すなわち、この抑圧比sca
leは、上記平均パワーｒｍｓがしきい値ｎｒ１以上のと
き１とし、しきい値ｎｒ１よりも小さいとき、 scale＝ｒｍｓ／Ｋ・・・（７）とする。ここで、Ｋは定数である。この例の場合には、
Ｋ＝ｎｒ１となる。The suppression ratio calculator 143 calculates the average power rms
And a certain threshold value nr1, and the suppression ratio scale is calculated based on the comparison result. That is, this suppression ratio sca
le is set to 1 when the average power rms is equal to or larger than the threshold value nr1, and is set to scale = rms / K (7) when the average power rms is smaller than the threshold value nr1. Here, K is a constant. In this case,
K = nr1.

【０１４３】あるいは、全てのｒｍｓについて上記
（７）式を計算し、その計算結果としての抑圧比scale
が１よりも小（scale＜１）となる場合には、この
（７）式で計算された抑圧比scaleを上記デコード音声
信号Ｓｎｄ_Nに乗算する。これは、上記平均パワーｒｍ
ｓが上記しきい値ｒｎ１よりも小となるフレームにおい
ては、上記デコード音声信号Ｓｎｄ_Nに１よりも小さい
ゲインを乗算することを意味する。また、この（７）式
の結果、抑圧比scaleが１以上（scale≧１）となる場合
には、上記デコード音声信号Ｓｎｄ_Nには何も処理を施
さずそのまま出力する。これは、抑圧比scaleが上記し
きい値となるフレームにおいては、上記デコード音声信
号Ｓｎｄ_Nに１のゲインを乗算することを意味する。し
たがって、このしきい値ｎｒ１を適切に選ぶことによ
り、雑音部分のようなパワーの小さい部分ではゲインが
小さく制御されることになり、実質的に雑音低減の効果
が得られる。なお、上記（７）式を用いた場合のノイズ
抑圧の効果は、入力信号の平均パワーに対して１／２倍
となる。Alternatively, the above equation (7) is calculated for all rms, and the suppression ratio scale as the calculation result is calculated.
Is smaller than 1 (scale <1), the decoded speech signal Snd_N is multiplied by the suppression ratio scale calculated by the equation (7). This is the average power rm
In a frame in which s is smaller than the threshold value rn1, this means that the decoded audio signal Snd_N is multiplied by a gain smaller than 1. When the suppression ratio scale is 1 or more (scale ≧ 1) as a result of Expression (7), the decoded audio signal Snd_N is output without any processing. This suppression ratio scale is in the frame to be the threshold value, it means that multiplies a gain of 1 to the decode voice signal Snd_N. Therefore, by appropriately selecting the threshold value nr1, the gain is controlled to be small in a portion having a small power such as a noise portion, and the effect of substantially reducing noise is obtained. Note that the effect of noise suppression when using the above equation (7) is 倍 of the average power of the input signal.

【０１４４】また、ノイズの抑圧がききすぎる場合や、
一定レベル以下をミュートする部と組み合わせて使用す
る場合などにおいては、上記しきい値ｎｒ１（これを第
１のしきい値とする。）よりも小さい第２のしきい値ｎ
ｒ２を設定し、入力レベルがこの第２のしきい値ｎｒ２
よりも小さくなる領域で、抑圧を小さく、すなわちエキ
スパンダの伸長作用の強さを弱めることが好ましい。When the noise suppression is too strong,
In a case where the second threshold value nr1 is used in combination with a unit that mutes a certain level or less, the second threshold value n is smaller than the threshold value nr1 (this is referred to as a first threshold value).
r2, and the input level is set to the second threshold value nr2
It is preferable to reduce the suppression, that is, to weaken the strength of the expanding action of the expander in a region where the expansion is smaller.

【０１４５】ところで、入力された信号に対して音声と
雑音とを区別して処理しているわけではないので、子音
などの音声パワーが相対的に小さいところで音声が無く
なる傾向がある。特に強くノイズリデュースをかけたと
きにこの現象が顕著に現れ、音声の種類によってはかな
りの違和感を感じる。したがって、フレーム平均パワー
に対して、どの程度の強さでノイズリデュースをかける
か、またどのくらいの大きさからかけるかの検討が必要
になってくる。By the way, since the input signal is not processed by distinguishing between speech and noise, there is a tendency that the speech disappears when the speech power of a consonant or the like is relatively small. This phenomenon is particularly noticeable when a strong noise reduction is applied, and depending on the type of voice, a considerable sense of discomfort is felt. Therefore, it is necessary to consider how strong the noise reduction should be applied to the frame average power and from what size.

【０１４６】また、上記のような処理をフレーム単位で
行うと、フレームでの音声のつながりが不連続になり、
聞いたときに不自然感を感じてしまう。When the above-described processing is performed in units of frames, the connection of audio in frames becomes discontinuous.
When you hear it, you feel unnatural.

【０１４７】これらのことを考慮して、上記抑圧比scal
eに対してアタックタイム、リカバリタイムを設定し、
例えばフレーム単位のスムージングを行うことにより、
上記不自然感が出ないようにすることが考えられる。In consideration of the above, the above suppression ratio scal
Set attack time and recovery time for e,
For example, by performing smoothing in frame units,
It is conceivable to prevent the unnatural feeling from appearing.

【０１４８】すなわち、上記図１８の構成からも明らか
なように、抑圧比計算部１４３で計算して求められた抑
圧比scaleは、一旦スムージング部１４４によるスムー
ジング処理を施した後、ノイズリデュース部１４５に送
るようにしている。That is, as is clear from the configuration of FIG. 18, the suppression ratio scale calculated by the suppression ratio calculation unit 143 is once subjected to a smoothing process by the smoothing unit 144 and then to the noise reduction unit 145. To send to.

【０１４９】このスムージング部１４４は、上述したよ
うなノイズ低減処理において生じる問題を解決するため
に設けられたものであり、上記アタックタイム、リカバ
リタイムを設定している。この例では、アタックタイム
を“０”とし、リカバリータイムは可変としている。The smoothing section 144 is provided to solve the above-described problem that occurs in the noise reduction processing, and sets the attack time and the recovery time. In this example, the attack time is “0” and the recovery time is variable.

【０１５０】すなわち、計算した現在のフレームの音声
パワーが前のフレームより大きい時にはその値をそのま
ま使い、逆に小さい場合は所定の特性を備えるローパス
フィルタ（ＬＰＦ）によりスムージングを行い、フレー
ムパワーの変化による処理の不自然感が出ないようにす
る。ノイズリデュース部１４５は、上記広帯域音声信号
Ｓｎｄ_wにスムージング部１４４を介した抑圧比scaleを
乗算して入力信号Ｓｎｄ_wの雑音低減処理を行い、雑音
が低減された出力信号を出力端子１４６から出力してい
る。That is, when the calculated speech power of the current frame is larger than the previous frame, the value is used as it is. On the other hand, when the calculated speech power is smaller, smoothing is performed by a low-pass filter (LPF) having a predetermined characteristic to change To avoid unnatural feeling of processing. The noise reducer 145 multiplies the wideband audio signal Snd_w by the suppression ratio scale via the smoothing unit 144 to perform noise reduction processing on the input signal Snd_w , and outputs an output signal with reduced noise from the output terminal 146. are doing.

【０１５１】ところで、上記抑圧比計算部１４３には、
端子１４８を介した雑音レベル検出信号をレベル弁別部
１４７で弁別して得られた制御信号が供給されている。
この制御信号に応じて、上記抑圧比計算のしきい値が切
換制御されている。すなわち、抑圧比計算のしきい値
は、雑音レベル検出信号に基づいている。Incidentally, the above-mentioned suppression ratio calculating section 143 includes:
A control signal obtained by discriminating the noise level detection signal via the terminal 148 by the level discriminator 147 is supplied.
In response to the control signal, the threshold for the above-described suppression ratio calculation is switched and controlled. That is, the threshold for the suppression ratio calculation is based on the noise level detection signal.

【０１５２】この雑音レベル検出信号は、上記第１のサ
ンプリング周波数ｆ_s1の音声信号を生成するために送信
装置から伝送されてきた伝送信号に基づく音声パラメー
タ符号から検出された背景雑音区間の音声レベルにより
表すことができる。The noise level detection signal is an audio level of a background noise section detected from an audio parameter code based on a transmission signal transmitted from the transmission device to generate the audio signal of the first sampling frequency f_s1. Can be represented by

【０１５３】ここでは、図示を省略しているが、上記音
声パラメータ符号から背景雑音区間を検出する雑音区間
検出部と、この雑音区間検出部で検出された雑音区間の
雑音レベルを検出する雑音レベル検出部が必要とされ、
端子１４８には雑音レベル検出部で検出された雑音レベ
ル検出信号が供給される。Although not shown here, a noise section detecting section for detecting a background noise section from the speech parameter code, and a noise level detecting the noise level of the noise section detected by the noise section detecting section. A detector is needed,
A terminal 148 is supplied with a noise level detection signal detected by the noise level detection unit.

【０１５４】また、この雑音低減処理部は、上記第１の
サンプリング周波数ｆ_s1（８ＫＨｚ）の音声信号を生成
するために送信装置から伝送されてきた伝送信号に基づ
く音声パラメータ符号を第１の雑音低減処理に用いてい
るが、他の雑音低減処理（ｂ）１７２，雑音低減処理
（ｂ）１７３が実際に雑音低減処理を施すのは、第２の
サンプリング周波数ｆ_s2（１６ＫＨｚ）とされた音声信
号に対してである。このため、第２の雑音低減処理
（ｂ）１７２，第３の雑音低減処理（ｂ）１７３は、上
記第１の雑音低減処理（ａ）１７１をサンプリング周波
数が１６ＫＨｚの音声信号における２（＝ｆ_s2／ｆ_s1）
倍のサンプルに対して施す。Further, the noise reduction processing section converts the speech parameter code based on the transmission signal transmitted from the transmission device to generate the speech signal of the first sampling frequency f_s1 (8 kHz) into the first noise. Although the noise reduction processing (b) 172 and the noise reduction processing (b) 173 that are used for the reduction processing actually perform the noise reduction processing, the sound having the second sampling frequency f_s2 (16 KHz) is used. For the signal. For this reason, the second noise reduction processing (b) 172 and the third noise reduction processing (b) 173 perform the above-described first noise reduction processing (a) 171 by 2 (= f_s2 / f_s1 )
Apply to doubled sample.

【０１５５】このようにして、第１の雑音低減処理
（ａ）１７１は、上記ポストフィルタ（ａ）４７でスペ
クトル整形及び聴感上の品質が向上された音声信号中の
雑音成分を低減できる。また、第２の雑音低減処理
（ｂ）１７２，第３の雑音低減処理（ｂ）１７３はポス
トフィルタ処理済みの第２のサンプリング周波数ｆ
_s2（１６ＫＨｚ）とされた第１の帯域Ｂ₁及び広帯域Ｂ_W
の音声信号の雑音成分を低減できる。As described above, the first noise reduction processing (a) 171 can reduce the noise component in the audio signal whose spectral shaping and audibility have been improved by the post filter (a) 47. The second noise reduction processing (b) 172 and the third noise reduction processing (b) 173 are performed at the second sampling frequency f after the post-filter processing.
_s2 (16 KHz) and have been the first band B₁ and wide band B_W
Can reduce the noise component of the audio signal.

【０１５６】すなわち、図１７に示した信号切換部３２
は、切り換えスイッチ１５０により、第１のポストフィ
ルタ処理（ａ）４７，第２のポストフィルタ処理（ｂ）
４８及び第３のポストフィルタ処理（ｂ）４９でスペク
トル整形及び聴感上の品質が効果的に向上され、第１の
雑音低減処理（ａ）１７１，第２の雑音低減処理（ｂ）
１７２及び第３の雑音低減処理（ｂ）１７３で雑音が低
減された音声信号、つまりサンプリング周波数が８ＫＨ
ｚの第１の帯域Ｂ₁（３００〜３４００Ｈｚ）の音声信
号と、サンプリング周波数が１６ＫＨｚの第１の帯域Ｂ
₁（３００〜３４００Ｈｚ）の音声信号と、サンプリン
グ周波数が１６ＫＨｚの広帯域Ｂ_W（３００〜６０００
Ｈｚ）の広帯域音声信号とを切り換えてＤ／Ａ変換器６
に送ることができる。That is, the signal switching section 32 shown in FIG.
The first post-filter processing (a) 47 and the second post-filter processing (b) are performed by the changeover switch 150.
48 and the third post-filter processing (b) 49 effectively improve the spectral shaping and audible quality. The first noise reduction processing (a) 171 and the second noise reduction processing (b)
172 and the third noise reduction processing (b) The audio signal whose noise is reduced by 173, that is, the sampling frequency is 8 KH
audio signal in the first band B₁ (300-3400 Hz) of z and the first band B having a sampling frequency of 16 KHz
₁ and the audio signal (ranging from 300 to 3400 Hz), the sampling frequency is 16KHz wideband B_W (300 to 6000
Hz) and the D / A converter 6
Can be sent to

【０１５７】なお、上記信号切換部３２、６５、７０又
は９０を備えた信号処理装置を用いた受信装置は、送信
装置と一体化され、図１９に示すような、携帯電話装置
１１０を構成してもよい。この携帯電話装置１１０も、
ＰＤＣとして、現在広くしようされている、ディジタル
携帯電話に適用できる。A receiving device using a signal processing device provided with the above-described signal switching unit 32, 65, 70 or 90 is integrated with a transmitting device to constitute a portable telephone device 110 as shown in FIG. You may. This mobile phone device 110 also
As a PDC, it can be applied to digital mobile phones that are currently being widely used.

【０１５８】この携帯電話装置１１０で、マイクロホン
１１１から入力された音声信号は、アンプ１１２，ボリ
ューム１１３，アンチエイリアシングフィルタ１１４及
びＡ／Ｄ変換器１１５を経由して信号処理装置１１６の
メモリ１１６ａに格納される。In the portable telephone device 110, the audio signal input from the microphone 111 is stored in the memory 116a of the signal processing device 116 via the amplifier 112, the volume 113, the anti-aliasing filter 114 and the A / D converter 115. Is done.

【０１５９】メモリ１１６ａに格納された音声信号は、
信号処理装置１１６内部の音声符号化部で符号処理さ
れ、音声パラメータ符号として出力される。The audio signal stored in the memory 116a is
The audio signal is encoded by an audio encoding unit in the signal processing device 116 and output as an audio parameter code.

【０１６０】この音声パラメータ符号は、制御部１１７
及びＲＦ（ＲＦ送信）アンプ１１８及びアンテナ１１９
を経由して基地局へ送信される。This voice parameter code is transmitted to the control unit 117.
And RF (RF transmission) amplifier 118 and antenna 119
Is transmitted to the base station via.

【０１６１】ここで、信号処理装置１１６内部の音声符
号化部は、伝送路により制限される狭帯域化を考慮した
音声パラメータ符号を制御部１１７を介してＲＦアンプ
１１８に供給する。Here, the audio encoding unit in the signal processing device 116 supplies the audio parameter code to the RF amplifier 118 via the control unit 117 in consideration of the narrow band limited by the transmission path.

【０１６２】また、アンテナ１１９を介して基地局から
受信した音声パラメータ符号は、ＲＦアンプ１１８、制
御部１１７を経由して信号処理装置１２２のメモリ１２
２ａに格納される。The voice parameter code received from the base station via the antenna 119 is transmitted to the memory 12 of the signal processing device 122 via the RF amplifier 118 and the control unit 117.
2a.

【０１６３】信号処理装置１２２のメモリ１２２ａに格
納された音声パラメータ符号は、信号処理装置１２２の
復号部で復号処理された後、所定の信号処理が施されて
出力される。The speech parameter code stored in the memory 122a of the signal processing device 122 is decoded by the decoding section of the signal processing device 122, and then subjected to predetermined signal processing and output.

【０１６４】信号処理装置１２２から出力信号は、Ｄ／
Ａ変換器１２３でアナログ信号とされた後、アンチエイ
リアシングフィルター１２４、ボリューム１２５及びア
ンプ１２８を経由してスピーカ１２７から出力される。The output signal from the signal processing device 122 is D /
After being converted into an analog signal by the A converter 123, the analog signal is output from the speaker 127 via the anti-aliasing filter 124, the volume 125 and the amplifier 128.

【０１６５】ここで、信号処理装置１２２は、上記信号
切換部３２、６５、７０又は９０を備えてなる。したが
って、この図１９に示した携帯電話装置１１０は、受話
側でサンプリング周波数を２倍にした高品質の広帯域音
声信号の、スペクトル整形及び聴感上の品質を効果的に
向上し、かつ、雑音成分を低減することができる。Here, the signal processing device 122 includes the above-described signal switching unit 32, 65, 70 or 90. Therefore, the mobile phone device 110 shown in FIG. 19 can effectively improve the spectral shaping and audibility of a high-quality wideband audio signal whose sampling frequency is doubled on the receiving side, and can improve the noise component. Can be reduced.

【０１６６】なお、上記実施の形態では、受信装置、送
信装置、携帯電話装置を、ＰＤＣとして使用されている
ディジタル携帯電話装置に適用できるとして説明した
が、広帯域（ワイドバンド）ＣＤＭＡ方式、すなわち、
周波数帯域幅が広い移動体通信システムにも適用が可能
である。In the above embodiment, the receiving device, the transmitting device, and the mobile phone device have been described as being applicable to a digital mobile phone device used as a PDC. However, the wideband (wideband) CDMA system, that is,
The present invention is also applicable to a mobile communication system having a wide frequency bandwidth.

【０１６７】[0167]

【発明の効果】以上、本発明によれば、サンプリング周
波数が例えば８ＫＨｚ，１６ＫＨｚと異なる、第１の帯
域Ｂ₁（３００Ｈｚ〜３４００Ｈｚ）のＰＳＩ−ＣＥＬ
Ｐ又はＶＳＥＬＰによる受話音声信号や、サンプリング
周波数が１６ＫＨｚの広帯域（３００Ｈｚ〜６０００Ｈ
ｚ）の受話音声信号にポストフィルタ処理を施した上
で、ユーザに選択させることができる。このため、ユー
ザ側では選択肢が広がる。また、状況に応じて受話音声
を帯域拡張するだけでなく、入力時の帯域と同様にする
ことができるので、内蔵のバッテリーの減りを抑えるこ
ともできる。また、雑音低減処理を施し、雑音成分を低
減した上で、ユーザに選択させてもよい。Effect of the Invention] According to the present invention, the sampling frequency is for example 8 KHz, different from the 16 KHz, PSI-CEL of the first band B₁ (300Hz~3400Hz)
A received voice signal by P or VSELP, or a wide band (300 Hz to 6000 H) with a sampling frequency of 16 KHz
After the post-filter processing is performed on the received voice signal of z), the user can make the selection. For this reason, the user has more options. Further, not only the band of the received voice can be expanded according to the situation, but also the band can be made the same as the band at the time of input, so that the built-in battery can be prevented from being reduced. Alternatively, the user may be allowed to make a selection after performing noise reduction processing to reduce noise components.

【０１６８】したがって、聴覚的品質を向上させた受話
音声を得ることのできる受信装置及び方法、通信装置及
び方法の提供を実現できる。Therefore, it is possible to provide a receiving apparatus and method, a communication apparatus and a method capable of obtaining a received voice with improved auditory quality.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の実施の形態となる受信装置の構成を示
すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a receiving device according to an embodiment of the present invention.

【図２】上記図１に示した受信装置に音声パラメータ符
号を基地局を介して送信する送信装置の構成を示すブロ
ック図である。FIG. 2 is a block diagram showing a configuration of a transmitting apparatus for transmitting a voice parameter code to the receiving apparatus shown in FIG. 1 via a base station.

【図３】上記図１に示した受信装置内部の信号処理装置
を信号切換部と共に構成するＰＳＩ−ＣＥＬＰデコーダ
を示す図である。FIG. 3 is a diagram showing a PSI-CELP decoder which constitutes a signal processing device inside the receiving device shown in FIG. 1 together with a signal switching unit.

【図４】上記図１に示した受信装置内部の信号処理装置
をＰＳＩ−ＣＥＬＰデコーダと共に構成する信号切換部
の処理を説明するための機能ブロック図である。FIG. 4 is a functional block diagram for explaining a process of a signal switching unit configuring the signal processing device inside the receiving device shown in FIG. 1 together with a PSI-CELP decoder.

【図５】上記図４に示した信号切換部に含まれる励振源
拡張部の詳細な構成を示すブロック図である。FIG. 5 is a block diagram showing a detailed configuration of an excitation source extension unit included in the signal switching unit shown in FIG. 4;

【図６】上記図４に示した信号切換部に含まれるポスト
フィルタの詳細な構成を示すブロック図である。FIG. 6 is a block diagram showing a detailed configuration of a post filter included in the signal switching unit shown in FIG. 4;

【図７】上記図４に示した信号切換部の詳細な動作を説
明するためのフローチャートである。FIG. 7 is a flowchart illustrating a detailed operation of the signal switching unit shown in FIG. 4;

【図８】上記図４に示した信号切換部で用いられるコー
ドブックに使われるトレーニングデータ生成処理を説明
するためのフローチャートである。FIG. 8 is a flowchart for explaining a training data generation process used for a codebook used in the signal switching unit shown in FIG. 4;

【図９】上記コードブックの生成を説明するためのフロ
ーチャートである。FIG. 9 is a flowchart illustrating the generation of the code book.

【図１０】上記ポストフィルタのフィルタ係数更新周期
とゲイン更新周期とを説明するための図である。FIG. 10 is a diagram for explaining a filter coefficient update cycle and a gain update cycle of the post filter.

【図１１】上記図１に示した受信装置内部の信号処理装
置の他の具体例に含まれるＶＳＥＬＰデコーダを示す図
である。FIG. 11 is a diagram showing a VSELP decoder included in another specific example of the signal processing device inside the receiving device shown in FIG. 1;

【図１２】上記図１に示した受信装置内部の信号処理装
置の他の具体例に含まれる信号切換部の処理を説明する
ための機能ブロック図である。FIG. 12 is a functional block diagram for explaining processing of a signal switching unit included in another specific example of the signal processing device inside the receiving device shown in FIG. 1;

【図１３】上記図１２に示した信号切換部の詳細な動作
を説明するためのフローチャートである。FIG. 13 is a flowchart illustrating a detailed operation of the signal switching unit shown in FIG. 12;

【図１４】上記図１に示した受信装置内部の信号処理装
置のさらに他の具体例に含まれる信号切換部の処理を説
明するための機能ブロック図である。FIG. 14 is a functional block diagram for explaining a process of a signal switching unit included in still another specific example of the signal processing device inside the receiving device shown in FIG. 1;

【図１５】上記図１に示した受信装置内部の信号処理装
置のさらに他の具体例に含まれるデコード部の構成を示
すブロック図である。FIG. 15 is a block diagram showing a configuration of a decoding unit included in still another specific example of the signal processing device inside the receiving device shown in FIG. 1;

【図１６】上記図１に示した受信装置内部の信号処理装
置の、またさらに他の具体例に含まれる信号切換部の処
理を説明するための機能ブロック図である。FIG. 16 is a functional block diagram for explaining processing of a signal switching unit included in still another specific example of the signal processing device inside the receiving device shown in FIG. 1;

【図１７】上記図４に示した信号切換部内のポストフィ
ルタの後段で雑音低減処理を行う信号切換部の処理を説
明するための機能ブロック図である。FIG. 17 is a functional block diagram for explaining a process of a signal switching unit that performs a noise reduction process after the post filter in the signal switching unit shown in FIG. 4;

【図１８】上記図１７に示した信号切換部に含まれる雑
音低減処理部の詳細な構成を示すブロック図である。FIG. 18 is a block diagram illustrating a detailed configuration of a noise reduction processing unit included in the signal switching unit illustrated in FIG. 17;

【図１９】上記各信号切換部を用いた信号処理装置を含
んだ受信装置を、送信装置と一体化して有してなる、携
帯電話装置の構成を示すブロック図である。FIG. 19 is a block diagram illustrating a configuration of a mobile phone device including a receiving device including a signal processing device using each of the signal switching units and a transmitting device.

【符号の説明】[Explanation of symbols]

１受信装置、１５送信装置、２１信号処理装置、
２７ＰＳＩ−ＣＥＬＰデコーダ、３２信号切換部、
３６線形予測係数→自己相関（α_N→ｒ_N）変換部、３
７自己相関広帯域化部、３８広帯域コードブック、
３９自己相関→線形予測係数変換部、４０ＬＰＣ合
成部、４１励振源拡張部、４５アップサンプル部、
４６加算部、４７ポストフィルタ（ａ）、４８，４
９ポストフィルタ（ｂ）、１５０切り換えスイッチ1 receiving device, 15 transmitting device, 21 signal processing device,
27 PSI-CELP decoder, 32 signal switching unit,
36 Linear prediction coefficient → autocorrelation (α_N → r_N ) converter, 3
7 autocorrelation broadband unit, 38 wideband codebook,
39 autocorrelation → linear prediction coefficient conversion section, 40 LPC synthesis section, 41 excitation source expansion section, 45 upsample section,
46 adder, 47 post filter (a), 48, 4
9 Post filter (b), 150 changeover switch

フロントページの続き (72)発明者大森士郎東京都品川区北品川６丁目７番35号ソニー株式会社内Ｆターム(参考） 5D045 CB10 5K052 AA00 BB02 EE07 EE40 FF07 GG34 GG48 9A001 CC02 EE05 JJ12 KK56Continuing from the front page (72) Inventor Shiro Omori 6-7-35 Kita-Shinagawa, Shinagawa-ku, Tokyo Sony Corporation F-term (reference) 5D045 CB10 5K052 AA00 BB02 EE07 EE40 FF07 GG34 GG48 9A001 CC02 EE05 JJ12 KK12