JPS6033635A

Movatterモバイル変換

Info

Publication number: JPS6033635A
Application number: JP14337183A
Authority: JP
Inventors: Shigetatsu Katori; 香取　重達
Original assignee: NEC Corp; Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1983-08-05
Filing date: 1983-08-05
Publication date: 1985-02-21
Also published as: JPS6359175B2

Abstract

PURPOSE:To obtain a pipeline controlling system which is optimum to a microcomputer by constituting said system so that all the function units start simultaneously the next basic processing in accordance with the execution procedure, and the execution of the prescribed next instruction is started. CONSTITUTION:A stage start signal 7-7' is AND of each output of the first, the second and the third stage end FFs 7-4, 7-5 and 7-6 obtained by an AND gate 7-7. When the stage start signal 7-7' is inputted, each function unit of a bus interface part 7-1, an operand address calculating part 7-2 and an executing part 7-3 resets the first, the second and the third stage end FFs 7-4, 7-5 and 7-6, and simultaneously starts processing of the next one stage portion. A timing storage part 7-8 holds a timing signal which is set in advance so that the next signal can be started without collision, and sends out its timing signal at prescrived timing.

Description

Translated fromJapanese

【発明の詳細な説明】〔発明の属する技術分野〕不発明は、複数の命令を並列に実行するデータ処理装置
のパイプライン制御方式に関する。DETAILED DESCRIPTION OF THE INVENTION [Technical field to which the invention pertains] The present invention relates to a pipeline control method for a data processing device that executes a plurality of instructions in parallel.

〔従来技術〕[Prior art]

マイクロコンピュータの如きデータ処理装置ニおいては
、処理速度の高速化を達成するために。In data processing devices such as microcomputers, to achieve faster processing speed.

動作クロ、りを品める方法と、内部の制御方式を改善す
る方法の２７１１１りの手段がとられている。このうち
、動作クロックを高める方法は、デノくイス技術に大き
く依存【７デバイス技術の改善に伴なってせいぜい１０
％程度の高速化が達成させるにとどまる。このため性能
を抜本的に改＠する場合にはパイプライン制御などの新
しい内部の開側ｊ方式を採用して対応している。A total of 27,111 measures have been taken, including ways to improve the operational complexity and methods to improve the internal control system. Among these methods, methods to increase the operating clock largely depend on device technology [7] As device technology improves, at most 10
The speedup can only be achieved by about %. For this reason, when we want to drastically improve performance, we adopt a new internal open-side method such as pipeline control.

パイプラインＨｉｌｌ　ｌ１１４１方式では、１つの命
令の実行で行なわれる一連の処理をいくつかの基本処理
に分割し、これらの基本処理毎にそれぞれの処理を逐行
するのに盛装な専用ハードウェア全備えた腹数個の機能
ユニット全用意する。これらの機能ユニットヲ処理順序
に従って連続的に動作させる事により、全体の処理速度
の向上をはかるもので。In the pipeline Hill 1141 method, a series of processes performed by executing one instruction is divided into several basic processes, and a complete set of dedicated hardware is used to execute each of these basic processes one by one. Prepare several functional units. By operating these functional units continuously according to the processing order, the overall processing speed is improved.

処理速反り向上によく使われている方法である。This is a method often used to improve processing speed and warpage.

従来使用されているパイプライン制御方式を説明する。A conventionally used pipeline control method will be explained.

ｌず第１図のブロック図において５本従来例における中
央処理装＠（以下、ＣＩ）Ｕという・）が持つ３個の機
能ユニットの機能？：説明４−る。バスインタフェース
部１−１はメモリから命令コードの胱出しや、データの
書込み読出しを行ない。In the block diagram of Figure 1, what are the functions of the three functional units of the central processing unit (hereinafter referred to as CI) in the conventional example? :Explanation 4-ru. The bus interface section 1-1 extracts instruction codes from the memory and writes and reads data.

オペランドアドレス計算部１−２はメモリ参照時。Operand address calculation unit 1-2 is used when referencing memory.

メモリのアドレス修飾を行ない、実行部１−３は各イ■
の転送、演算処理′ｆ：行なう。The execution unit 1-3 modifies the memory address, and the execution unit 1-3
Transfer and arithmetic processing 'f: Execute.

第２図は、第１図に示した３個の機能ユニットからイＩ
′１城されるＣＰＵのメモリ間Ｃ１Ｊ度算命令の動作図
である。これは、１つの命令の実行に際し。Figure 2 shows an example of the three functional units shown in Figure 1.
FIG. 2 is an operation diagram of the CPU's inter-memory C1J calculation instruction, which is executed by '1'. This is when executing one instruction.

各機能ユニットがどのように使われてゆくがをあらかし
め定められた時間単位（以下、ステージという。）に時
間を追って示したもので、各ステージの長さは固定さＪ
′１．ている。すなわち、各機能ユニットｉｄ以下の順
序でＪン［定の動作を行う。It shows how each functional unit is used over time in predetermined time units (hereinafter referred to as stages), and the length of each stage is fixed.
'1. ing. That is, each functional unit performs a certain operation in the order following its ID.

ＴＡＩの最初のステージの期間；バスインタフェース部
１−１で命令コードの読出しを行なう・ＴＡ２の２番目
のステージの期間；実行部３．−３で命令の解読を行う
。Period of first stage of TAI; instruction code is read by bus interface unit 1-1. Period of second stage of TA2; execution unit 3. -3 decodes the command.

Ｔλ３０３番目のステージの期間；オペランドアドレス
計算部１−２でメモリ参照時のアドレス修飾を行なう。Tλ30 Third stage period: The operand address calculation unit 1-2 modifies the address when referencing the memory.

１人４の４番目のステージの期間；バスインタフェース
部１−１において、メモリからのデータの読出しを行な
う。Fourth stage period for one person 4: Data is read from the memory in the bus interface section 1-1.

ＴＡ５の５番目のステージの期間；実行部１−３で演算
処理を行なう。During the fifth stage of TA5, the execution unit 1-3 performs arithmetic processing.

ＴＡ６の６番目のステージの期間；バスインタフェース
部１−１で、訊算結果全メモリへ格納する・本従来例で
説明するＣＰＵは、他１（も多くの命令を持ち様々な処
理が可能であるが、１つの命令に対して機能ユニットが
使われる順序はどれも等しく、従って、すべての命令は
、２８２図の動作図に示す通ｖｖｃ各機能ユニットが使
わｈ処理が進められる。Period of the 6th stage of TA6: The calculation result is stored in all memories at the bus interface unit 1-1. However, the order in which functional units are used for one instruction is the same, and therefore, all instructions are processed using each functional unit as shown in the operation diagram of FIG. 282.

次に第２図の動作図ｅこ示す様な６個のステージから構
成される動作図を持つ命令をパイプライン制御で処理す
るときの制御方法を説明する。Next, a control method when an instruction having an operation diagram consisting of six stages as shown in the operation diagram e in FIG. 2 is processed by pipeline control will be explained.

命令の処理を７１１２１７式に進める際に問題になるの
は、１つの命令を実行中に次の６令の実行を開始するタ
イミングである。The problem when proceeding with instruction processing according to the 711217 method is the timing of starting execution of the next six instructions while one instruction is being executed.

第３図（ａ）〜（ｆ）の動作図は、１つの命令の実行中
に次の命令を開始するステージ毎の各ユニットの動作状
態を示したものである。この第３図（ａｌ〜（ｆ）の動
作図に示す通り、Ｂの命令を実＜Ｊ中に、第３図（ｂ）
に示すタイミングで次のＮ命令を開始ツーると。The operation diagrams in FIGS. 3(a) to 3(f) show the operating state of each unit at each stage in which the next instruction is started during the execution of one instruction. As shown in the operation diagrams in FIG. 3 (al to (f)), during execution of the command B, in FIG. 3 (b)
Start the next N command at the timing shown in .

バスインタフェース部１−１の１１’、、６リステージ
で。11', 6 restage of bus interface section 1-1.

又、第３図（Ｃ）に示すタイミングでは、バスインタフ
ェース部１−１のＴＢ４　ステージとツコ行坩ｔ　］　
−３ノｌ１１８５ステージで、又、第３図（ｅ）に不す
タイミングでは、バスインタフェース部１−１のＩｌｌ
　Ｂ６のステージで、それぞれ命令Ａにおける処理と命
令Ｂにおける処理が衝突する。Moreover, at the timing shown in FIG. 3(C), the TB4 stage of the bus interface section 1-1 and the TB4 stage t]
At stage 1185 of bus interface section 1-3, and at the timing shown in FIG. 3(e),
At stage B6, the processing in instruction A and the processing in instruction B conflict.

第４図の衝突図は、第３図ｆａ）から第３図（ｆ）に示
した次の命令の各起動タイミングに対応して命令の実行
中に機能ユニット内で衝突が発生するかどうかを示した
ものである。衝突図のも列は次の命令の開始タイミング
を示し１表内のＯはそのタイミングで次の命令全開始し
ても衝突がなく、パイプライン動作が正常に維持される
事′ｆ：、１はそのタイミングで天の命令全開始したら
必らず機能ユニット内で、前の命令と次の命令の処理が
衝突ｈパイプラインの正常動作が継続できない事を表わ
している。The collision diagram in Figure 4 shows whether or not a collision occurs within the functional unit during the execution of an instruction, corresponding to the activation timing of the next instruction shown in Figures 3fa) to 3(f). This is what is shown. The column in the collision diagram indicates the start timing of the next instruction, and O in the table 1 indicates that there will be no collision even if all the next instructions start at that timing, and the pipeline operation will be maintained normally.'f:,1 This means that if all instructions are started at that timing, the processing of the previous instruction and the next instruction will collide within the functional unit. This means that the normal operation of the pipeline cannot continue.

不従来例の第４図の衝突図では１つの命令の実行中ｔＡ
１．　’Ａ４．　ｔＡ６のタイミングで、次の命令を開
始すれば衝突はないが、を人２．　ｔＡ３．　ｔＡ５で
は衝突が発生する。この’ｇＲｉ突図に基づいて、パイ
プライン制御で最も重要な次の命令の開始タイミングが
決定される。In the collision diagram of FIG. 4 of the non-conventional example, during the execution of one instruction tA
1. 'A4. If the next command is started at the timing of tA6, there will be no collision, but if the user 2. tA3. A collision occurs at tA5. Based on this 'gRi projection, the start timing of the next instruction, which is most important in pipeline control, is determined.

次に第５図に７ドフ“パイプライン制御回路のブロック
図で、１′にの命令の開始タイミングを決定するハード
ウェア構成及び動作を説明する・レジスタ５−１−０か
らレジスタ５−１−５は。Next, FIG. 5 is a block diagram of the 7Doff pipeline control circuit, and the hardware configuration and operation that determine the start timing of the 1' instruction will be explained. Register 5-1-0 to register 5-1- 5 is.

第４図の衝突図に示したパイプライン処理中に衝突が発
生するかと９かのデータ全保持する。この６個のレジス
タ５−１−５からレジスタ５−１−〇でシフトレジスタ
５　”−１ｆｉ”構成する。If a collision occurs during the pipeline processing shown in the collision diagram of FIG. 4, all data is retained. These six registers 5-1-5 to 5-1-0 constitute a shift register 5"-1fi".

シフトクロック５−２は各ステージにおける処理が終了
する毎にアクティブになりシフトレジスタ５−１円のデ
ータを左シフトする。シフトレジスタ５−１の最終出力
であるレジスタ５−１−５の出力は命令開始信号５−３
で、パイプライン制御における次の命令σノ開始全制御
する。０°′ならば次の命令を起動し１ｎならば起動し
ない。The shift clock 5-2 becomes active every time the processing at each stage is completed, and shifts the data in the shift register 5-1 to the left. The output of register 5-1-5, which is the final output of shift register 5-1, is the instruction start signal 5-3.
Then, the start of the next instruction σ in pipeline control is completely controlled. If it is 0°', the next instruction is activated, and if it is 1n, it is not activated.

初期衝突データ発生回路５−４は１次の命令が開始され
たときに、シフトレジスタ５−１円のデータを補正する
のに使用される１組みゲート５−５は、命令開始信号５
−３の制御でシフトレジスタ５−１内の次の段のレジス
タへ入力するデータを選択するも（／Ｊで、命令開始１
６号５−３が０”ならば、シフトレジスタ５−１内の各
レジスタのデータと初期衝突データ発生（ロ）路５−４
の対応するビットの論理和が次段のレジスタに入力する
。１”ならば、各レジスタ内のデータが単純に次段のレ
ジスタに入力する・第６図の動作図は、第５図のレジスタ５−１−５からレ
ジスタ５−１−０までのレジスタで構成されるシフトレ
ジスタ５−１のデータ保持状態をステージ毎に表わした
ものである。初期状態では、レフトレジスタ５−１内に
は、第４図の衝突図に示されりＯ″、″ｌ”、′１″、
″ｌ　Ｑ　！＋　、１”、０”がこの順に格納されてい
る。最初の命令が実行されると、その最初の第１ステー
ジで、シフトレジスタ５−１の最終出力である命令開始
信号５−３の状態をチェックする。この信号の状態が０
°′なりで、次の第２ステージで、（Ｋ命令をただちに
開始する。同時に新しく命令が開始された事により衝突
図の補正が必要となり１組みゲート５−５の制御で初期
衝突データ発生回路５−４で補正された衝突データが左
シフトされ、これにより、シフトレジスタ５−１の内容
は、第６図の第２ステージに示すデータとなる。第２ス
テージ内でｒｌ＋び。The initial collision data generation circuit 5-4 is used to correct the data in the shift register 5-1 when the primary instruction is started.The set of gates 5-5 receives the instruction start signal 5.
-3 control selects the data to be input to the next stage register in shift register 5-1 (/J starts instruction 1
If No. 6 5-3 is 0", the data of each register in the shift register 5-1 and the initial collision data generation (b) path 5-4
The logical sum of the corresponding bits is input to the next stage register. 1", the data in each register is simply input to the next register. The operation diagram in Figure 6 shows the registers from register 5-1-5 to register 5-1-0 in Figure 5. The data holding state of the shift register 5-1 is shown for each stage.In the initial state, the left register 5-1 contains O'', ``l'' as shown in the collision diagram of FIG. ",'1",
"l Q !+, 1", and 0" are stored in this order. When the first instruction is executed, at the first stage, the instruction start signal 5, which is the final output of the shift register 5-1, is stored in this order. Check the status of -3.If the status of this signal is 0
°', in the next second stage, the (K command is started immediately. At the same time, since a new command is started, it is necessary to correct the collision diagram, and the initial collision data generation circuit is controlled by the first set of gates 5-5. The collision data corrected in step 5-4 is shifted to the left, so that the contents of the shift register 5-1 become the data shown in the second stage of FIG. 6. In the second stage, rl+ is shifted.

命令開始信号５−３の状態をチェ、７りする。今度は”
１”なので、次の第３ステージは＋ａ、ｒ　３の命令の
起動はしない。又１組みゲート５−５の制御で、シフト
レジスタ５−１内で単純にデータの左シフトを行なう。Check the state of the command start signal 5-3. Next time"
1'', the next third stage does not activate the +a, r3 instructions. Also, under the control of the first set of gates 5-5, data is simply shifted to the left in the shift register 5-1.

以下同様に、各ステージの期間で命令開始１ｄ号５−：
うの状態をチェックし、上に記した処人里をくりかえす
。Similarly, the instruction starts in the period of each stage 1d No. 5-:
Check the condition of the fish and repeat the treatment described above.

以上説明した通り、促米のパイプライン制御方式は凌雑
な制御アルゴリズムに基づいた特殊な制御回ｌ！！６全
使用している。As explained above, Zakumai's pipeline control method is a special control circuit based on a complicated control algorithm. ! 6 are all in use.

ＣＰＴＪが持つナベての命令のステージ毎の各機能ユニ
、Ｖトの処理手順を、すべて同一としてすべての命令が
同一の動作Ｉｎ持つため、レジスタ間転送といった簡単
な命令からメモリ間のｅｉ、算の様な葭雑な命令を持つ
データ処理裟１６．へは応用が難しいという第１ｑ）大
きな欠点がある。The processing procedures for each functional unit and V for each stage of all the instructions in the CPTJ are all the same, and all instructions have the same operation. 16. Data processing equipment with complicated instructions such as There is a major drawback in 1q) that it is difficult to apply.

又、谷ステージの処理時間が一律に足められ六方式のた
め％　１ステージの処理時間を伸ばして、１つり機能ユ
ニット内での処理量を旨めるｊ」１が容易でなく、パイ
プラインの融通性に欠けるという第２の欠点がある。In addition, since the processing time of the valley stage is uniformly added to the six methods, it is not easy to extend the processing time of one stage and increase the processing amount within one functional unit, and the pipeline The second drawback is the lack of flexibility.

特に、バスとのインタフェース部ではメモリのアクセス
時間が長い場合には、１ステ一ジ処理時間内にメモリの
参照が完了せず、このためこの部分だけを特別にパイプ
ライン処理からはずした。す。In particular, if the memory access time is long in the interface section with the bus, the memory reference will not be completed within the processing time of one stage, so this section was specifically excluded from pipeline processing. vinegar.

又は、使用するメモリ全高速メモリに限るといった様々
な制約がつくという第３の欠点がある。Alternatively, there is a third drawback that there are various restrictions such as limiting the memory used to all high-speed memories.

更にこのパイプライン制御方式を、マイクロコンピュー
タに応用すると、パイプライン制御用回路が必要なため
１回路を複雑するはかりでなくチップサイズを大きくす
るという第４の欠点がある。Furthermore, when this pipeline control method is applied to a microcomputer, there is a fourth drawback that since a pipeline control circuit is required, one circuit is not complicated and the chip size is increased.

〔発明の目的〕[Purpose of the invention]

不発明の目的は、上記の諸欠点を取り除き、マイクロコ
ンピュータに最も適したパイプライン制御方式會提供す
る事にある。The object of the invention is to eliminate the above-mentioned drawbacks and provide a pipeline control system most suitable for microcomputers.

〔発明Ｃυ構成〕[Invention Cυ configuration]

本発明のパイプライン制御方式は、（層数の命令を並列
に実行するデータ処理装置のパイプライン制御方式にお
いてｂ　ｆ：ｌｉＪ記テーデー理装置が、命令実行手順
に従って分けられた複数の相異なる基本処理全専門［％
行する複数の機ｈ（シュニットと、実行中の命令の次の
命令の実行開始タイミングをすべての命令にわたって制
御するタイミング制御手段と、前記機能ユニット毎にそ
れぞれｔｆｉｌ記基本処理の完了状態を保持する保持手
段と、該保持手段の保持状態に対応して前記機能ユニッ
トに前記基本処理の開始悟号葡送出する処理開始制御手
段と’ｚ　（Ｉｉｔｉえ、前記保持手段が前記基本処理
の完ｒ状態全１呆持することに同期して前記処理開始制
御手段により前記機能ユニットのすべてが前Ｆｉ＋、命
令実行手順に従って次の基本処理ケ同時に開始すると共
に前記タイミング制御手段によりｊＪｒ定の次の命令の
実行全起動させることがらｔｔ成される・〔実施例の説
明〕以下−１不発明の実施例について図面を参Ｉｔｓ（して
ｄ発明する。The pipeline control method of the present invention is a pipeline control method for a data processing device that executes a number of layers of instructions in parallel. All processing specialties [%
a timing control means for controlling the execution start timing of the next instruction of the instruction being executed across all instructions; a holding means; and a processing start control means for sending a start signal for the basic processing to the functional unit in accordance with the holding state of the holding means. In synchronization with the pause for all 1, the processing start control means causes all of the functional units to simultaneously start the next basic processing according to the instruction execution procedure, and the timing control means starts the next basic processing at the same time. [Explanation of Embodiments] Below, reference is made to the drawings for embodiments of -1 non-invention.

第７図は本発明の一実施例全適用し／ζＵ　Ｐ　Ｕのブ
ロック図である。FIG. 7 is a block diagram of a fully applied /ζU P U according to an embodiment of the present invention.

ＣＰＵが、命令実行手順に従って分けられた３りの相異
なる基本処理を専門に実行する３個の機能ユニット、パ
スインタフェース部７−１．オヘラεノド計算部７−２．実部７−３と、実行中の命令の次の
命令の実行開始タイミングをすべての命令にわたって制
御するタイミング制御手段としてのタイミング記憶部７
−８と、前記機能ユニット毎にそれぞれ前記基本処理の
完了状態を保持する保持手段としての第１．第２．第３
のステージ終了フリッグフロ、プ（以下、ステージ終了
Ｆ／Ｆという、）７−４．７−５．７−６と、この保持
手段の保持状態に対応してｍｌ記機能ユニットに前記基
本処理の開始信号を送出する処理開始制御手段としての
ＡＮＤゲート７−７と奮備え、第１．第２、第３のステ
ージ終了Ｆ／Ｆ７−４．７−５゜７−６が前記基本処理
の完了状態を保持することに同期してＡＮＩ）ゲー）　
７−７　ｙｊｈら送出される命令開始信号７−７′によ
りバスインタフェース部７−１．オペランド言１算部７
−２．実行部７−３のすべてがｊ４ｉＪ　記命令実行手
順に従って次の基本処理を同時に開始すると共にタイミ
ング記憶部７−８からのタイミング信号７−８′より１
ノ１足の次の命令の実行全起動させることから４１４成
される。The CPU has three functional units, a path interface section 7-1, which specialize in executing three different basic processes divided according to instruction execution procedures. Ohera ε throat calculation section 7-2. a real part 7-3, and a timing storage unit 7 as a timing control means for controlling the execution start timing of the next instruction after the instruction being executed over all instructions.
-8, and a first. Second. Third
The stage end F/F (hereinafter referred to as stage end F/F) 7-4.7-5.7-6 corresponds to the holding state of this holding means, and the ml function unit starts the basic processing. AND gate 7-7 as a processing start control means for sending a signal, and the first. 2nd and 3rd stage end F/F7-4.7-5゜7-6 maintains the completion state of the basic processing (ANI) game)
7-7 yjh and the like, the bus interface section 7-1. Operand word 1 arithmetic section 7
-2. All of the execution units 7-3 simultaneously start the next basic processing according to the instruction execution procedure j4iJ, and the timing signal 7-8' from the timing storage unit 7-8
Step 414 is performed by starting the execution of the next instruction for one step.

なお、タイミング記憶部７−８は本実施例ではバスイン
タフェース部７−１内に設けであるが。Note that the timing storage section 7-8 is provided within the bus interface section 7-1 in this embodiment.

これは機能ユニット外Ｋまとめて設けても良く。This may be provided outside the functional unit.

又必要に応じ各機能ユニット内に分散しＣ設けられる。Further, C is provided in a distributed manner within each functional unit as necessary.

本実施例は第１図にボした従来例のＣＰ　ＩＪに適用し
たものであり、バスインタフェース部７−１゜オペラフ
ドアドレス計３ｊｔ部７−２．突汀部７−３の３個の機
能ユニットは、絹１図の従来例のブロック図に示したＣ
ＰＵの各（７，１能ユニットと同一の処理を行なう。The present embodiment is applied to the conventional CP IJ shown in FIG. The three functional units of the ridge part 7-3 are C shown in the block diagram of the conventional example in Figure 1.
Each of the PUs performs the same processing as the 7 and 1 function units.

第１のステージ終了に’／Ｆ　７−４は、バスインタフ
ェース部７−１の１ステ一ジ分の処理が完了するとセッ
トさＪしる。第２のステージ終了Ｆ／Ｆ７−５は、オペ
ランドアドレス計３１−　部７−２　（Ｑ　１ステ一ジ
分り処理が完了するとセットされる。第３のステージ終
了Ｆ７１ｒ７−６は実１１部７−３の１ステ一ジ分り処
理が完了するとセットされる。At the end of the first stage, /F 7-4 is set when one stage of processing by the bus interface section 7-1 is completed. The second stage end F/F7-5 is set when the operand address total 31- section 7-2 (Q) is completed. The third stage end F/F7-6 is set when the processing for one stage is completed. It is set when one stage of processing in step 3 is completed.

ステージ開始１計号７−７′はＡＮＩＪゲート７−７に
より得られた、第１．第２．第３のステージ終了Ｆ　／
　Ｆ　７−４　、７−５　、　７−６の各出力の論理積
で、この信号が入力すると、バスインタフェース部７−
１．オペランドアドレス計算部７−２゜実行部７−３の
各機能ユニットは、第１．第２゜第３のステージ終了Ｆ
／Ｆ７−４．７−５．７−６全リセツトし、同時に次の
１ステ一ジ分の処理を開始する。タイミング記憶部７−
８は、あらかじめ次の信号を衝突なしに開始できるよう
にセットされたタイミング信号が保持させられており。Stage start 1 number 7-7' is the 1st digit obtained by ANIJ gate 7-7. Second. 3rd stage end F/
When this signal is inputted by logical product of each output of F7-4, 7-5, and 7-6, the bus interface section 7-
1. Each functional unit of the operand address calculation section 7-2 and execution section 7-3 has a first . 2nd゜3rd stage end F
/F7-4.7-5.7-6 Resets everything and simultaneously starts processing for the next one stage. Timing storage section 7-
8 holds a timing signal set in advance so that the next signal can be started without collision.

所定のタイミングでそのタイミング信号を送出する。The timing signal is sent out at a predetermined timing.

第８図（ａ）の動作図は１本実施例に基づ（ＣＰＵのメ
モリ間の演ｑ、命令の動作図で、各機能ユニット毎の処
理動作は、第２図に示す従来例のものと’Ｉ’ｃｓ　の
第５ステージを除いて同一である。１１ｃ５の第５ステ
ージでは実行部７−３において、バスインタフェース部
７−２で読出されたデータと所定レジスタとの間での演
算のほかに１次の命令を起・１のさせる。The operation diagram in FIG. 8(a) is based on this embodiment (it is an operation diagram of operations and instructions between the CPU's memories, and the processing operation of each functional unit is based on the conventional example shown in FIG. 2). and 'I'cs are the same except for the fifth stage. In the fifth stage of 11c5, the execution unit 7-3 performs an operation between the data read by the bus interface unit 7-2 and a predetermined register. In addition, it causes a primary command to be activated/set to 1.

第８図（ｂ）は、同様に本実施例に基づ（ＣＰＵのレジ
スタ間の転送命令の動作図で　＋ｐＤ１　の最初のステ
ージのＪＷ間、バスインタフェース部７−１で命令コー
ドの読出しを行なうｌｌ’ｌＤ２　０２番目のステージ
の期間実行部７−３で命令のｊＩｌ￥読を行なう・ＴＤ
３　の３番目のステージの期間実行部７−３でデータの
転送全行なうほか１次の命令を起動させる。FIG. 8(b) is a diagram of the operation of a transfer instruction between registers of the CPU based on the present embodiment. Between the JWs of the first stage of +pD1, the bus interface section 7-1 reads out the instruction code. ll'lD2 02nd stage period execution unit 7-3 reads instruction TD
During the period of the third stage of 3, the execution unit 7-3 performs all data transfers and also activates the primary instruction.

次に、第９図に示す本実施例のタイミング図を参照して
、第８図（ａ）、　（ｂ）の動作図に示した２つの命令
が連続的に処理される場合の動作を説明する。Next, with reference to the timing diagram of this embodiment shown in FIG. 9, the operation when the two instructions shown in the operation diagrams of FIGS. 8(a) and 8(b) are successively processed will be explained. do.

ｔｃｌのタイミングで命令Ｃ（メ士り間の演算命令）が
起動され、同時にオペランドアドレスロ１算Ｍ１ｉ７−
２．実行部７−３も、Ｔｏｌの期間１ステ一ジ分の処理
全行なう。ｌ１ｌｃｌの終わりまでに各機能ユニットは
１ステ一ジ分の処理を終え、それぞれ。At the timing of tcl, instruction C (operation instruction between calculations) is activated, and at the same time, the operand address row 1 calculation M1i7-
2. The execution unit 7-3 also performs all the processing for one stage of the Tol period. By the end of l1lcl, each functional unit has completed one stage of processing, respectively.

第１．第２．第３（１）ステージ終了Ｆ／Ｐ７−４゜７
−５．７−６ｆ：セットする。これらの出力の論理積が
ＡＮＤゲート７−７によシステージ開始信号７−７′　
となり第１．第２．第３のステージ終了Ｆ／Ｆ７−４．
７−５．７−６をリセットすると同時に１次の１゛Ｃ２
に和尚する第２番目のステージの処理を、バスインタフ
ェース部７−１．オペランドアドレス計算部７−２．実
行部７−３に一斉に起動させる。1st. Second. 3rd (1) Stage End F/P7-4゜7
-5.7-6f: Set. The AND gate 7-7 generates the stage start signal 7-7'.
Next is the first one. Second. Third stage end F/F7-4.
7-5. At the same time as resetting 7-6, the primary 1゛C2
The second stage processing is performed by the bus interface unit 7-1. Operand address calculation unit 7-2. The execution units 7-3 are activated all at once.

以上の処理金続け　Ｉｌｌ　６５　の第５番目のステー
ジの起動に同期して、あらかじめ命令Ｃ（メモリ間の演
算命令）で定められた通りタイミング記憶部７−８から
読出されたタイミング信号７−８′により次の命令１〕
（レジスタ間の転送命令）の実行を開始する。同様に　
＋１＋　ｃ７　のＤ命令における第３番目のステージの
起動に同期してＤ命令（レジスタ間の転送命令）にあら
かじめ定められた通り、タイミング記憶部７−８から読
出されたタイミング信号７−８′によ勺次の命令Ｅの実
行を開始する。The above processing continues.In synchronization with the activation of the fifth stage of Ill 65, the timing signal 7-8 is read out from the timing storage unit 7-8 as determined in advance by instruction C (inter-memory operation instruction). ’ causes the next command 1]
(transfer instruction between registers) starts execution. similarly
+1+c7 In synchronization with the activation of the third stage in the D instruction, the timing signal 7-8' read out from the timing storage section 7-8 is set as predetermined in the D instruction (transfer instruction between registers). Execution of the next instruction E begins.

以上説明した通り、不発明の一実施例全適用したＣＰＵ
は１次の命令の起動タイミングは、衝突図金引くのでは
なく１％命令毎に次の命令を起動してもすべての命令に
対して衝突が発生しないタイミングで命令毎にあらかじ
め定められているので、パイプライン動作をスケシ−ル
するための。As explained above, a CPU to which all embodiments of the invention are applied
The activation timing of the first instruction is predetermined for each instruction at a timing that does not cause a collision for all instructions even if the next instruction is activated for every 1% instruction, rather than deducting the collision figure. So for scheduling pipeline operations.

従来必要とされた第５図に示す起動タイミング決定用の
バイグライン制御回路が不要とな、り制御回路を著しく
簡略化する事が可能である。The conventionally required big-line control circuit for determining start-up timing shown in FIG. 5 is no longer necessary, and the control circuit can be significantly simplified.

又、各機能ユニット間の同期関係が、従来の様に固定さ
れた特定の期間に一律に同期するものではなく、１ステ
一ジ分の処理が単位となって各機能ユニット間の１ステ
一ジ分の処理がすべて完了してから次のステージへ移る
。従って１機能ユニットの設刷では処理内容に融通性が
増し、特にバスインタフェース部も、メモリのアクセス
時間に関係なくメモリ参照が完了した時点で１ステ一ジ
分の処理の完了とする事ができ、パイプライン動作の中
にかなりの柔軟性が生じる。In addition, the synchronization relationship between each functional unit is not uniformly synchronized during a fixed specific period as in the past, but the processing for one step is a unit, and the synchronization relationship between each functional unit is synchronized in one step. After all processing is completed, move on to the next stage. Therefore, when printing a single functional unit, there is greater flexibility in the processing content, and especially for the bus interface section, the processing for one stage can be completed when the memory reference is completed, regardless of the memory access time. , allowing considerable flexibility in pipeline operation.

又、命令の動作図は、−律ではなくａ令毎に最適１ヒさ
れているので、汎用のデータ処理装置にも容易に応用す
る事ができる。Furthermore, since the instruction operation diagram is optimized for each a instruction rather than the - rule, it can be easily applied to a general-purpose data processing device.

〔発明の効果〕〔Effect of the invention〕

以上詳細に説明した通り、不発明のパイプライン制御方
式は、上記の構成をとることにより、従来のようにすべ
ての命令が同一の動作図でなく命令毎に最も適した動作
図金持ち、各（歳能ユニットの同期関係は従来のように
一律に固定された周期ではなく１ステージ内の１処理の
完了に基づいており、又、命令の開始タイミングは従来
のように衝突図によるのでなく各命令毎に次の命令を起
動してもすべての命令に対して衝突が発生しないタイミ
ング金あらかじめ規定しているので、従来のパイプライ
ン制御方式に比較して、）蔦−ドウエア量の増力口も最
少限に抑えられ、かつ設計における融通性も備えた高速
なパイプライン制御方式を得ることができるという効果
會有している。As explained in detail above, the inventive pipeline control system has the above configuration, so that all instructions do not have the same operation diagram as in the past, but the most suitable operation diagram for each instruction ( The synchronization relationship of the processing units is based on the completion of one process in one stage, rather than a uniformly fixed period as in the past, and the start timing of the command is not based on a collision diagram as in the past, but is based on each command. Since the timing at which collisions do not occur for all instructions is predefined even if the next instruction is started every time, the amount of increase in the amount of air is also minimized compared to the conventional pipeline control method. This has the advantage that it is possible to obtain a high-speed pipeline control method that is suppressed to a minimum and has flexibility in design.

特に、高性能化が著しいマイクロコンビーータへ応用し
た場合には、ノ＼−ドウエア量の増加が最少限であり、
逐次処理Ｖこ比較すると、大幅な処理速度の改善が期待
でき、芙用効果を最も顕著に発揮できる。In particular, when applied to microcombinators with significantly improved performance, the increase in the amount of node hardware is minimal.
Comparing the sequential processing V, a significant improvement in processing speed can be expected, and the effect of fertilization can be exhibited most noticeably.

【図面の簡単な説明】[Brief explanation of drawings]

第１図は、従来のパイプライン制御方式ｑノー例に用い
たＣＰＵのブロック図、第２図は第１図に示したＣＰＵ
の命令の動作図、第３図（ａｌ〜（ｆ）は。第２図に示した動作図で次の命令を各ステージ毎に起動
させた時の動作図、第４図は第２図に示した動作図によ
るｅＰＵ（１）衝突図、第５図は第２図に示した動作図
によるＣ　Ｐ　［Ｊのパイプライン制御回路のブロック
図、第６図は第５図のパイプライン制御回路のデータの
変化全示す動作図、第７図は本発明の一実施例に用いた
Ｃ　Ｐ　ＩＪのブロック図、第ン）図（ａ）、　（ｂｌ
は第７図に示すＣＰＵの命令の動作図、第９図は第７図
に示したＣ　１）　Ｕのタイムチャートである。１−１・・・・・・バスインタフェースｆｆ１ｉ、１−
２・・・・・・オペランドアドレス計算部、１−３・・
・・・実行部。５−１・・・・・・シフトレジスタ、５−１−０．５−
１−１，５−１−２．５−１−３．５−１−４．５−１
−５・・・・・・レジスタ、５−２・・・・・・シフト
クロック、５−３・・・・・・命令開始信号、５−４・
・・・・・初期衝突データ発生回路、５−５・・・・・
・組みゲート＆　７−１・・・・・・バスインタフェー
ス＊Ｌ７−２・・・・・オペランド計算部、７−３・・
・・・・実行部、７−４．７−５゜７−６・・・・・・
第１．第２．第３のステージ終了フリップフロップ、７
−７・・・・・・ＡＮＤゲート、７−７’・・・・・・
命令開始信号、７−８・・・・・・タイミング信号記憶
部、７−８’・・・・・・タイミング信号＊　ＴＡ　１
−ＴＡ　ｓ。ＴＢ１ゞＴａ　１１．　ＴＣ１″ＴＣｓ、　ＴＤ　ｓゝ
ＴＤ３°°°°°°ステージ＆　ｔＡｌ〜ｔＡ６．　ｔ
ＣＩＡ＋ｔＣ８１１”タイミング・第１図第２図第４図（’ｌ）　ｈｓ（ｅ）　ｔＡｇ（↑）榮３図第５図第６図（イ）７図（Ｉ））第８図Figure 1 is a block diagram of the CPU used in the conventional pipeline control method q-no example, and Figure 2 is the CPU shown in Figure 1.
The operation diagram of the instruction shown in Figure 3 (al to (f)) is the operation diagram when the next instruction is started at each stage in the operation diagram shown in Figure 2, and Figure 4 is the same as Figure 2. Figure 5 is a block diagram of the pipeline control circuit of C P [J according to the operation diagram shown in Figure 2. Figure 6 is the pipeline control circuit of Figure 5. FIG. 7 is a block diagram of the CPIJ used in one embodiment of the present invention.
is an operation diagram of the CPU instructions shown in FIG. 7, and FIG. 9 is a time chart of C1) U shown in FIG. 7. 1-1...Bus interface ff1i, 1-
2... Operand address calculation section, 1-3...
...Execution department. 5-1...Shift register, 5-1-0.5-
1-1, 5-1-2.5-1-3.5-1-4.5-1
-5...Register, 5-2...Shift clock, 5-3...Instruction start signal, 5-4...
...Initial collision data generation circuit, 5-5...
・Assembled gate & 7-1...Bus interface *L7-2...Operand calculation section, 7-3...
...Execution part, 7-4.7-5゜7-6...
1st. Second. Third stage end flip-flop, 7
-7...AND gate, 7-7'...
Command start signal, 7-8...Timing signal storage section, 7-8'...Timing signal* TA 1
-TAs. TB1ゞTa 11. TC1″TCs, TD sゝTD3°°°°° Stage & tAl~tA6.t
CIA+tC811'' timing・Figure 1Figure 2Figure 4 ('l) hs (e) tAg (↑) Figure 3Figure 5Figure 6(A)Figure 7(I)) Figure 8

Claims

Translated fromJapanese

【特許請求の範囲】複数の命令全並列に実行するデータ処理装置のパイプラ
イン制御方式において、前記データ処理装置が、命令実
行手順に従って分けられた複数の相異なる基本処理を専
門に実行する複数の機能ユニットと、実行中の命令の次
の命令の実行開始タイミングをすべての命令にわたって
ｆｌｊｌＪ　ｌｌｌするタイミング制御手段と、前記機
能ユニット毎にそれぞれ前記基本処理の完了状態を保持
する保持手段と。該保持手段の保持状態に対応して前記機能ユニットに前
記基本処理の開始信号を送出する処理開始制御、＋１１
＋段と全備え、前記保持手段が前記基本処理の完了状ｒ
ｂＨｋ保持することに同期して前記処理開始制御手段に
より前記機能ユニ、トのすべてが前記命令実行手順に従
って次の基本処理を同時に開始すると共に前記タイミン
グ制御手段により所定の次の命令の実行を起動させるこ
とを特許とするパイプライン制御方式。[Scope of Claims] In a pipeline control method for a data processing device that executes a plurality of instructions in full parallel, the data processing device executes a plurality of specialized processes that execute a plurality of different basic processes divided according to instruction execution procedures. a functional unit, a timing control means for fljlJ llll the execution start timing of an instruction next to the instruction being executed across all instructions, and a holding means for holding a completion state of the basic processing for each of the functional units. +11 processing start control for sending a start signal for the basic processing to the functional unit in response to the holding state of the holding means;
+ stage is fully equipped, and the holding means is fully equipped with the completion status r of the basic processing.
In synchronization with holding bHk, the processing start control means causes all of the functional units to simultaneously start the next basic processing according to the instruction execution procedure, and the timing control means starts execution of a predetermined next instruction. A patented pipeline control method that allows