JPH06162067A

JPH06162067A - Device and method for controlling vector instruction

Info

Publication number: JPH06162067A
Application number: JP31317892A
Authority: JP
Inventors: Fujio Wakui; 富士雄涌井
Original assignee: Hitachi Ltd; Hitachi Computer Engineering Co Ltd
Current assignee: Hitachi Ltd; Hitachi Computer Engineering Co Ltd
Priority date: 1992-11-24
Filing date: 1992-11-24
Publication date: 1994-06-10

Abstract

PURPOSE:To execute a vector operation at a high speed by executing only a valid operation without executing an invalid operation, even if an arithmetic inhibiting bit in a vector mask register exists discontinuously. CONSTITUTION:The device consists of an instruction precedent stage for storing only an element and an element number of vector data in which an element value of a vector mask bit instructs to execute an operation in a buffer device 500, in operand vector data designated by an instruction, and an instructions execution stage for reading out the vector data and the element number from the buffer device 500, and writing a result of operation in vector registers 105, 106 in accordance with the element number. Also, in a period in which an instruction register group before by one is executing an instruction execution stage, and instruction precedent control stage of the next instruction register group is executed.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ベクトル演算を行うデ
−タ処理装置におけるベクトル命令制御方法およびその
制御装置に関し、特にマスク付きの演算で、ベクトルマ
スク中の演算許可要素を検出して、必要なベクトル要素
のみの演算を行い、先行制御により無効な演算を排除し
て、処理性能を向上することが可能なベクトル命令制御
装置およびその制御方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a vector instruction control method in a data processing apparatus for performing vector operations and a control apparatus therefor, and particularly, in an operation with a mask, an operation permission element in a vector mask is detected, The present invention relates to a vector instruction control device and a control method thereof that can perform processing only on necessary vector elements and eliminate invalid operations by advance control to improve processing performance.

【０００２】[0002]

【従来の技術】従来より、ス−パ−コンピュ−タは、演
算パイプライン方式により高速性を得ている。パイプラ
イン処理を効率よく行うためには、パイプが常時詰って
いる状態に保持されることが必要であり、そのために
は、（イ）プログラムが長いデ−タ系列に対して同一の
演算を繰り返すベクトル計算（ＤＯル−プ）を含むこ
と、（ロ）パイプに対する十分なデ−タ供給能力を有し
ていること、等の条件を満足する必要がある。このよう
に、ベクトル計算が大規模数値計算における計算時間の
主要部分を占めており、ベクトル計算に強いという特徴
を有していることから、パイプライン形科学技術用計算
機をベクトル計算機と呼んでいる。ベルトル演算を行う
場合、ベクトルレジスタに格納されているベクトルエレ
メントを順次読み出してベクトル演算器で演算し、その
結果を結果ベクトルレジスタに格納する。しかし、ベク
トルレジスタに格納されているベクトルエレメントのう
ち、不要なエレメントに対しては、ベクトルマスクレジ
スタに格納されるベクトルマスクにより対応するエレメ
ントを除去するように、ベクトル演算マスク制御を行っ
ている。しかし、従来のベクトル演算マスク制御では、
ベクトルマスクにより演算が抑止されている場合でも、
その演算は実行され、その結果の書き込みのみを抑止す
るように実行されていた。2. Description of the Related Art Conventionally, supercomputers have achieved high speed by an arithmetic pipeline system. In order to perform the pipeline processing efficiently, it is necessary to keep the pipe in a clogged state at all times. For that purpose, (a) the program repeats the same operation for a long data series. It is necessary to satisfy the conditions such as including vector calculation (DO loop), having a sufficient data supply capacity for the (b) pipe, and the like. In this way, since vector calculation occupies a major part of the calculation time in large-scale numerical calculation and has the characteristic of being strong in vector calculation, pipeline type scientific and technological computers are called vector computers. . In the case of performing the Bertre operation, the vector elements stored in the vector register are sequentially read out, the vector operation unit performs the operation, and the result is stored in the result vector register. However, of the vector elements stored in the vector register, vector operation mask control is performed so that unnecessary elements are removed by the vector mask stored in the vector mask register. However, in the conventional vector operation mask control,
Even if the operation is suppressed by the vector mask,
The operation was executed, and it was executed so as to suppress only writing of the result.

【０００３】図２は、従来のベクトル演算マスク制御を
行う情報処理装置の構成図である。図２において、１０
１は読み出しアドレスを格納するレジスタ、１０３はベ
クトルレジスタおよびベクトルマスクレジスタのアドレ
スをインクリメントするインクリメンタ、１０５，１０
６，１１５はそれぞれベクトルレジスタ、１０７はイン
クリメントされた現在のアドレスを格納するアドレスレ
ジスタ、１１０〜１１２，２１６〜２１８はアドレスお
よびマスクエレメントを演算時間に一致させるための遅
延ラッチ、１１３はベクトル演算器、１１４は最終アド
レスを検出して演算の終了を検知する終了検出器、２０
９はベクトルマスクレジスタである。このベクトル計算
機では、ベクトルレジスタ１０５，１０６内に保持され
ているベクトルエレメントを、アドレスレジスタ１０７
内のアドレスに従って順次読み出し、ベクトル演算器１
１３で演算した結果をベクトルレジスタ１１５に格納す
る。ここでは、ベクトルレジスタ１０５，１０６，１１
５の間で、Ａ（ｉ）＋Ｂ（ｉ）＝Ｃ（ｉ）のベクトル演
算を実行しているものとする。FIG. 2 is a block diagram of a conventional information processing apparatus for performing vector operation mask control. In FIG. 2, 10
1 is a register for storing the read address, 103 is an incrementer for incrementing the addresses of the vector register and the vector mask register, 105, 10
Reference numerals 6 and 115 are vector registers, 107 is an address register for storing the incremented current address, 110 to 112 and 216 to 218 are delay latches for matching the address and mask elements with the operation time, and 113 is a vector calculator. , 114 is an end detector for detecting the end address to detect the end of the operation, 20
Reference numeral 9 is a vector mask register. In this vector computer, the vector elements held in the vector registers 105 and 106 are transferred to the address register 107.
Sequential reading according to the address in the vector arithmetic unit 1
The result calculated in 13 is stored in the vector register 115. Here, the vector registers 105, 106, 11
It is assumed that the vector operation of A (i) + B (i) = C (i) is being executed during 5 times.

【０００４】さらに、詳細な動作を述べる。演算起動時
には、ベクトル読み出し用のアドレスレジスタ１０７は
‘０’にイニシャライズされることにより、ベクトルレ
ジスタ１０５，１０６からエレメントの０番目のデ−タ
Ａ（０），Ｂ（０）が読み出される。同時に、ベクトル
マスクレジスタ２０９からも０番目のマスクビットが読
み出される。読み出されたデ−タＡ（０），Ｂ（０）は
ベクトル演算器１１３に供給され、ここで加算が行われ
た後、その演算結果のデ−タＣ（０）がベクトルレジス
タ１１５に書き込まれる。書き込みのアドレスは、アド
レスレジスタ１０７に格納されているアドレスが遅延ラ
ッチ１１０，１１１，１１２を介してベクトルレジスタ
１１５に入力される。すなわち、アドレスレジスタ１０
７に格納されている現在アドレスは、ベクトルレジスタ
１０５，１０６の読み出しアドレスになるとともに、ベ
クトルマスクレジスタ２０９の読み出しアドレスにもな
り、さらに結果ベクトルレジスタ１１５への書き込みア
ドレスにもなる。これらの動作と並行して、アドレスレ
ジスタ１０７内のアドレスは、レジスタ１０１およびイ
ンクリメンタ１０３を介してカウントアップされ、アド
レスレジスタ１０７に再セットされる。このインクリメ
ントされたアドレスにより、次にベクトルレジスタ１０
５，１０６からエレメントの１番目のデ−タＡ（１），
Ｂ（１）が読み出されて、ベクトル演算器１１３に供給
されると同時に、ベクトルマスクレジスタ２０９からも
マスクビットが読み出された後、その演算結果Ｃ（１）
がベクトルレジスタ１１５に書き込まれる。以後、同じ
ような動作が繰り返し実行され、順次、ベクトルレジス
タ１０５，１０６内のエレメントが処理されていき、終
了検出器１１４で最後のエレメントの処理が検出される
と、バス１５０を介して図示省略されたベクトル制御論
理部に演算の終了が通知され、ベクトル演算が終了す
る。Further detailed operation will be described. When the operation is started, the vector read address register 107 is initialized to "0", so that the 0th data A (0), B (0) of the element is read from the vector registers 105, 106. At the same time, the 0th mask bit is also read from the vector mask register 209. The read data A (0) and B (0) are supplied to the vector calculator 113, where after addition is performed, the data C (0) of the calculation result is stored in the vector register 115. Written. As the write address, the address stored in the address register 107 is input to the vector register 115 via the delay latches 110, 111, 112. That is, the address register 10
The current address stored in 7 becomes the read address of the vector registers 105 and 106, the read address of the vector mask register 209, and the write address of the result vector register 115. In parallel with these operations, the address in the address register 107 is counted up via the register 101 and the incrementer 103 and reset in the address register 107. With this incremented address, the vector register 10
5, 106 to the first data A (1) of the element,
B (1) is read and supplied to the vector calculator 113, and at the same time, the mask bit is also read from the vector mask register 209, and then the calculation result C (1)
Are written into the vector register 115. After that, the same operation is repeatedly executed, the elements in the vector registers 105 and 106 are sequentially processed, and when the end detector 114 detects the processing of the last element, the illustration is omitted via the bus 150. The calculated vector control logic unit is notified of the end of the operation, and the vector operation ends.

【０００５】ベクトルマスクレジスタ２０９内のマスク
ビットは、ベクトルレジスタ１０５，１０６内に格納さ
れているベクトルエレメントに１対１に対応しており、
それらのビットの値が‘１’のときには対応するエレメ
ントの演算の実行を許可し、‘０’のときには対応する
エレメントの演算の実行を抑止する。つまり、マスクビ
ット‘１’は演算実行ビットを意味し、‘０’は演算抑
止ビットを意味する。ベクトルマスクレジスタ２０９内
のマスクビットは、アドレスレジスタ１０７によりベク
トルレジスタ１０５，１０６内のエレメントが読み出さ
れると同時に読み出され、遅延ラッチ２１６〜２１８お
よびバス１５１を介してベクトルレジスタ１１５に作用
する。また、これらの読み出しに用いられたアドレスレ
ジスタ１０７内のアドレスも、遅延ラッチ１１０〜１１
２を介してベクトルレジスタ１１５に供給される。遅延
ラッチ１１０〜１１２および２１６〜２１８は、その遅
延時間がベクトル演算器１１３の演算遅延時間と一致す
るように設定されている。従って、ベクトル演算器１１
３から演算結果がベクトルレジスタ１１５に供給された
とき、この演算のためのベクトルエレメントの読み出し
に用いられたアドレスと、このエレメントに対応して読
み出されたマスクビットとが、同時にベクトルレジスタ
１１５に供給されることになる。その結果、読み出し用
のアドレスレジスタ１０７により、ベクトルレジスタ１
０５，１０６内のエレメントと同時に読み出されたベク
トルマスクレジスタ２０９のマスクビットは、そのマス
クビットの値が‘０’のとき、バス１５１を介してその
演算結果のベクトルレジスタ１１５への書き込みを禁止
し、そのマスクビットの値が‘１’のとき、その演算結
果のベクトルレジスタ１１５への書き込みを許可する。The mask bits in the vector mask register 209 have a one-to-one correspondence with the vector elements stored in the vector registers 105 and 106.
When the value of those bits is “1”, the execution of the operation of the corresponding element is permitted, and when the value of these bits is “0”, the execution of the operation of the corresponding element is suppressed. That is, the mask bit "1" means an operation execution bit, and "0" means an operation inhibition bit. The mask bit in the vector mask register 209 is read at the same time when the elements in the vector registers 105 and 106 are read by the address register 107, and acts on the vector register 115 via the delay latches 216 to 218 and the bus 151. Further, the addresses in the address register 107 used for reading these are also the delay latches 110 to 11
2 to the vector register 115. The delay latches 110 to 112 and 216 to 218 are set so that the delay time thereof matches the operation delay time of the vector calculator 113. Therefore, the vector calculator 11
When the calculation result is supplied from 3 to the vector register 115, the address used for reading the vector element for this calculation and the mask bit read corresponding to this element are simultaneously stored in the vector register 115. Will be supplied. As a result, the vector register 1 is read by the read address register 107.
The mask bit of the vector mask register 209, which is read at the same time as the elements in 05 and 106, prohibits the writing of the operation result to the vector register 115 via the bus 151 when the value of the mask bit is "0". Then, when the value of the mask bit is “1”, writing of the operation result to the vector register 115 is permitted.

【０００６】[0006]

【発明が解決しようとする課題】前述の従来のマスク制
御方法では、ベクトルマスクビットが‘０’であって
も、演算が抑止されているエレメントに対して演算実行
ステ−ジが走行し、演算時間が浪費されてしまう。この
ような演算時間の浪費をなくすため、従来、例えば特開
昭５８−２２４４６号公報に記載されたベクトル演算マ
スク制御方法が提案されている。この制御方法では、ベ
クトルマスク内の演算抑止ビット、つまりビットの値が
‘０’のビットを検出すると、一定個数の‘０’が連続
して存在する場合にのみ、その間の演算を飛び越すよう
に制御する。すなわち、例えば５個の‘０’が検出され
ると、アドレスインクリメンタを５個進ませて、５個の
ベクトル演算をスライドさせ、６個目の演算から再開す
る。これにより、演算時間の浪費をなくすことができ
る。このように、従来のマスク制御方法では、いずれの
場合にも、ベクトルマスクビットが‘０’のときの演算
処理自体を抑止することができず、その結果、演算時間
を浪費してしまうという問題があった。また、ベクトル
マスクビットが一定個数だけ‘０’である場合にその演
算を抑止する方法でも、ベクトルマスク中に‘０’が不
連続に存在する場合については配慮されていない。従っ
て、不連続に‘０’が数多く存在する場合については、
無駄な演算が実行されてしまうため、演算時間が浪費さ
れるという問題がある。本発明の目的は、このような従
来の課題を解決し、ベクトルマスクレジスタ内の演算抑
止ビットが不連続に存在する場合でも、無効な演算を行
うことなく、有効な演算のみを実行して、ベクトル演算
を高速に処理することが可能なベクトル演算マスク制御
装置および制御方法を提供することにある。In the conventional mask control method described above, even if the vector mask bit is "0", the operation execution stage runs for the element in which the operation is inhibited, and the operation is executed. Time is wasted. In order to eliminate such waste of calculation time, a vector calculation mask control method disclosed in, for example, Japanese Patent Application Laid-Open No. 58-22446 has been proposed. In this control method, when an operation suppression bit in the vector mask, that is, a bit whose value is' 0 ', is detected, only when a certain number of'0's exist continuously, the operation between them is skipped. Control. That is, for example, when five "0" s are detected, the address incrementer is advanced by five, five vector operations are slid, and the sixth operation is restarted. As a result, it is possible to eliminate waste of calculation time. As described above, the conventional mask control method cannot suppress the arithmetic processing itself when the vector mask bit is “0” in any case, and as a result, the arithmetic time is wasted. was there. Further, even in the method of suppressing the operation when a certain number of vector mask bits are "0", no consideration is given to the case where "0" are discontinuous in the vector mask. Therefore, when there are many discontinuous'0's,
There is a problem that the calculation time is wasted because unnecessary calculation is executed. An object of the present invention is to solve such a conventional problem and execute valid operations only without performing invalid operations even when operation suppression bits in a vector mask register are discontinuous. It is an object of the present invention to provide a vector operation mask control device and a control method capable of processing a vector operation at high speed.

【０００７】[0007]

【課題を解決するための手段】上記目的を達成するた
め、本発明のベクトル演算マスク制御装置は、（イ）複
数個の要素からなるベクトルデ−タおよびベクトルマス
クデ−タを順次読み出し、ベクトルマスクデ−タの要素
値に従ってベクトルデ−タの演算を行うベクトル命令制
御装置において、命令の実行ステ−ジに先立って、命令
で指定されるオペランドベクトルデ−タの要素と要素の
番号を格納するバッファ装置、およびベクトルマスクデ
−タの要素値が演算の実行を指示しているオペランドベ
クトルデ−タの要素と要素の番号のみをバッファ装置に
順次格納するためのアドレスを指定する第１のアドレス
レジスタと第１のアドレスインクリメンタを有すること
を特徴としている。また、（ロ）バッファ装置に格納さ
れたオペランドベクトルデ−タを読み出すための第２の
アドレスレジスタと第２のインクリメンタを設け、読み
出されたオペランドベクトルデ−タを用いて演算を実行
することも特徴としている。また、（ハ）バッファ装置
から読み出されたオペランドベクトルデ−タを用いて演
算を実行した結果を、バッファ装置から読み出された要
素番号に従って、ベクトルレジスタに書き込むことも特
徴としている。さらに、本発明のベクトル命令制御方法
は、（ニ）複数個の要素からなるベクトルデ−タおよび
ベクトルマスクデ−タを順次読み出し、該ベクトルマス
クデ−タの要素値に従って該ベクトルデ−タの演算を行
うベクトル命令制御方法において、前の命令に対し、バ
ッファ装置からオペランドベクトルデ−タと要素番号を
順次読み出し、演算を実行して、演算の結果を要素番号
に従って結果レジスタに格納する命令実行ステ−ジを実
行している期間に、次の命令に対し、命令で指定される
オペランドベクトルデ−タのうち、ベクトルマスクデ−
タの要素値が演算の実行を指示しているオペランドベク
トルデ−タの要素と要素の番号のみをバッファ装置に格
納する命令先行ステ−ジを並行して実行することを特徴
としている。In order to achieve the above object, a vector operation mask control device of the present invention is provided with (a) vector data consisting of a plurality of elements and vector mask data are sequentially read out to obtain a vector mask. In a vector instruction control device for calculating vector data according to the element value of the data, a buffer for storing the element of the operand vector data designated by the instruction and the element number prior to the execution step of the instruction. A device and a first address register for designating an address for sequentially storing only an element and an element number of an operand vector data in which an element value of vector mask data instructs execution of an operation And a first address incrementer. Further, (b) a second address register and a second incrementer for reading the operand vector data stored in the buffer device are provided, and an operation is executed using the read operand vector data. It is also characterized. Further, (c) the result of executing the operation using the operand vector data read from the buffer device is written in the vector register according to the element number read from the buffer device. Furthermore, the vector instruction control method of the present invention (d) sequentially reads vector data consisting of a plurality of elements and vector mask data, and calculates the vector data according to the element values of the vector mask data. In the vector instruction control method to be performed, an instruction execution step for sequentially reading the operand vector data and the element number from the buffer device for the previous instruction, executing the operation, and storing the result of the operation in the result register according to the element number. Of the operand vector data specified by the instruction, the vector mask data
It is characterized in that an instruction preceding stage for storing only the element and the element number of the operand vector data in which the element value of the data instructs the execution of the operation in parallel is executed.

【０００８】[0008]

【作用】本発明においては、ベクトル命令の実行ステ−
ジに先立って、その命令で指定されているオペランドベ
クトルデ−タのうち、ベクトルマスクデ−タの要素値が
演算の実行を指示しているベクトルデ−タの要素と要素
番号のみを、バッファ装置に格納する。すなわち、マス
ク付き演算で無効な演算を先行制御により排除する。次
に、命令の実行ステ−ジでは、そのバッファ装置から順
次ベクトルデ−タと要素番号とを読み出し、演算を実行
した後、その演算結果を要素番号に従ってベクトルレジ
スタに書き込む。これにより、有効なベクトル演算のみ
を実行させることができるので、ベクトル演算を高速に
処理することが可能となる。すなわち、図３に示すよう
に、１マシンサイクルをＴｃとし、そのうちベクトルマ
スクビットが‘０’に対応するエレメントの演算を行う
無効時間をＴαとすると、従来の方法における１つのベ
クトル命令（ｎ個のエレメント演算）を実行する演算時
間はｎＴｃであるのに対して、本発明における演算時間
は、無効時間Ｔαをｎ個なくすことができるので、ｎ
（Ｔｃ−Ｔα）となる。従って、複数個のベクトル命令
を実行する場合には、従来の方法に比べて演算時間を格
段に短縮することができる。In the present invention, the vector instruction execution step is executed.
Of the operand vector data specified by the instruction, only the element and the element number of the vector data for which the element value of the vector mask data instructs the execution of the operation prior to To store. That is, an operation invalid in the operation with the mask is excluded by the preceding control. Next, in the instruction execution stage, the vector data and the element number are sequentially read from the buffer device, the operation is executed, and the operation result is written in the vector register according to the element number. As a result, only valid vector operations can be executed, so that vector operations can be processed at high speed. That is, as shown in FIG. 3, assuming that one machine cycle is Tc, and an invalid time for performing an operation of an element whose vector mask bit is '0' is Tα, one vector instruction (n The calculation time for executing the element calculation) is nTc, while the calculation time in the present invention can eliminate n invalid times Tα.
(Tc-Tα). Therefore, when executing a plurality of vector instructions, the operation time can be significantly shortened as compared with the conventional method.

【０００９】[0009]

【実施例】以下、本発明の実施例を、図面により詳細に
説明する。図１は、本発明の一実施例を示すベクトル命
令制御装置の構成図である。図１において、１は解読指
示、演算指示、レジスタ読み出し、書き込み指示等の命
令を順次出力する命令制御部、２は命令レジスタ群を格
納するメモリ、３ａ，３ｂ，３ｃ，１０３ａ，３０３
ａ，４０３ａはそれぞれ読み出されたデ−タ、あるいは
アドレスを格納するレジスタ、１０３，３０３，４０３
はそれぞれアドレスをインクリメントするインクリメン
タ、５００は先行制御により有効デ−タのみを格納する
バッファ装置、５００ａ，５００ｂ，５００ｃはバッフ
ァ装置５００内の構成要素であって、それぞれベクトル
デ−タＡ、ベクトルデ−タＢ、およびベクトルデ−タＩ
Ｄを格納する領域である。また、１０５，１０６，１１
５はベクトルレジスタ、１１３はベクトル演算器、５，
６はそれぞれ命令の先行ステ−ジおよび命令の実行ステ
−ジの終了を検知する終了検出回路、２０９はベクトル
マスクビットを格納するベクトルマスクレジスタであ
る。ベクトルレジスタ１０５，１０６、ベクトルマスク
レジスタ２０９、およびインクリメンタ３０３より上方
が命令の先行ステ−ジ部であり、バッファ装置５００、
レジスタ４０３より下方が命令の実行ステ−ジ部であ
る。本発明で新たに設置された部分は、バッファ装置５
００と、インクリメンタ３０３，４０３と、レジスタ３
０３ａ，４０３ａと、終了検出回路５である。すなわ
ち、本発明においては、従来のベクトル演算ステ−ジに
加えて、命令の先行ステ−ジを設けたため、先行ステ−
ジで有効エレメントのみを格納するためのバッファ装置
５００と、そのバッファ装置５００にデ−タを書き込む
ためのインクリメンタ３０３、レジスタ３０３ａと、バ
ッファ装置５００からエレメントを読み出して演算器１
１３に供給し、かつ演算結果を書き込むためのインクリ
メンタ４０３、レジスタ４０３ａと、命令の先行ステ−
ジの終了を検出するための検出回路５とが、余分に必要
となる。Embodiments of the present invention will now be described in detail with reference to the drawings. FIG. 1 is a block diagram of a vector instruction control device showing an embodiment of the present invention. In FIG. 1, reference numeral 1 is an instruction control unit for sequentially outputting instructions such as a decoding instruction, a calculation instruction, a register read instruction, and a write instruction, and 2 is a memory for storing an instruction register group, 3a, 3b, 3c, 103a, 303.
a and 403a are registers for storing read data or addresses, and 103, 303, and 403, respectively.
Is an incrementer for incrementing the address, 500 is a buffer device for storing only valid data by the preceding control, and 500a, 500b, 500c are constituent elements in the buffer device 500, which are vector data A and vector data, respectively. Data B and vector data I
This is an area for storing D. Also, 105, 106, 11
5 is a vector register, 113 is a vector calculator, 5,
Reference numeral 6 is an end detection circuit for detecting the end of the instruction preceding stage and the instruction execution stage, respectively, and 209 is a vector mask register for storing vector mask bits. The portion above the vector registers 105 and 106, the vector mask register 209, and the incrementer 303 is the preceding stage portion of the instruction, and the buffer device 500,
Below the register 403 is an instruction execution stage section. The portion newly installed in the present invention is the buffer device 5
00, the incrementers 303 and 403, and the register 3
03a and 403a, and the end detection circuit 5. That is, in the present invention, since the instruction preceding stage is provided in addition to the conventional vector operation stage, the preceding stage is provided.
Buffer device 500 for storing only valid elements, an incrementer 303 for writing data to the buffer device 500, a register 303a, an element from the buffer device 500, and an arithmetic unit 1
13 and the incrementer 403 and the register 403a for writing the operation result and the instruction preceding step.
The detection circuit 5 for detecting the end of the error is additionally required.

【００１０】命令レジスタ群格納メモリ２には、命令
Ａ，Ｂ，Ｃが順にセットされており、それぞれＯＰ
（Ａ）（Ｂ）（Ｃ）は命令部、ＶＲ３はオペランド
（３）のベクトルレジスタ番号を格納するフィ−ルド
値、ＶＲ１は演算結果であるオペランド（１）のベクト
ルレジスタ番号を格納するフィ−ルド値、ＶＲ２はオペ
ランド（２）のベクトルレジスタ番号を格納するフィ−
ルド値である。先ず、命令の先行ステ−ジの動作を述べ
る。命令制御部１からの解読指示が信号線１ａを介して
出されると、順に命令が解読される。最初に命令Ａが解
読され、オペランド（２）のベクトルレジスタ番号であ
るフィ−ルドＶＲ２の値が信号線２ｂに読み出され、レ
ジスタ３ｂにセットされる。同じようにして、オペラン
ド（３）のベクトルレジスタ番号であるフィ−ルドＶＲ
３の値が信号線２ａに読み出され、レジスタ３ａにセッ
トされる。同じく、演算結果であるオペランド（１）の
ベクトルレジスタ番号であるＶＲ１の値が信号線２ｃに
読み出され、レジスタ３ｃにセットされる。これと並行
して、ベクトルレジスタ１０５，１０６の読み出し要素
アドレスを示すレジスタ１０３ａと、バッファ装置５０
０の書き込み要素アドレスを示すレジスタ３０３ａの
‘０’をイニシャライズする。これにより、ベクトルレ
ジスタ１０５，１０６およびベクトルマスクレジスタ２
０９から０番目の要素Ａ（０），Ｂ（０），Ｍ（０）が
読み出される。読み出された要素のＡ（０），Ｂ（０）
は、バッファ装置５００のベクトルデ−タＡ部５００
ａ、Ｂ部５００ｂに接続される。また、レジスタ１０３
ａとレジスタ３ｃの値は、バッファ装置５００のベクト
ルデ−タＩＤ部５００ｃに接続される。すなわち、この
ベクトルデ−タＩＤ部５００ｃに格納される内容は、Ｖ
Ｒ１とレジスタ１０３ａに格納されている有効な番号で
ある。Instructions A, B, and C are sequentially set in the instruction register group storage memory 2, each of which has an OP.
(A), (B) and (C) are instruction parts, VR3 is a field value for storing the vector register number of the operand (3), and VR1 is a field for storing the vector register number of the operand (1) which is the operation result. Field, VR2 is a field that stores the vector register number of operand (2).
Value. First, the operation of the preceding stage of the instruction will be described. When a decoding instruction from the instruction control unit 1 is issued via the signal line 1a, the instructions are sequentially decoded. First, the instruction A is decoded, the value of the field VR2 which is the vector register number of the operand (2) is read out to the signal line 2b and set in the register 3b. Similarly, the field VR which is the vector register number of the operand (3)
The value of 3 is read out to the signal line 2a and set in the register 3a. Similarly, the value of the vector register number VR1 of the operand (1) which is the operation result is read out to the signal line 2c and set in the register 3c. In parallel with this, the register 103 a indicating the read element address of the vector registers 105 and 106, and the buffer device 50.
Initialize '0' of the register 303a indicating the write element address of 0. As a result, the vector registers 105 and 106 and the vector mask register 2
The 0th elements A (0), B (0), and M (0) are read out from 09. A (0), B (0) of the read element
Is the vector data A section 500 of the buffer device 500.
a, B part 500b. In addition, the register 103
The values of a and the register 3c are connected to the vector data ID section 500c of the buffer device 500. That is, the contents stored in the vector data ID portion 500c is V
It is a valid number stored in R1 and the register 103a.

【００１１】ベクトルマスクレジスタ２０９内の要素Ｍ
（０）は、信号線２０９ａを介してバッファ装置５００
に接続される。ここでは、信号線２０９ａの値が‘１’
のときバッファ装置５００への格納指示として働く。ま
た、信号線２０９ａは、レジスタ３０３ａの更新許可信
号としても動作する。この後、ベクトルレジスタ１０
５，１０６の読み出し要素アドレスを示すレジスタ１０
３ａをインクリメンタ１０３により‘＋１’して、次要
素であるＶＲ３（１）、ＶＲ２（１）、ＶＭＲ（１）を
読み出す。以降、前述の動作を繰り返し実行することに
より、演算を実行すべきベクトルデ−タの要素のみをバ
ッファ装置５００内に順次格納する。命令制御部１は、
終了検出回路５が動作することにより、最後のベクトル
要素アドレスが検出されたことを知り、処理を終了させ
る。インクリメンタ３０３は、ベクトルマスクレジスタ
２０９中の要素値が演算の実行を指示しているベクトル
デ−タの要素と要素番号のみをバッファ装置５００に順
次格納するため、信号線２０９ａの値が‘１’のときの
みレジスタ３０９ａのアドレス値をバッファ装置５００
に書き込み、‘０’のときにはインクリメンタ３０３で
‘＋１’して次要素のアドレス値に順次、インクリメン
トする。以上、命令先行ステ−ジでは、命令の解読動作
から、演算を実行すべきベクトルデ−タの要素のみをバ
ッファ装置５００内に順次格納する処理を実行する。次
に、命令制御部１は、先行ステ−ジを終了した命令を順
次、命令実行ステ−ジに進める。命令実行ステ−ジで
は、先ず、バッファ装置５００の読み出し要素アドレス
を示すレジスタ４０３ａを‘０’にイニシャライズす
る。Element M in vector mask register 209
(0) is the buffer device 500 via the signal line 209a.
Connected to. Here, the value of the signal line 209a is "1".
At this time, it works as a storage instruction to the buffer device 500. The signal line 209a also operates as an update permission signal for the register 303a. After this, the vector register 10
Register 10 indicating the read element address of 5,106
3a is incremented by 1 by the incrementer 103, and the next elements VR3 (1), VR2 (1), and VMR (1) are read. After that, by repeatedly executing the above-mentioned operation, only the elements of the vector data for which the calculation is to be executed are sequentially stored in the buffer device 500. The instruction control unit 1
By the operation of the end detection circuit 5, it is known that the last vector element address is detected, and the processing is ended. Since the incrementer 303 sequentially stores only the element and the element number of the vector data whose element value in the vector mask register 209 instructs the execution of the operation in the buffer device 500, the value of the signal line 209a is "1". The address value of the register 309a only when
When the value is "0", the incrementer 303 increments it by "+1" to sequentially increment the address value of the next element. As described above, in the instruction precedent stage, the process of sequentially storing only the elements of the vector data to be operated on from the instruction decoding operation is executed in the buffer device 500. Next, the instruction control unit 1 sequentially advances the instructions for which the preceding stage has been completed to the instruction execution stage. In the instruction execution stage, first, the register 403a indicating the read element address of the buffer device 500 is initialized to "0".

【００１２】レジスタ４０３ａをイニシャライズするこ
とにより、バッファ装置５００内の構成要素であるベク
トルデ−タＡ部５００ａ、ベクトルデ−タＢ部５００ｂ
より演算のオペランドデ−タであるＡ（０）、Ｂ
（０）、ベクトルデ−タＩＤ部５００ｃより演算結果の
ベクトルレジスタへの格納アドレスＡＤＲ（０）が読み
出される。デ−タＡ（０）、デ−タＢ（０）は、ベクト
ル演算器１１３に供給される。演算器１１３による演算
結果は、ＡＤＲ（０）で示されるベクトルレジスタ１１
５に格納される。この後、レジスタ４０３ａはインクリ
メンタ４０３により‘＋１’される。以後、前述の動作
を繰り返し実行することにより、ベクトルマスクデ−タ
が演算を指示している要素のみの演算を実行することが
できる。命令制御部１は、終了検出回路６で最後のベク
トル要素アドレスを検出したとき、命令の実行処理を終
了させる。この命令Ａの命令実行ステ−ジと並行して、
命令Ｂの先行ステ−ジを開始することにより、命令実行
ステ−ジでは、順次、ベクトルマスクデ−タが演算を指
示している要素のみの演算を実行することが可能にな
る。By initializing the register 403a, the vector data A section 500a and the vector data B section 500b, which are constituent elements in the buffer device 500, can be obtained.
More operation operand data A (0), B
(0), the storage address ADR (0) of the calculation result in the vector register is read from the vector data ID section 500c. The data A (0) and the data B (0) are supplied to the vector calculator 113. The calculation result by the calculator 113 is the vector register 11 indicated by ADR (0).
Stored in 5. After that, the register 403a is incremented by "+1" by the incrementer 403. After that, by repeatedly executing the above-described operation, it is possible to execute the operation only for the element for which the vector mask data instructs the operation. When the end detection circuit 6 detects the last vector element address, the instruction control unit 1 ends the instruction execution process. In parallel with the instruction execution stage of this instruction A,
By starting the preceding stage of the instruction B, in the instruction execution stage, it becomes possible to successively execute only the elements for which the vector mask data instructs the operation.

【００１３】なお、メモリ２に格納される命令レジスタ
群としては、例えば東京都の住民台帳のデ−タベ−スを
そのままオペランドＶＲ２，ＶＲ３，ＶＲ１として格納
する。ここでは、氏名、性別、職業、世帯主か否か等を
ベクトルデ−タとし、演算部としては不等式、加算、減
算等にする。一方、ベクトルマスクレジスタ２０９に格
納されるマスクビットとしては、住民の中の女性のみの
デ−タベ−スを作成するために、先ずベクトルマスクビ
ットを作るための命令を発行して、元のデ−タベ−ス中
のデ−タ部分を女性に指定し、比較した結果をベクトル
マスクで比較して、一致した箇所に‘１’を格納するこ
とにより作成する。このようにして作成したベクトルマ
スクレジスタ２０９と、住民台帳のデ−タベ−スをベク
トル命令レジスタ群にすることにより、ベクトルマスク
レジスタ２０９のマスクビットが‘１’に対応するベク
トルデ−タのみを演算するので、高速に女性のみの住民
台帳のデ−タベ−スを作成することができる。As the instruction register group stored in the memory 2, for example, the database of the resident register in Tokyo is directly stored as the operands VR2, VR3 and VR1. Here, the name, sex, occupation, whether or not a householder is the vector data, and the arithmetic unit is an inequality, addition, subtraction, or the like. On the other hand, as a mask bit stored in the vector mask register 209, in order to create a database of only women among residents, an instruction for creating a vector mask bit is first issued and the original data is created. -It is created by designating the data portion in the database as a female, comparing the comparison results with a vector mask, and storing "1" in the coincident portion. By making the vector mask register 209 thus created and the data base of the resident register into a vector instruction register group, only the vector data in which the mask bit of the vector mask register 209 corresponds to "1" is calculated. Therefore, it is possible to quickly create a database of resident register for women only.

【００１４】[0014]

【発明の効果】以上説明したように、本発明によれば、
ベクトル演算を行うデ−タ処理装置において、ベクトル
マスクの値がどのように配列されていても、無効な演算
を行うことなく、有効な要素のみの演算を行うので、ベ
クトル演算を高速に実行することができる。As described above, according to the present invention,
In a data processing device that performs vector operations, no matter how the values of the vector mask are arranged, only valid elements are operated without performing invalid operations, so vector operations are executed at high speed. be able to.

[Brief description of drawings]

【図１】本発明の一実施例を示すベクトル命令制御装置
のブロック構成図である。FIG. 1 is a block configuration diagram of a vector instruction control device showing an embodiment of the present invention.

【図２】従来のベクトル命令制御装置のブロック構成図
である。FIG. 2 is a block diagram of a conventional vector instruction control device.

【図３】本発明のベクトル命令制御方法の効果を説明す
る図である。FIG. 3 is a diagram for explaining the effect of the vector instruction control method of the present invention.

[Explanation of symbols]

１命令制御部２命令レジスタ群３ａ，３ｂ，３ｃ，１０１，１０３ａ，４０３ａレジ
スタ１１０〜１１２，２１６〜２１８遅延ラッチ１０３，３０３，４０３インクリメンタ５００バッファ装置５００ａ，５００ｂ，５００ｃバッファ装置の構成要
素１０５，１０６，１１５ベクトルレジスタ１１３ベクトル演算器５，６終了検出回路２０９ベクトルマスクレジスタ1 Instruction Control Unit 2 Instruction Register Group 3a, 3b, 3c, 101, 103a, 403a Registers 110-112, 216-218 Delay Latch 103, 303, 403 Incrementer 500 Buffer Device 500a, 500b, 500c Components of Buffer Device 105 , 106, 115 Vector register 113 Vector calculator 5, 6 End detection circuit 209 Vector mask register

Claims

[Claims]

1. A vector instruction control device for sequentially reading vector data and vector mask data consisting of a plurality of elements and performing an operation of the vector data according to an element value of the vector mask data, Prior to the execution stage, an element of the operand vector data designated by the instruction and a buffer device for storing the number of the element, and an element value of the vector mask data indicate execution of the operation. Vector instruction control characterized by having a first address register and a first address incrementer for designating an address for sequentially storing only an element of operand vector data and the number of the element in the buffer device. apparatus.

2. The vector instruction control device according to claim 1, further comprising a second address register and a second incrementer for reading the operand vector data stored in the buffer device, and reading the operand vector data. And a vector instruction control device for executing an operation using the operand vector data.

3. The vector instruction control device according to claim 1, wherein a result obtained by executing an operation using the operand vector data read from the buffer device is an element number read from the buffer device. A vector instruction controller characterized by writing to a vector register according to the above.

4. A vector instruction control method for sequentially reading vector data composed of a plurality of elements and vector mask data, and calculating the vector data according to the element value of the vector mask data, A period during which an instruction execution stage is executed in which the operand vector data and the element number are sequentially read from the buffer device for the instruction, the operation is executed, and the result of the operation is stored in the result register according to the element number. For the next instruction, among the operand vector data designated by the instruction, the element value of the operand vector data whose element value of the vector mask data instructs execution of the operation and the element of the element A vector instruction control method characterized in that an instruction preceding stage for storing only numbers in a buffer device is executed in parallel.