JP6794854B2

JP6794854B2 - Arithmetic processing unit and control method of arithmetic processing unit

Info

Publication number: JP6794854B2
Application number: JP2017017668A
Authority: JP
Inventors: 仁 ▲高▼橋
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2017-02-02
Filing date: 2017-02-02
Publication date: 2020-12-02
Anticipated expiration: 2037-02-02
Also published as: US20180217962A1; JP2018124867A

Description

本発明は、演算処理装置及び演算処理装置の制御方法に関する。 The present invention relates to an arithmetic processing unit and a control method for the arithmetic processing unit.

演算処理装置に用いられるＧＰＵ（Graphic Processing Unit）は、元々は画像処理用のプロセッサであるが、多数の和積演算器を備えることにより行列計算に最適化されているため、機械学習用の処理を行うプロセッサとしても用いられることが多い。そして、深層学習（ディープラーニング）を行う処理においても、ＧＰＵが用いられることが一般的である。 The GPU (Graphic Processing Unit) used in the arithmetic processing unit is originally a processor for image processing, but since it is optimized for matrix calculation by having a large number of sum-product arithmetic units, it is a process for machine learning. It is often used as a processor that performs. The GPU is also generally used in the process of performing deep learning.

深層学習では、ニューラルネットワークを用いて処理が行われることが多い。例えば、画像認識の深層学習の場合、与えられた画像が何か判断するフォワード処理及び判断するためのニューラルネットワークのパラメータを更新するためのバックワード処理の２つの処理を有する。深層学習を行う演算処理装置は、フォワード処理での計算結果と期待値との差分を用いてバックワード処理を行い、ニューラルネットワークのパラメータを更新する。そして、演算処理装置は、更新したパラメータを用いてフォワード処理の精度を向上させる。 In deep learning, processing is often performed using a neural network. For example, in the case of deep learning of image recognition, it has two processes, a forward process for determining what a given image is and a backward process for updating the parameters of the neural network for determining. The arithmetic processing unit that performs deep learning performs backward processing using the difference between the calculation result in the forward processing and the expected value, and updates the parameters of the neural network. Then, the arithmetic processing unit improves the accuracy of the forward processing by using the updated parameters.

ニューラルネットワークは複数の層で構成される場合がある。フォワード処理が行われる順伝播では、入力データに対して各層で特徴量の抽出などの演算処理が行われ出力結果となる。そして、バックワード処理が行われる逆伝播では、それぞれの層において、順伝播の結果と期待値との差分を用いて各パラメータを更新する学習が順伝播と逆方向に繰り返される。このように、ニューラルネットワークは、それぞれの層で実施される異なる演算処理が行われる多層の構造を有する。このような構造を有することから、層毎のパラメータの更新を行うために、後の層の計算結果と期待値との差分を求め、その差分を１つ前の層に、その層の差分計算の結果をさらに１つ前の層に伝搬しながら学習が行われる。ここでの説明における１つ前及び１つ先は、順伝播の方向を基準とする。 A neural network may consist of multiple layers. In the forward propagation in which the forward processing is performed, arithmetic processing such as extraction of features is performed on the input data in each layer, and the output result is obtained. Then, in the back propagation in which the backward processing is performed, learning to update each parameter using the difference between the result of the forward propagation and the expected value is repeated in the opposite direction to the forward propagation in each layer. As described above, the neural network has a multi-layer structure in which different arithmetic processes performed in each layer are performed. Since it has such a structure, in order to update the parameters for each layer, the difference between the calculation result of the subsequent layer and the expected value is obtained, and the difference is set to the previous layer and the difference calculation of that layer is performed. Learning is performed while propagating the result of the above to the previous layer. The one before and one ahead in the explanation here are based on the direction of forward propagation.

さらに、深層学習の中で主に画像認識で用いられる演算処理として、畳み込みニューラルネットワークという処理がある。畳み込みニューラルネットワークでは、畳み込み（convolution）と呼ばれる演算が多用される。以下では、「畳込演算」という。例えば、画像認識を行う場合、入力画像上の領域に予め決められたパラメータを各要素として有するフィルタを元画像に配置する。順伝播における入力側はボトムと呼び、出力側はトップと呼ぶ。逆伝播においても位置関係は変わらず出力側をボトムと呼び、入力側をトップと呼ぶ。元画像を含む順伝播方向のときの各層の入力データは、「ボトムデータ」と呼ぶ。深層学習の画像認識において畳込演算を行う場合、入力データは、ビットマップ形式となっており、データを順番に並べて積んでいくと見た目の画像と同じになる。また、入力データを構成する各要素データは、グレースケールの場合であれば濃淡を表し、ＲＧＢ（Read Green Blue）であれば３色分のデータを表す。また、フィルタは、「重みデータ」と呼ばれる。 Further, as an arithmetic process mainly used in image recognition in deep learning, there is a process called a convolutional neural network. In a convolutional neural network, an operation called convolution is often used. Hereinafter, it is referred to as "convolution operation". For example, in the case of image recognition, a filter having predetermined parameters as each element is arranged in the area on the input image. In forward propagation, the input side is called the bottom and the output side is called the top. Even in back propagation, the positional relationship does not change, and the output side is called the bottom and the input side is called the top. The input data of each layer in the forward propagation direction including the original image is called "bottom data". When the convolution operation is performed in the image recognition of deep learning, the input data is in a bitmap format, and when the data are arranged in order and stacked, the input data becomes the same as the apparent image. Further, each element data constituting the input data represents a shade in the case of gray scale, and represents data for three colors in the case of RGB (Read Green Blue). The filter is also called "weight data".

そして、フィルタが配置された入力データの各要素と、フィルタの各要素とを乗算したものを合計することで、入力データにおけるフィルタが配置された領域の特徴量を算出する。この元画像へのフィルタの配置を予め決められたフィルタの移動幅を用いて入力データ全体に行い、算出した特徴量をまとめたものが、畳込演算の結果として出力される出力データとなる。このフォワード処理における畳込演算の結果である出力データは、「トップデータ」と呼ぶ。 Then, by multiplying each element of the input data in which the filter is arranged and each element of the filter, the feature amount of the area in which the filter is arranged in the input data is calculated. The arrangement of the filter on the original image is performed on the entire input data using the predetermined movement width of the filter, and the calculated feature amount is summarized as the output data output as the result of the convolution operation. The output data that is the result of the convolution operation in this forward processing is called "top data".

バックワード処理における畳込演算には、２つの演算が存在する。１つは、フォワード処理の計算結果であるトップデータと期待値との差分と、元画像とを用いて差分パラメータを算出する演算である。フォワード処理の計算結果であるトップデータと期待値との差分は、「トップ差分データ」と呼ばれる。また、算出される差分パラメータは、「重み差分データ」とよばれる。この重み差分データは、重みデータを更新してフォワード処理における計算精度を上げるために用いられる。もう１つは、トップ差分データと重みデータとを使用して、１つ前のバックワード処理の演算用の差分を算出する演算である。１つ前のバックワード処理の演算用の差分は、「ボトム差分データ」と呼ばれる。このボトム差分データが、１つ前の層におけるトップ差分データとして用いられる。 There are two operations in the convolution operation in backward processing. One is an operation of calculating the difference parameter using the difference between the top data and the expected value, which is the calculation result of the forward processing, and the original image. The difference between the top data and the expected value, which is the calculation result of the forward processing, is called "top difference data". Further, the calculated difference parameter is called "weight difference data". This weight difference data is used to update the weight data to improve the calculation accuracy in the forward processing. The other is an operation of calculating the difference for the operation of the previous backward processing by using the top difference data and the weight data. The difference for the calculation of the previous backward processing is called "bottom difference data". This bottom difference data is used as the top difference data in the previous layer.

特開２０１１−１１３１６８号公報Japanese Unexamined Patent Publication No. 2011-13168

しかしながら、畳込演算の総演算数は、以下のように計算できる。例えば、ボトムデータの要素データの数がＣ’×Ｃ’であり、ボトムデータの数がＮ個あり、重みデータの要素データの数がＫ×Ｋであり、トップ差分データの要素数がＣ×Ｃであり、トップデータの数をＰの場合を考える。さらに、フォワード処理における１つの畳込演算が１つの乗算と１つの加算であるとする。この場合、フォワード処理の総演算数は、Ｐ×Ｃ×Ｃ×Ｎ×Ｋ×Ｋ×２となる。例えば、Ｃ＝１３、Ｎ＝２５６、Ｋ＝３、Ｃ＝１３及びＰ＝２５６の場合、フォワード処理における総演算数は、２５６×１３×１３×２５６×３×３×２＝１９９０３６０５１２である。ここで、重みデータのサイズが大きい場合などでは、高速フーリエ変換（ＦＦＴ：Fast Fourier Transform）による高速化手法が有効であるが、その条件を満たさない場合、ＦＦＴによる演算拘束かの効果を得ることは困難である。そのため、特定の条件に縛られない畳込演算において、画像認識精度制度を維持しつつ演算数を軽減させることは困難である。 However, the total number of convolution operations can be calculated as follows. For example, the number of element data of bottom data is C'× C', the number of bottom data is N, the number of element data of weight data is K × K, and the number of elements of top difference data is C ×. Consider the case where C is and the number of top data is P. Further, it is assumed that one convolution operation in the forward processing is one multiplication and one addition. In this case, the total number of operations in the forward processing is P × C × C × N × K × K × 2. For example, in the case of C = 13, N = 256, K = 3, C = 13 and P = 256, the total number of operations in the forward processing is 256 × 13 × 13 × 256 × 3 × 3 × 2 = 1990360512. Here, when the size of the weight data is large, the speed-up method by Fast Fourier Transform (FFT) is effective, but when the condition is not satisfied, the effect of calculation constraint by FFT is obtained. It is difficult. Therefore, it is difficult to reduce the number of operations while maintaining the image recognition accuracy system in the convolution operation that is not bound by a specific condition.

開示の技術は、上記に鑑みてなされたものであって、画像認識精度制度を維持しつつ演算数を軽減させる演算処理装置及び演算処理装置の制御方法を提供することを目的とする。 The disclosed technique has been made in view of the above, and an object of the present invention is to provide an arithmetic processing unit and a control method of the arithmetic processing unit that reduce the number of arithmetic operations while maintaining an image recognition accuracy system.

本願の開示する演算処理装置及び演算処理装置の制御方法の一つの態様において、記憶部は、行列を形成する要素データを有する第１データ及び行列を形成する要素データから所定数の要素データを除いた配置形状を有する第２データを記憶する。変換部は、前記第２データの配置形状を基に前記第１データを変換する。畳込演算部は、前記変換部により変換された前記第１データに対して前記第２データをフィルタとして用いて畳み込み演算を行う。 In one embodiment of the arithmetic processing apparatus and the control method of the arithmetic processing apparatus disclosed in the present application, the storage unit excludes a predetermined number of element data from the first data having the element data forming the matrix and the element data forming the matrix. The second data having the arranged shape is stored. The conversion unit converts the first data based on the arrangement shape of the second data. The convolution calculation unit performs a convolution calculation on the first data converted by the conversion unit using the second data as a filter.

１つの側面では、本発明は、画像認識精度制度を維持しつつ演算数を軽減させることができる。 In one aspect, the present invention can reduce the number of operations while maintaining the image recognition accuracy system.

図１は、畳み込みニューラルネットにおける処理の全体的な流れを説明するための図である。FIG. 1 is a diagram for explaining the overall flow of processing in a convolutional neural network. 図２は、フォワード畳込演算及びバックワード畳込演算を説明するための図である。FIG. 2 is a diagram for explaining a forward convolution operation and a backward convolution operation. 図３は、演算処理層の詳細を表すブロック図である。FIG. 3 is a block diagram showing details of the arithmetic processing layer. 図４は、実施例１に係るフォワード畳込演算を行う畳込演算部の詳細を表すブロック図である。FIG. 4 is a block diagram showing details of a convolution calculation unit that performs a forward convolution operation according to the first embodiment. 図５は、フィルタ定義の一例を示す図である。FIG. 5 is a diagram showing an example of a filter definition. 図６は、ボトムデータの変換の一例を説明するための図である。FIG. 6 is a diagram for explaining an example of conversion of bottom data. 図７は、変換後のボトムデータの見た目を表す図である。FIG. 7 is a diagram showing the appearance of the bottom data after conversion. 図８は、ボトムデータの変換の一例を表す図である。FIG. 8 is a diagram showing an example of conversion of bottom data. 図９は、ボトムデータの変換の他の例を表す図である。FIG. 9 is a diagram showing another example of bottom data conversion. 図１０は、新フィルタ定義を用いる場合のフォワード畳込演算を説明するための図である。FIG. 10 is a diagram for explaining a forward convolution operation when the new filter definition is used. 図１１は、新フィルタ定義を用いる場合のバックワード畳込ボトム差分演算を説明するための図である。FIG. 11 is a diagram for explaining the backward convolution bottom difference operation when the new filter definition is used. 図１２は、新フィルタ定義を用いる場合のバックワード畳込重み差分演算を説明するための図である。FIG. 12 is a diagram for explaining the backward convolution weight difference calculation when the new filter definition is used. 図１３は、新フィルタ定義を使用する場合の演算処理層における処理のフローチャートである。FIG. 13 is a flowchart of processing in the arithmetic processing layer when the new filter definition is used. 図１４は、実施例１に係る畳込演算部によるフォワード畳込演算のフローチャートである。FIG. 14 is a flowchart of the forward convolution calculation by the convolution calculation unit according to the first embodiment. 図１５は、実施例１に係る畳込演算部によるバックワード畳込演算のフローチャートである。FIG. 15 is a flowchart of a backward convolution calculation by the convolution calculation unit according to the first embodiment. 図１６は、実施例２に係るプーリング処理部によるストライド数が２の場合のプーリング処理を説明するための図である。FIG. 16 is a diagram for explaining a pooling process when the number of strides by the pooling process unit according to the second embodiment is 2. 図１７は、実施例２に係るプーリング処理部によるストライド数が１の場合のプーリング処理を説明するための図である。FIG. 17 is a diagram for explaining a pooling process when the number of strides by the pooling process unit according to the second embodiment is 1. 図１８は、実施例３に係る畳込演算部によるフォワード畳込演算を説明するための図である。FIG. 18 is a diagram for explaining a forward convolution operation by the convolution operation unit according to the third embodiment. 図１９は、実施例４に係る畳込演算部による新フィルタ定義を用いたフォワード畳込演算の一例を説明するための図である。FIG. 19 is a diagram for explaining an example of a forward convolution operation using a new filter definition by the convolution operation unit according to the fourth embodiment. 図２０は、実施例４に係る畳込演算部による新フィルタ定義を用いたフォワード畳込演算の他の例を説明するための図である。FIG. 20 is a diagram for explaining another example of the forward convolution operation using the new filter definition by the convolution operation unit according to the fourth embodiment. 図２１は、フォワード畳込演算のプログラムの記述例を説明するための図である。FIG. 21 is a diagram for explaining a description example of a program for forward convolution operation. 図２２は、バックワード畳込重み差分演算のプログラムの記述例を説明するための図である。FIG. 22 is a diagram for explaining a description example of a program for backward convolution weight difference calculation. 図２３は、バックワード畳込ボトム差分演算のプログラムの記述例を説明するための図である。FIG. 23 is a diagram for explaining a description example of a program for backward convolution bottom difference calculation. 図２４は、演算処理装置のハードウェア構成図である。FIG. 24 is a hardware configuration diagram of the arithmetic processing unit.

以下に、本願の開示する演算処理装置及び演算処理装置の制御方法の実施例を図面に基づいて詳細に説明する。なお、以下の実施例により本願の開示する演算処理装置及び演算処理装置の制御方法が限定されるものではない。 Hereinafter, examples of the arithmetic processing unit and the control method of the arithmetic processing unit disclosed in the present application will be described in detail with reference to the drawings. The following examples do not limit the arithmetic processing unit and the control method of the arithmetic processing unit disclosed in the present application.

図１は、畳み込みニューラルネット（ＣＮＮ：Convolutional Neural Network）における処理の全体的な流れを説明するための図である。ここで、本実施例では、画像認識のためのＣＮＮにおける処理について説明する。図１に示すように、演算処理装置１は、入力データ２の入力を受ける。演算処理装置１は、ＣＮＮにおいて複数の演算処理層１１〜１３による処理を実行する。以下では、各演算処理層１１〜１３を区別しない場合、単に「演算処理層１０」という。 FIG. 1 is a diagram for explaining the overall flow of processing in a convolutional neural network (CNN). Here, in this embodiment, the processing in CNN for image recognition will be described. As shown in FIG. 1, the arithmetic processing unit 1 receives the input of the input data 2. The arithmetic processing unit 1 executes processing by a plurality of arithmetic processing layers 11 to 13 in the CNN. In the following, when the arithmetic processing layers 11 to 13 are not distinguished, they are simply referred to as “arithmetic processing layer 10”.

各演算処理層１０では、矢印Ｐ１方向である伝播方向に向かって、特徴点の抽出などの演算処理を行う。以下では、演算処理装置１による矢印Ｐ１へ向かう方向の演算処理を、「フォワード演算」という場合がある。また、各演算処理層１０では、矢印Ｐ２方向である逆伝播方向に向かって、各層における特徴点の抽出の精度を上げるために、矢印Ｐ２方向である逆伝播方向に向かって２種類の演算処理を行う。以下では、演算処理装置１による矢印Ｐ２へ向かう方向の演算処理を「バックワード演算」という場合がある。 In each arithmetic processing layer 10, arithmetic processing such as extraction of feature points is performed in the propagation direction which is the direction of arrow P1. In the following, the arithmetic processing in the direction toward the arrow P1 by the arithmetic processing unit 1 may be referred to as “forward arithmetic”. Further, in each arithmetic processing layer 10, two types of arithmetic processing are performed in the back propagation direction in the arrow P2 direction in order to improve the accuracy of extraction of feature points in each layer in the back propagation direction in the arrow P2 direction. I do. In the following, the arithmetic processing in the direction toward the arrow P2 by the arithmetic processing unit 1 may be referred to as “backward arithmetic”.

各演算処理層１０は、それぞれ特徴量の抽出に用いるフィルタである重みデータをメモリなどの記憶装置から取得する。さらに、第１層である演算処理層１１は、メモリなどの記憶装置から入力データ２を取得する。そして、演算処理層１１は、入力データ２をボトムデータとして、ボトムデータに対して重みデータを用いて畳込演算を実行する。次に、第２層である演算処理層１２は、演算処理層１１からの出力データをボトムデータとして、そのボトムデータ及び重みデータを用いて畳込演算を行う。演算処理装置１は、このように各演算処理層１０で演算処理を順次行い、第ｎ層である演算処理層１３での重みデータを用いた畳込演算の演算結果に対して正規化処理などを施した特徴量を表すデータを出力データ３として出力する。以下では、フォワード演算においてボトムデータと重みデータとを用いた畳込演算を、「フォワード畳込演算」という。 Each arithmetic processing layer 10 acquires weight data, which is a filter used for extracting a feature amount, from a storage device such as a memory. Further, the arithmetic processing layer 11, which is the first layer, acquires the input data 2 from a storage device such as a memory. Then, the arithmetic processing layer 11 executes the convolution operation using the input data 2 as the bottom data and the weight data for the bottom data. Next, the arithmetic processing layer 12, which is the second layer, uses the output data from the arithmetic processing layer 11 as bottom data, and performs the convolution operation using the bottom data and the weight data. In this way, the arithmetic processing device 1 sequentially performs arithmetic processing in each arithmetic processing layer 10, and normalizes the arithmetic result of the convolution operation using the weight data in the nth arithmetic processing layer 13. The data representing the feature amount subjected to the above is output as the output data 3. Hereinafter, the convolution operation using the bottom data and the weight data in the forward operation is referred to as a “forward convolution operation”.

さらに、各演算処理層１０は、バックワード演算における畳込み演算の１つとして、期待値と出力データ３との差分であるトップ差分データを用いて重み差分データを求める。例えば、第ｎ層である演算処理層１３は、予め決められた期待値を有し、出力データ３と期待値とを比較する。そして、演算処理層１３は、出力データ３と期待値との差分であるトップ差分データを求め、その求めたトップ差分データを入力データとして取得する。次に、演算処理層１３は、入力データ及び第ｎ層におけるフォワード畳込演算で用いたボトムデータを用いて重みデータの重みデータの期待値との差分である重み差分データを求める。そして、演算処理層１３は、求めた重み差分データを用いて第ｎ層における重みデータを修正する。さらに、演算処理層１３は、もう１つのバックワード演算における畳込み演算として、修正した重みデータと出力データ３と期待値との差分とを用いてボトムデータとボトムデータの期待値との差分であるボトム差分データを算出する。 Further, each arithmetic processing layer 10 obtains the weight difference data by using the top difference data which is the difference between the expected value and the output data 3 as one of the convolution operations in the backward operation. For example, the arithmetic processing layer 13, which is the nth layer, has a predetermined expected value, and compares the output data 3 with the expected value. Then, the arithmetic processing layer 13 obtains the top difference data which is the difference between the output data 3 and the expected value, and acquires the obtained top difference data as input data. Next, the arithmetic processing layer 13 obtains the weight difference data which is the difference from the expected value of the weight data of the weight data by using the input data and the bottom data used in the forward convolution operation in the nth layer. Then, the arithmetic processing layer 13 corrects the weight data in the nth layer by using the obtained weight difference data. Further, the arithmetic processing layer 13 uses the difference between the corrected weight data, the output data 3, and the expected value as a convolution operation in another backward arithmetic, and uses the difference between the bottom data and the expected value of the bottom data. Calculate some bottom difference data.

次に、第ｎ−１層の演算処理層１０は、演算処理層１３において算出されたボトム差分データに逆プーリング処理や逆正規化処理が施されたデータをトップ差分データとして取得する。次に、第ｎ−１層の演算処理層１０は、第ｎ−１層におけるフォワード畳込演算で用いたボトムデータとトップ差分データとを用いて重み差分データを算出する。そして、第ｎ−１層の演算処理層１０は、求めた重み差分データを用いて第ｎ−１層における重みデータを修正する。さらに、第ｎ−１層の演算処理層１０は、修正した重みデータとトップ差分データとを用いて第ｎ−１層におけるボトム差分データを算出する。演算処理装置１は、上述したバックワード演算における畳込演算を第１層まで繰り返す。以下では、バックワード演算における畳込演算を、「バックワード畳込演算」という。 Next, the arithmetic processing layer 10 of the n-1th layer acquires data obtained by subjecting the bottom difference data calculated in the arithmetic processing layer 13 to reverse pooling processing or denormalization processing as top difference data. Next, the arithmetic processing layer 10 of the n-1th layer calculates the weight difference data using the bottom data and the top difference data used in the forward convolution operation in the n-1th layer. Then, the arithmetic processing layer 10 of the n-1th layer corrects the weight data in the n-1th layer by using the obtained weight difference data. Further, the arithmetic processing layer 10 of the n-1th layer calculates the bottom difference data in the n-1th layer by using the modified weight data and the top difference data. The arithmetic processing unit 1 repeats the convolution operation in the backward operation described above up to the first layer. Hereinafter, the convolution operation in the backward operation is referred to as "backword convolution operation".

すなわち、矢印Ｐ１方向を各層の並び方向として、演算処理装置１は、特定の演算処理層１０の１つ先の層の演算処理層１０において特定の演算処理層１０におけるトップ差分データを算出する。そして、演算処理装置１は、算出したトップ差分データと１つ前の演算処理層１０の出力データであるボトムデータとを用いて、特定の演算処理層１０における重み差分データを求める。そして、演算処理装置１は、求めた特定の演算処理層１０における重み差分データを用いて特定の演算処理層１０が使用する重みデータを修正する。さらに、演算処理装置１は、トップ差分データと特定の演算処理層１０におけるボトム差分データを算出する。 That is, with the arrow P1 direction as the arrangement direction of each layer, the arithmetic processing unit 1 calculates the top difference data in the specific arithmetic processing layer 10 in the arithmetic processing layer 10 which is one layer ahead of the specific arithmetic processing layer 10. Then, the arithmetic processing unit 1 obtains the weight difference data in the specific arithmetic processing layer 10 by using the calculated top difference data and the bottom data which is the output data of the previous arithmetic processing layer 10. Then, the arithmetic processing unit 1 corrects the weight data used by the specific arithmetic processing layer 10 by using the weight difference data in the specific arithmetic processing layer 10 obtained. Further, the arithmetic processing unit 1 calculates the top difference data and the bottom difference data in the specific arithmetic processing layer 10.

以下では、バックワード畳込演算において、トップ差分データとボトムデータとを用いて重み差分データを求める演算を、「バックワード畳込重み差分演算」という。さらに、修正された重みデータとトップ差分データとを用いてボトム差分データを算出する演算を、「バックワード畳込ボトム差分演算」という。 In the following, in the backward convolution operation, the operation of obtaining the weight difference data by using the top difference data and the bottom data is referred to as "backward convolution weight difference operation". Further, the operation of calculating the bottom difference data using the corrected weight data and the top difference data is called "backward convolution bottom difference calculation".

演算処理装置１は、各演算処理層１０における重みデータの修正及び１つ前の演算処理層におけるトップ差分データの算出を順次繰り返ことにより、各演算処理層１０の全ての層の重みデータを演算処理層１３の出力データ３の期待値に合わせて修正する。 The arithmetic processing unit 1 sequentially repeats the correction of the weight data in each arithmetic processing layer 10 and the calculation of the top difference data in the previous arithmetic processing layer 10 to obtain the weight data of all the layers of each arithmetic processing layer 10. It is modified according to the expected value of the output data 3 of the arithmetic processing layer 13.

演算処理装置１は、各層で取得した特徴量を用いて繰り返しパラメータ更新する学習することで、画像認識の精度を向上させ、精度の高い画像認識を行うことができる。また、例えば、音声認識の場合には、入力データ２は音声データとなり、テキストマイニングの場合には入力データ２は単語となる。 The arithmetic processing unit 1 can improve the accuracy of image recognition and perform highly accurate image recognition by learning to repeatedly update parameters using the feature amounts acquired in each layer. Further, for example, in the case of voice recognition, the input data 2 becomes voice data, and in the case of text mining, the input data 2 becomes a word.

ここで、本実施例では、画像データで有るボトムデータを方形に行列として並んだ要素データを有する場合で説明する。以下では、フォワード畳込演算における重みデータの１回の移動量を「ストライド数」という場合がある。 Here, in this embodiment, the case where the bottom data, which is the image data, has the element data arranged as a matrix in a square will be described. In the following, the amount of movement of weight data at one time in the forward convolution operation may be referred to as “the number of strides”.

ここで、図２を参照して、係るフォワード畳込演算及びバックワード演算をさらに説明する。図２は、フォワード畳込演算及びバックワード畳込演算を説明するための図である。図２は、入力データ２を用いて演算処理を始める第１層から出力データ２０６と期待値２０７からトップ差分データ２０３を生成する第ｎ層までを表す。ここでは、演算処理層１１を第１層とし、演算処理層１４を第ｎ−１層とし、演算処理層１３を第ｎ層として、第ｎ層まで各演算処理層１１〜１４における演算を例に記載した。また、図２中の円で記載した処理は演算処理を表す。演算処理Ｆ１は、フォワード畳込演算を表す。演算処理Ｆ２は、バックワード畳込重み差分演算を表す。また、演算処理Ｆ３は、バックワード畳込ボトム差分演算を表す。 Here, with reference to FIG. 2, the forward convolution operation and the backward operation will be further described. FIG. 2 is a diagram for explaining a forward convolution operation and a backward convolution operation. FIG. 2 shows from the first layer that starts arithmetic processing using the input data 2 to the nth layer that generates the top difference data 203 from the output data 206 and the expected value 207. Here, the arithmetic processing layer 11 is the first layer, the arithmetic processing layer 14 is the n-1th layer, the arithmetic processing layer 13 is the nth layer, and the arithmetic operations in the arithmetic processing layers 11 to 14 up to the nth layer are examples. Described in. Further, the processing described by the circle in FIG. 2 represents an arithmetic processing. The arithmetic processing F1 represents a forward convolution operation. The calculation process F2 represents a backward convolution weight difference calculation. Further, the calculation process F3 represents a backward convolution bottom difference calculation.

演算処理装置１は、演算処理層１１において入力データ２及び第１層での重みデータ２０２に対して演算処理Ｆ１で表されるフォワード畳込演算を行い、トップデータ２０９を算出する。その後は、図示しないが、同様に次の第２層において、前の層において算出されたトップデータ２０９から取得したボトムデータ２０１及び第２層での重みデータ２０２に対して同様に演算処理Ｆ１で表されるフォワード畳込演算を行う。各演算処理層１０は同様のフォワード演算を繰り返す。そして、最後の第ｎ層である演算処理層１３は、同様に演算処理層１４において算出されたトップデータ２０９から取得したボトムデータ２０１及び第ｎ層での重みデータ２０２に対して演算処理Ｆ１で表されるフォワード畳込演算を行う。 The arithmetic processing unit 1 performs a forward convolution operation represented by the arithmetic processing F1 on the input data 2 and the weight data 202 in the first layer in the arithmetic processing layer 11, and calculates the top data 209. After that, although not shown, similarly, in the next second layer, the bottom data 201 acquired from the top data 209 calculated in the previous layer and the weight data 202 in the second layer are similarly subjected to the arithmetic processing F1. Perform the represented forward convolution operation. Each arithmetic processing layer 10 repeats the same forward arithmetic. Then, the final nth layer, the arithmetic processing layer 13, is subjected to the arithmetic processing F1 with respect to the bottom data 201 acquired from the top data 209 calculated in the arithmetic processing layer 14 and the weight data 202 in the nth layer. Perform the represented forward convolution operation.

さらに、演算処理層１３は、出力データ３と期待値２０７とを比較して、トップ差分データ２０３を算出する。ここで、入力データ２は、第２層〜第ｎ層におけるボトムデータ２０１にあたるため、以下では、第１層のボトムデータ２０１として扱う。また、第ｎ層の出力データ３は、第１層〜第ｎ−１層におけるトップデータ２０９にあたる。 Further, the arithmetic processing layer 13 compares the output data 3 with the expected value 207 to calculate the top difference data 203. Here, since the input data 2 corresponds to the bottom data 201 in the second layer to the nth layer, it is treated as the bottom data 201 of the first layer below. Further, the output data 3 of the nth layer corresponds to the top data 209 in the first layer to the n-1th layer.

バックワード演算の場合、演算処理層１３は、トップ差分データ２０３及びボトムデータ２０１に対して演算処理Ｆ２で表される畳み込みバックワードの重み差分演算を行い、重み差分データ２０４を算出する。さらに、演算処理層１３は、重み差分データ２０４を用いて重みデータ２０２を更新する。ここで、図２における一点鎖線の矢印が重みデータ２０２の更新の処理を表す。具体的には、演算処理装置１は、重み差分データ２０４に学習率を乗算して、新たな重みデータ２０２を算出する。さらに、演算処理層１３は、フォワード畳込演算で使用した重みデータ２０２及びトップ差分データ２０３に対して演算処理Ｆ３で表されるバックワード畳込ボトム差分演算を行い、ボトム差分データ２０５を算出する。 In the case of the backward calculation, the arithmetic processing layer 13 performs the weight difference calculation of the convolution backward represented by the arithmetic processing F2 on the top difference data 203 and the bottom data 201, and calculates the weight difference data 204. Further, the arithmetic processing layer 13 updates the weight data 202 by using the weight difference data 204. Here, the arrow of the alternate long and short dash line in FIG. 2 represents the process of updating the weight data 202. Specifically, the arithmetic processing unit 1 multiplies the weight difference data 204 by the learning rate to calculate new weight data 202. Further, the arithmetic processing layer 13 performs the backward convolution bottom difference calculation represented by the arithmetic processing F3 on the weight data 202 and the top difference data 203 used in the forward convolution operation, and calculates the bottom difference data 205. ..

演算処理層１４は、演算処理層１３が出力したボトム差分データ２０５から取得したトップ差分データ２０３及びボトムデータ２０１に対して演算処理Ｆ２で表される畳み込みバックワードの重み差分演算を行い、重み差分データ２０４を算出する。さらに、演算処理層１４は、重み差分データ２０４を用いて重みデータ２０２を更新する。さらに、演算処理層１４は、フォワード畳込演算で使用した重みデータ２０２及びトップ差分データ２０３に対して演算処理Ｆ３で表されるバックワード畳込ボトム差分演算を行い、ボトム差分データ２０５を算出する。各演算処理層１０は同様のバックワード演算を繰り返す。そして、最後の第１層である演算処理層１１は、同様に第２層で算出されたボトム差分データ２０５から取得したトップ差分データ２０３を用いて、バックワード畳込重み差分演算及びバックワード畳込ボトム差分演算を行う。 The arithmetic processing layer 14 performs a weight difference calculation of the convolution backward represented by the arithmetic processing F2 on the top difference data 203 and the bottom data 201 acquired from the bottom difference data 205 output by the arithmetic processing layer 13, and the weight difference. Data 204 is calculated. Further, the arithmetic processing layer 14 updates the weight data 202 by using the weight difference data 204. Further, the arithmetic processing layer 14 performs the backward convolution bottom difference calculation represented by the arithmetic processing F3 on the weight data 202 and the top difference data 203 used in the forward convolution operation, and calculates the bottom difference data 205. .. Each arithmetic processing layer 10 repeats the same backward arithmetic. Then, the arithmetic processing layer 11, which is the last first layer, uses the top difference data 203 acquired from the bottom difference data 205 similarly calculated in the second layer, and uses the backward convolution weight difference calculation and the backward tatami. Performs included bottom difference calculation.

図３は、演算処理層の詳細を表すブロック図である。演算処理層１０は、フォワード演算を実行する機能部として、畳込演算部１０１、活性化処理部１０２及びプーリング処理部１０３を有する。また、演算処理層１０は、バックワード演算を実行する機能部として、プーリング処理部１０４、活性化処理部１０５及び畳込演算部１０６を有する。 FIG. 3 is a block diagram showing details of the arithmetic processing layer. The arithmetic processing layer 10 has a convolution arithmetic unit 101, an activation processing unit 102, and a pooling processing unit 103 as functional units that execute forward operations. In addition, the arithmetic processing layer 10 has a pooling processing unit 104, an activation processing unit 105, and a convolution operation unit 106 as functional units for executing backward operations.

畳込演算部１０１は、前段の演算処理層１０からの出力データを用いて後述する畳込演算を行う。ここで、図４を参照して、畳込演算部１０１についてさらに詳細に説明する。図４は、実施例１に係るフォワード畳込演算を行う畳込演算部の詳細を表すブロック図である。図４に示すように、畳込演算部１０１は、入力データ処理部１１１、乗算部１１２、加算部１１３、出力データ作成部１１４及び重みデータ記憶部１１５を有する。 The convolution calculation unit 101 performs a convolution calculation described later using the output data from the calculation processing layer 10 in the previous stage. Here, the convolution calculation unit 101 will be described in more detail with reference to FIG. FIG. 4 is a block diagram showing details of a convolution calculation unit that performs a forward convolution operation according to the first embodiment. As shown in FIG. 4, the convolution calculation unit 101 includes an input data processing unit 111, a multiplication unit 112, an addition unit 113, an output data creation unit 114, and a weight data storage unit 115.

重みデータ記憶部１１５は、フォワード畳込演算に使用する複数種類のフィルタ定義に対応する重みデータ２０２を記憶する。本実施例では、重みデータ記憶部１１５は、図５に示す新フィルタ定義３０１及びフィルタ定義３０２を使用して作成された重みデータ２０２を記憶する。図５は、フィルタ定義の一例を示す図である。フィルタ定義３０２は、３×３のサイズを有する従来のフィルタ定義である。新フィルタ定義３０１は、フィルタ定義３０２に対応する新しいフィルタ定義である。 The weight data storage unit 115 stores weight data 202 corresponding to a plurality of types of filter definitions used in the forward convolution operation. In this embodiment, the weight data storage unit 115 stores the weight data 202 created by using the new filter definition 301 and the filter definition 302 shown in FIG. FIG. 5 is a diagram showing an example of a filter definition. The filter definition 302 is a conventional filter definition having a size of 3 × 3. The new filter definition 301 is a new filter definition corresponding to the filter definition 302.

新フィルタ定義３０１は、軸３１１〜３１４に関して中心に対して対称性を有する。すなわち、新フィルタ定義３０１は、縦横斜めの方向に対称性を有しており、画像の縦横斜め方向に対する画像認識を精度良く行うことができる。したがって、新フィルタ定義３０１は、フィルタ定義３０２を用いた場合に比べて画像認識の精度の低下は少なく、十分に画像認識を行うことができる。 The new filter definition 301 has symmetry with respect to the center with respect to axes 31 to 314. That is, the new filter definition 301 has symmetry in the vertical, horizontal, and diagonal directions, and can accurately recognize the image in the vertical, horizontal, and diagonal directions. Therefore, the new filter definition 301 has less decrease in image recognition accuracy than the case where the filter definition 302 is used, and can sufficiently perform image recognition.

本実施例では、３×３の重みデータ２０２を用いたが、重みデータ記憶部１１５は、サイズの異なる重みデータ２０２を記憶してもよい。例えば、重みデータ記憶部１１５は、新フィルタ定義３０３及びフィルタ定義３０４を記憶してもよい。フィルタ定義３０４は、５×５のサイズを有する従来のフィルタ定義である。新フィルタ定義３０３は、フィルタ定義３０４に対応する新しいフィルタ定義である。新フィルタ定義３０３も、軸３３１〜３３４に関して中心に対して対称性を有する。すなわち、新フィルタ定義３０３は、フィルタ定義３０４を用いた場合に比べて画像認識の精度の低下は少なく、十分に画像認識を行うことができる。新フィルタ定義３０１や３０３は、行方向及び列方向に同数の要素データが配置された状態から真ん中の行から１つ離れるにしたがい行の含まれる要素データが１つずつのぞかれる。さらに、新フィルタ定義３０１や３０３は、要素データを除いた行の半分の位置と要素データを除く前の行の半分の位置とが一致するように行がずらされる。 In this embodiment, 3 × 3 weight data 202 is used, but the weight data storage unit 115 may store weight data 202 having different sizes. For example, the weight data storage unit 115 may store the new filter definition 303 and the filter definition 304. The filter definition 304 is a conventional filter definition having a size of 5 × 5. The new filter definition 303 is a new filter definition corresponding to the filter definition 304. The new filter definition 303 also has center symmetry with respect to axes 331-334. That is, the new filter definition 303 has less decrease in image recognition accuracy than the case where the filter definition 304 is used, and can sufficiently perform image recognition. In the new filter definitions 301 and 303, the element data including the row is peeked one by one according to the distance from the middle row from the state where the same number of element data are arranged in the row direction and the column direction. Further, in the new filter definitions 301 and 303, the rows are shifted so that the position of half of the row excluding the element data and the position of half of the row before excluding the element data match.

また、本実施例では、新フィルタ定義３０１及び３０３という２種類のフィルタ定義について説明したが、フィルタ定義３０２や３０４といった従来のフィルタ定義に比べて要素データの数が少ないものであれば新フィルタ定義はこれに限らない。ただし、新フィルタ定義は、縦横斜めの方向に中心に対して対称性を有することが好ましい。以下では、新フィルタ定義３０１を使用して作成された重みデータ２０２を「重みデータ２２１」という。 Further, in this embodiment, two types of filter definitions, the new filter definitions 301 and 303, have been described, but if the number of element data is smaller than that of the conventional filter definitions such as the filter definitions 302 and 304, the new filter definition is defined. Is not limited to this. However, it is preferable that the new filter definition has symmetry with respect to the center in the vertical, horizontal, and diagonal directions. In the following, the weight data 202 created by using the new filter definition 301 will be referred to as “weight data 221”.

入力データ処理部１１１は、フォワード演算における前段の演算処理層１０からボトムデータ２０１の入力を受ける。このボトムデータ２０１が、「第１データ」の一例にあたる。そして、入力データ処理部１１１は、重みデータ記憶部１１５から重みデータ２０２を取得する。次に、入力データ処理部１１１は、図示しない入力装置から入力された操作者からの指示から画像判定に新フィルタ定義３０１を用いるか否かを判定する。新フィルタ定義３０１を用いない場合、入力データ処理部１１１は、フィルタ定義３０２を使用して作成された重みデータ２０２を用いることを乗算部１１２に伝えるとともに、ボトムデータを出力する。 The input data processing unit 111 receives the input of the bottom data 201 from the arithmetic processing layer 10 in the previous stage in the forward operation. This bottom data 201 corresponds to an example of "first data". Then, the input data processing unit 111 acquires the weight data 202 from the weight data storage unit 115. Next, the input data processing unit 111 determines whether or not to use the new filter definition 301 for image determination based on an instruction from an operator input from an input device (not shown). When the new filter definition 301 is not used, the input data processing unit 111 informs the multiplication unit 112 that the weight data 202 created by using the filter definition 302 is used, and outputs bottom data.

一方、新フィルタ定義３０１を用いる場合、入力データ処理部１１１は、入力されたボトムデータ２０１が新フィルタ定義３０１に対応するデータか否かを判定する。ボトムデータ２０１が新フィルタ定義３０１に対応するデータの場合、入力データ処理部１１１は、新フィルタ定義３０１を使用して作成された重みデータ２２１を用いることを乗算部１１２に伝えるとともに、ボトムデータを出力する。この重みデータ２２１が、「第２データ」の一例にあたる。 On the other hand, when the new filter definition 301 is used, the input data processing unit 111 determines whether or not the input bottom data 201 is data corresponding to the new filter definition 301. When the bottom data 201 is data corresponding to the new filter definition 301, the input data processing unit 111 informs the multiplication unit 112 that the weight data 221 created by using the new filter definition 301 is used, and also transmits the bottom data. Output. This weight data 221 corresponds to an example of "second data".

これに対して、ボトムデータ２０１が新フィルタ定義３０１に対応していないデータの場合、入力データ処理部１１１は、ボトムデータ２０１を新フィルタ定義３０１に合わせて変換する。図６は、ボトムデータの変換の一例を説明するための図である。 On the other hand, when the bottom data 201 does not correspond to the new filter definition 301, the input data processing unit 111 converts the bottom data 201 according to the new filter definition 301. FIG. 6 is a diagram for explaining an example of conversion of bottom data.

本実施例では、入力データ処理部１１１は、ボトムデータ２０１の隔行について、隣接する要素データとの平均を算出して、要素データの位置に格納する。例えば、図６に示す８×８の要素データｂ００〜ｂ６３を有するボトムデータ２０１の場合について説明する。入力データ処理部１１１は、１行目を飛ばして２行目を先頭に隔行を変更する行とする。 In this embodiment, the input data processing unit 111 calculates the average with the adjacent element data for the interval of the bottom data 201 and stores it at the position of the element data. For example, the case of bottom data 201 having 8 × 8 element data b00 to b63 shown in FIG. 6 will be described. The input data processing unit 111 skips the first line and sets the second line as the first line to change the interval.

まず、入力データ処理部１１１は、２行目の要素データｂ０８と要素データｂ０９との平均である要素データｎｂ０８を算出し、要素データｂ０８の位置に格納する。次に、入力データ処理部１１１は、要素データｂ０９と要素データｂ１０との平均である要素データｎｂｖ０９を算出し、要素データｂ０９の位置に格納する。このように、入力データ処理部１１１は、隣合う２つの要素データの平均値を若番の要素データの位置に格納することを要素データｂ０８〜ｂ１５まで繰り返す。ただし、要素データｂ１５に関しては、右隣に次の要素データｂ１６が存在しない。そこで、要素データｂ１５の右隣りには、平均を出すための要素データとして値が０である要素データが隣に存在するものとして計算を行う。すなわち、入力データ処理部１１１は、要素データｂ１５と０の要素データとの平均である要素データｎｂ１５を算出し、要素データｂ１５の位置に格納する。このように、入力データ処理部１１１は、変換後の２行目の要素データｎｂ０８〜ｎｂ１５を算出する。 First, the input data processing unit 111 calculates the element data nb08, which is the average of the element data b08 and the element data b09 in the second line, and stores the element data nb08 at the position of the element data b08. Next, the input data processing unit 111 calculates the element data nbv09, which is the average of the element data b09 and the element data b10, and stores it at the position of the element data b09. In this way, the input data processing unit 111 repeats storing the average value of the two adjacent element data at the position of the younger element data from the element data b08 to b15. However, regarding the element data b15, the next element data b16 does not exist on the right side. Therefore, on the right side of the element data b15, the calculation is performed assuming that the element data having a value of 0 exists next to the element data for calculating the average. That is, the input data processing unit 111 calculates the element data nb15, which is the average of the element data b15 and the element data of 0, and stores the element data nb15 at the position of the element data b15. In this way, the input data processing unit 111 calculates the element data nb08 to nb15 of the second line after conversion.

同様に、入力データ処理部１１１は、４，６及び８行目の要素データｎｂ２４〜ｎｂ３１，ｎｂ４０〜ｎｂ４７及びｎｂ５６〜ｎｂ６３を算出する。これにより、入力データ処理部１１１は、ボトムデータ２０１を変換したボトムデータ２１１を作成する。以下では、ボトムデータ２１１の全ての要素データを表す場合には要素データｂ００〜ｎｂ６３と表記する。 Similarly, the input data processing unit 111 calculates the element data nb24 to nb31, nb40 to nb47, and nb56 to nb63 on the fourth, sixth, and eighth lines. As a result, the input data processing unit 111 creates the bottom data 211 obtained by converting the bottom data 201. In the following, when all the element data of the bottom data 211 is represented, it is expressed as element data b00 to nb63.

図７は、変換後のボトムデータの見た目を表す図である。ボトムデータ２１１の要素データｂ００〜ｎｂ６３は、各ドットに割り当てた状態で配置される。すなわち、演算処理装置１は、変換したボトムデータ２１１を用いてフォワード畳込演算を行う。ただし、画像としての実際の見た目は、変換を行った各行の要素データｎｂ０８〜ｎｂ１５，ｎｂ２４〜ｎｂ３１，ｎｂ４０〜ｎｂ４７及びｎｂ５６〜ｎｂ６３が右側にドットの半分ずつずらされたボトムデータ２１０となる。すなわち、見た目は、図７に示すように、ボトムデータ２１１の見た目はボトムデータ２１０として表すことができる。以下では、分かり易いように、変換後のボトムデータ２１１を見た目のボトムデータ２１０を用いて説明する。 FIG. 7 is a diagram showing the appearance of the bottom data after conversion. The element data b00 to nb63 of the bottom data 211 are arranged in a state of being assigned to each dot. That is, the arithmetic processing unit 1 performs a forward convolution operation using the converted bottom data 211. However, the actual appearance as an image is the bottom data 210 in which the element data nb08 to nb15, nb24 to nb31, nb40 to nb47, and nb56 to nb63 of each converted line are shifted to the right by half a dot. That is, as shown in FIG. 7, the appearance of the bottom data 211 can be represented as the bottom data 210. In the following, for the sake of clarity, the converted bottom data 211 will be described using the apparent bottom data 210.

ここで、図８及び９を参照して、さらに具体的にボトムデータ２０１の変換について説明する。図８は、ボトムデータの変換の一例を表す図である。また、図９は、ボトムデータの変換の他の例を表す図である。 Here, the conversion of the bottom data 201 will be described more specifically with reference to FIGS. 8 and 9. FIG. 8 is a diagram showing an example of conversion of bottom data. Further, FIG. 9 is a diagram showing another example of conversion of bottom data.

例えば、図８のように、ボトムデータ２０１として漢数字の三が入力データ処理部１１１に入力された場合で説明する。この場合、ボトムデータ２０１の２行目に三の一番上の線が存在し、５行目に三の真ん中の線が存在し、８行目に三の一番下の線が存在する。各要素データｂ００〜ｂ６３は、濃淡情報３０で表される値を有する。ボトムデータ２０１において三を表す要素データ以外の要素データは、白色を表す０を値として有する。さらに、ボトムデータ２０１において三を表す要素データは、黒を表す値２５５を有する。 For example, as shown in FIG. 8, the case where the Chinese numeral three is input to the input data processing unit 111 as the bottom data 201 will be described. In this case, the top line of the three exists in the second line of the bottom data 201, the middle line of the three exists in the fifth line, and the bottom line of the three exists in the eighth line. Each element data b00 to b63 has a value represented by the shade information 30. The element data other than the element data representing 3 in the bottom data 201 has 0 representing white as a value. Further, the element data representing three in the bottom data 201 has a value 255 representing black.

入力データ処理部１１１は、２行目の要素データｂ０８〜ｂ１５の隣り合うデータの平均を算出して、変換後の要素データｎｂ０８〜ｎｂ１５を算出する。この場合、要素データｎｂ０８は、値１２７を有する。また、要素データｎｂ０９〜ｎｂ１３は、値２５５を有する。また、要素データｎｂ１４は、値１２７を有する。さらに、要素データｎｂ１５は、値として０を有する。 The input data processing unit 111 calculates the average of the adjacent data of the element data b08 to b15 of the second line, and calculates the element data nb08 to nb15 after conversion. In this case, the element data nb08 has a value of 127. Further, the element data nb09 to nb13 have a value of 255. Further, the element data nb14 has a value 127. Further, the element data nb15 has 0 as a value.

また、入力データ処理部１１１は、４及び６行目の要素データｂ２４〜ｂ３１及びｂ４０〜ｂ４７の隣り合うデータの平均を算出して、変換後の要素データｎｂ２４〜ｎｂ３１及びｎｂ４０〜ｎｂ４７を算出する。この場合、４及び６行目は要素データｂ２４〜ｂ３１及びｂ４０〜ｂ４７は全て値が０であるので、変換後の要素データｎｂ２４〜ｎｂ３１及びｎｂ４０〜ｎｂ４７も全て値が０である。 Further, the input data processing unit 111 calculates the average of adjacent data of the element data b24 to b31 and b40 to b47 in the 4th and 6th lines, and calculates the converted element data nb24 to nb31 and nb40 to nb47. .. In this case, since the values of the element data b24 to b31 and b40 to b47 are all 0 in the 4th and 6th lines, the values of the converted element data nb24 to nb31 and nb40 to nb47 are also 0.

さらに、入力データ処理部１１１は、８行目の要素データｂ５６〜ｂ６３の隣り合うデータの平均を算出して、変換後の要素データｎｂ５６〜ｎｂ６３を算出する。この場合、要素データｎｂ５６〜ｎｂ６２は、値２５５を有する。また、要素データｎｂ６４は、値１２７を有する。 Further, the input data processing unit 111 calculates the average of the adjacent data of the element data b56 to b63 on the eighth line, and calculates the converted element data nb56 to nb63. In this case, the element data nb56 to nb62 have a value of 255. Further, the element data nb64 has a value 127.

入力データ処理部１１１は、漢数字の三を表す画像であるボトムデータ２０１を変換する。その場合、変換後のボトムデータ２１０は、図８に示すように、濃淡にわずかな違いが存在する漢数字の三を表す画像となる。 The input data processing unit 111 converts the bottom data 201, which is an image representing the three Chinese numerals. In that case, as shown in FIG. 8, the converted bottom data 210 is an image representing the three Chinese numerals having a slight difference in shading.

次に、図９のように、ボトムデータ２０１として対角線の画像が入力データ処理部１１１に入力された場合で説明する。この場合、ボトムデータ２０１の対角線に線が存在する。この場合も、各要素データｂ００〜ｂ６３は、図８における濃淡情報３０で表される値を有する。対角線を表す要素データｂ００，ｂ０９，ｂ１８，ｂ２７，ｂ３６，ｂ４５，ｂ５４及びｂ６３が、グレーを表す値を有し、他の要素データは値として０を有する。 Next, as shown in FIG. 9, a case where a diagonal image is input to the input data processing unit 111 as bottom data 201 will be described. In this case, a line exists on the diagonal line of the bottom data 201. Also in this case, each element data b00 to b63 has a value represented by the shade information 30 in FIG. The element data b00, b09, b18, b27, b36, b45, b54 and b63 representing the diagonal line have a value representing gray, and the other element data has a value of 0.

そして、入力データ処理部１１１は、要素データｂ０８〜ｂ１５，ｂ２４〜ｂ３１，ｂ４０〜ｂ４７及びｂ５６〜ｂ６３の隣り合うデータの平均を算出し、要素データｎｂ０８〜ｎｂ１５，ｎｂ２４〜ｎｂ３１，ｎｂ４０〜ｎｂ４７及びｎｂ５６〜ｎｂ６３を求める。この場合、要素データｎｂ０８，ｎｂ０９，ｎｂ２６，ｎｂ２７ｎｂ４４，ｎｂ４５，ｎｂ６２及びｎｂ６３は、要素データｂ０８〜ｂ１５，ｂ２４〜ｂ３１，ｂ４０〜ｂ４７及びｂ５６〜ｂ６３の半分の値を有する。また、要素データｎｂ１０〜ｎｂ１５，ｎｂ２４〜ｎｂ２５，ｎｂ２８〜ｎｂ３１，ｎｂ４０〜ｎｂ４３，ｎｂ４６〜ｎｂ４７及びｎｂ５６〜ｎｂ６１は値として０を有する。 Then, the input data processing unit 111 calculates the average of the adjacent data of the element data b08 to b15, b24 to b31, b40 to b47 and b56 to b63, and the element data nb08 to nb15, nb24 to nb31, nb40 to nb47 and Find nb56 to nb63. In this case, the element data nb08, nb09, nb26, nb27nb44, nb45, nb62 and nb63 have half the values of the element data b08 to b15, b24 to b31, b40 to b47 and b56 to b63. Further, the element data nb10 to nb15, nb24 to nb25, nb28 to nb31, nb40 to nb43, nb46 to nb47 and nb56 to nb61 have 0 as a value.

この場合、入力データ処理部１１１は、対角線を表す画像であるボトムデータ２０１を変換する。その場合、変換後のボトムデータ２１０は、図９に示すように、濃淡にわずかな違いが存在する対角線を表す画像となる。 In this case, the input data processing unit 111 converts the bottom data 201, which is an image representing the diagonal line. In that case, the converted bottom data 210 is an image representing a diagonal line in which there is a slight difference in shading, as shown in FIG.

このように、入力データ処理部１１１により変換されることで作成されるボトムデータ２１０は、縦横方向及び斜め方向に変換前のボトムデータ２０１と同じ画像として用いることが可能な画像となる。画像は縦線、横線及び斜め線の組み合わせでほぼ表すことが可能であるため、変換後のボトムデータ２１０は、変換前のボトムデータ２０１と同様の画像として使用可能である。 In this way, the bottom data 210 created by being converted by the input data processing unit 111 becomes an image that can be used as the same image as the bottom data 201 before conversion in the vertical and horizontal directions and the diagonal direction. Since the image can be substantially represented by a combination of vertical lines, horizontal lines, and diagonal lines, the converted bottom data 210 can be used as an image similar to the unconverted bottom data 201.

そして、入力データ処理部１１１は、変換後のボトムデータ２１０を乗算部１１２へ出力する。さらに、入力データ処理部１１１は、重みデータ２２１を用いることを乗算部１１２へ通知する。 Then, the input data processing unit 111 outputs the converted bottom data 210 to the multiplication unit 112. Further, the input data processing unit 111 notifies the multiplication unit 112 that the weight data 221 is used.

ここで、図１における第１層の演算処理層１１においては、入力データ処理部１１１は、外部から入力された入力データ２をボトムデータ２０１として使用するため、ボトムデータ２０１が新フィルタ定義３０１に対応していない場合がある。その場合に、入力データ処理部１１１は、ボトムデータ２０１を新フィルタ定義３０１に合わせるために変換する。これに対して、図１における第２層以降の演算処理層１２〜１３では、前段の演算処理層１０から出力されるトップデータ２０９は既に新フィルタ定義３０１に対応しているので、入力データ処理部１１１は、変換を行わずにそのまま乗算部１１２へボトムデータ２０１を出力することができる。この入力データ処理部１１１が、「変換部」の一例にあたる。 Here, in the arithmetic processing layer 11 of the first layer in FIG. 1, the input data processing unit 111 uses the input data 2 input from the outside as the bottom data 201, so that the bottom data 201 becomes the new filter definition 301. It may not be supported. In that case, the input data processing unit 111 converts the bottom data 201 to match the new filter definition 301. On the other hand, in the arithmetic processing layers 12 to 13 of the second and subsequent layers in FIG. 1, the top data 209 output from the arithmetic processing layer 10 in the previous stage already corresponds to the new filter definition 301, so that the input data processing The unit 111 can output the bottom data 201 to the multiplication unit 112 as it is without performing conversion. This input data processing unit 111 corresponds to an example of a “conversion unit”.

乗算部１１２は、新フィルタ定義３０１を用いない場合、フィルタ定義３０２を使用して作成された重みデータ２０２の使用の通知を入力データ処理部１１１から受ける。さらに、乗算部１１２は、変換を行っていないボトムデータ２０１の入力を受ける。 When the new filter definition 301 is not used, the multiplication unit 112 receives a notification from the input data processing unit 111 of the use of the weight data 202 created by using the filter definition 302. Further, the multiplication unit 112 receives the input of the bottom data 201 that has not been converted.

乗算部１１２は、フィルタ定義３０２を使用して作成された重みデータ２０２とボトムデータ２０１と用いて通常のフォワード畳込演算における各要素データの乗算を行う。そして、乗算部１１２は、乗算結果を加算部１１３へ出力する。 The multiplication unit 112 multiplies each element data in the normal forward convolution operation by using the weight data 202 and the bottom data 201 created by using the filter definition 302. Then, the multiplication unit 112 outputs the multiplication result to the addition unit 113.

また、新フィルタ定義３０１を用いる場合、乗算部１１２は、重みデータ２２１の使用の通知を入力データ処理部１１１から受ける。さらに、乗算部１１２は、新フィルタ定義３０１に対応したボトムデータ２０１又は新フィルタ定義３０１に対応するように変換されたボトムデータ２１０の入力を受ける。そして、乗算部１１２は、入力されたボトムデータ２０１又は２１０と重みデータ２２１とを用いてフォワード畳込演算における各要素データの乗算を行う。 Further, when the new filter definition 301 is used, the multiplication unit 112 receives a notification of the use of the weight data 221 from the input data processing unit 111. Further, the multiplication unit 112 receives the input of the bottom data 201 corresponding to the new filter definition 301 or the bottom data 210 converted so as to correspond to the new filter definition 301. Then, the multiplication unit 112 multiplies each element data in the forward convolution operation by using the input bottom data 201 or 210 and the weight data 221.

例えば、変換後とボトムデータ２１０を用いる場合の乗算方法を、図１０を参照して説明する。図１０は、新フィルタ定義を用いる場合のフォワード畳込演算を説明するための図である。ここでは、重みデータ２２１の１回の移動量であるストライド数が１の場合で説明する。また、以下では、図１０におけるボトムデータ２１０の列が伸びる方向、すなわち縦の方向を「列方向」と言い、行が伸びる方向、すなわち横の方向を「行方向」と言う。 For example, the multiplication method after conversion and when the bottom data 210 is used will be described with reference to FIG. FIG. 10 is a diagram for explaining a forward convolution operation when the new filter definition is used. Here, the case where the number of strides, which is the amount of movement of the weight data 221 at one time, is 1, will be described. Further, in the following, the direction in which the columns of the bottom data 210 in FIG. 10 extend, that is, the vertical direction is referred to as "column direction", and the direction in which the rows extend, that is, the horizontal direction is referred to as "row direction".

乗算部１１２は、図１０に示すボトムデータ２１０の入力を受ける。さらに、乗算部１１２は、図１０に示す重みデータ２２１を重みデータ記憶部１１５から取得する。そして、乗算部１１２は、最初にボトムデータ２１０の１列目に重みデータ２２１の一列目を一致させ、且つ、重みデータ２２１の各要素データがボトムデータ２１０のより若い番号の要素データに重なるように重みデータ２２１を配置する。例えば、図１０の場合、乗算部１１２は、要素データｗ００が要素データｂ０１に一致し、要素データｗ０２が要素データｎｂ０９に一致し、要素データｗ０５が要素データｂ１７に一致するように重みデータ２２１を配置する。そして、乗算部１１２は、ボトムデータ２１０と重みデータ２２１との重なった各要素データ同士を乗算し、各乗算結果を加算部１１３へ出力する。以下では、ボトムデータ２１０上の所定の位置に重みデータ２２１を配置し、重なった各要素データを乗算する計算を「トップデータ２０９の１つの要素データに対する乗算」という。 The multiplication unit 112 receives the input of the bottom data 210 shown in FIG. Further, the multiplication unit 112 acquires the weight data 221 shown in FIG. 10 from the weight data storage unit 115. Then, the multiplication unit 112 first matches the first column of the weight data 221 with the first column of the bottom data 210, and makes each element data of the weight data 221 overlap with the element data of the lower number of the bottom data 210. The weight data 221 is arranged in. For example, in the case of FIG. 10, the multiplication unit 112 sets the weight data 221 so that the element data w00 matches the element data b01, the element data w02 matches the element data nb09, and the element data w05 matches the element data b17. Deploy. Then, the multiplication unit 112 multiplies each overlapping element data of the bottom data 210 and the weight data 221 and outputs each multiplication result to the addition unit 113. In the following, the calculation of arranging the weight data 221 at a predetermined position on the bottom data 210 and multiplying each overlapping element data is referred to as "multiplication of one element data of the top data 209".

次に、乗算部１１２は、ストライド数である１つの要素データ分だけ重みデータ２２１をボトムデータ２１０上で行方向に移動する。そして、乗算部１１２は、移動した位置でトップデータ２０９の１つの要素データに対する乗算を行い、各乗算結果を加算部１１３へ出力する。このように、乗算部１１２は、計算完了後にストライド数ずつ行方向に重みデータ２２１を移動させ、トップデータ２０９の１つの要素データに対する乗算を繰返す。そして、重みデータ２２１が行方向の最後尾まで移動すると、次の計算では、乗算部１１２は、重みデータ２２１を列方向にストライド数である１つの要素データ分だけ移動させ、さらに、行方向の先頭の位置に重みデータ２２１を戻す。そして、乗算部１１２は、行方向に重みデータ２２１を移動させてトップデータ２０９の１つの要素データに対する乗算を繰返す。乗算部１１２は、重みデータ２２１の最下行がボトムデータ２１０の最下行に一致し、且つ、重みデータ２２１がボトムデータ２１０の最後尾に移動するまで、トップデータ２０９の１つの要素データに対する乗算を繰返す。 Next, the multiplication unit 112 moves the weight data 221 in the row direction on the bottom data 210 by one element data which is the number of strides. Then, the multiplication unit 112 performs multiplication on one element data of the top data 209 at the moved position, and outputs each multiplication result to the addition unit 113. In this way, after the calculation is completed, the multiplication unit 112 moves the weight data 221 in the row direction by the number of strides, and repeats the multiplication on one element data of the top data 209. Then, when the weight data 221 moves to the end in the row direction, in the next calculation, the multiplication unit 112 moves the weight data 221 in the column direction by one element data which is the number of strides, and further, in the row direction. The weight data 221 is returned to the head position. Then, the multiplication unit 112 moves the weight data 221 in the row direction and repeats the multiplication on one element data of the top data 209. The multiplication unit 112 multiplies one element data of the top data 209 until the bottom line of the weight data 221 matches the bottom line of the bottom data 210 and the weight data 221 moves to the end of the bottom data 210. Repeat.

例えば、図１０におけるボトムデータ２１０の太線枠で囲まれた位置に重みデータ２２１を配置して計算を行う場合を説明する。ここでは、各要素データの乗算を符号のみで表す。乗算部１１２は、１つのトップデータ２０９に対する乗算として、ｗ００×ｎｂ０９，ｗ０１×ｎｂ１０，ｗ０２×ｂ１７，ｗ０３×ｂ１８，ｗ０４×ｂ１９，ｗ０５×ｎｂ２５及びｗ０６×ｎｂ２６を行う。そして、乗算部１１２は、各乗算結果を加算部１１３へ出力する。 For example, a case where the weight data 221 is arranged at the position surrounded by the thick line frame of the bottom data 210 in FIG. 10 and the calculation is performed will be described. Here, the multiplication of each element data is represented only by a sign. The multiplication unit 112 performs w00 × nb09, w01 × nb10, w02 × b17, w03 × b18, w04 × b19, w05 × nb25 and w06 × nb26 as multiplications for one top data 209. Then, the multiplication unit 112 outputs each multiplication result to the addition unit 113.

加算部１１３は、乗算結果の入力を乗算部１１２から受ける。そして、加算部１１３は、１つのトップデータ２０９に対する乗算の乗算結果それぞれを加算して合計を算出する。以下では、１つのトップデータ２０９に対する乗算の乗算結果の加算を、「トップデータ２０９の１つの要素データに対する加算」という。そして、加算部１１３は、加算結果を出力データ作成部１１４へ出力する。加算部１１３は、乗算部１１２が行ったトップデータ２０９の１つの要素データに対する乗算の全てに対して、トップデータ２０９の１つの要素データに対する加算を繰り返し、加算結果を出力データ作成部１１４へ出力する。 The addition unit 113 receives the input of the multiplication result from the multiplication unit 112. Then, the addition unit 113 adds each of the multiplication results of multiplication to one top data 209 to calculate the total. Hereinafter, the addition of the multiplication result of multiplication to one top data 209 is referred to as "addition to one element data of top data 209". Then, the addition unit 113 outputs the addition result to the output data creation unit 114. The addition unit 113 repeats the addition to one element data of the top data 209 for all the multiplications performed by the multiplication unit 112 to one element data of the top data 209, and outputs the addition result to the output data creation unit 114. To do.

例えば、図１０におけるボトムデータ２１０の太線枠で囲まれた位置に重みデータ２２１を配置された場合について説明する。加算部１１３は、ｗ００×ｎｂ０９，ｗ０１×ｎｂ１０，ｗ０２×ｎ１７，ｗ０３×ｂ１８，ｗ０４×ｂ１９，ｗ０５×ｎｂ２５及びｗ０６×ｎｂ２６の入力を乗算部１１２から受ける。そして、加算部１１３は、ｗ００×ｎｂ０９＋ｗ０１×ｎｂ１０＋ｗ０２×ｂ１７＋ｗ０３×ｂ１８＋ｗ０４×ｂ１９＋ｗ０５×ｎｂ２５＋ｗ０６×ｎｂ２６を算出する。 For example, a case where the weight data 221 is arranged at the position surrounded by the thick line frame of the bottom data 210 in FIG. 10 will be described. The addition unit 113 receives inputs of w00 × nb09, w01 × nb10, w02 × n17, w03 × b18, w04 × b19, w05 × nb25, and w06 × nb26 from the multiplication unit 112. Then, the addition unit 113 calculates w00 × nb09 + w01 × nb10 + w02 × b17 + w03 × b18 + w04 × b19 + w05 × nb25 + w06 × nb26.

出力データ作成部１１４は、トップデータ２０９の１つの要素データに対する加算の加算結果の入力を加算部１１３から受ける。そして、出力データ作成部１１４は、トップデータ２０９の先頭から順に、取得した加算結果の割り当てを繰り返す。例えば、出力データ作成部１１４は、図１０におけるボトムデータ２１０の太線枠で囲まれた位置に重みデータ２２１を配置された場合、取得した加算結果を要素データｔ１８とする。すなわち、ｗ００×ｎｂ０９＋ｗ０１×ｎｂ１０＋ｗ０２×ｎ１７＋ｗ０３×ｎ１８＋ｗ０４×ｎ１９＋ｗ０５×ｎｂ２５＋ｗ０６×ｎｂ２６が、トップデータ２０９の要素データｔ１８にあたる。出力データ作成部１１４は、このように取得した加算結果のトップデータ２０９の各要素データへの割当を繰り返してトップデータ２０９を生成する。そして、出力データ作成部１１４は、生成したトップデータ２０９を活性化処理部１０２へ出力する。以下では、トップデータ２０９の１つの要素データに対する乗算及び加算、並びに、その加算結果のトップデータ２０９の要素データの割当をまとめて、「トップデータ２０９の１つの要素データに対する和積演算」という。乗算部１１２、加算部１１３及び出力データ作成部１１４が、「畳込演算部」の一例にあたる。 The output data creation unit 114 receives the input of the addition result of the addition for one element data of the top data 209 from the addition unit 113. Then, the output data creation unit 114 repeats the allocation of the acquired addition results in order from the beginning of the top data 209. For example, when the weight data 221 is arranged at the position surrounded by the thick line frame of the bottom data 210 in FIG. 10, the output data creation unit 114 uses the acquired addition result as the element data t18. That is, w00 × nb09 + w01 × nb10 + w02 × n17 + w03 × n18 + w04 × n19 + w05 × nb25 + w06 × nb26 corresponds to the element data t18 of the top data 209. The output data creation unit 114 repeatedly assigns the top data 209 of the addition result acquired in this way to each element data to generate the top data 209. Then, the output data creation unit 114 outputs the generated top data 209 to the activation processing unit 102. Hereinafter, the multiplication and addition of the top data 209 to one element data and the allocation of the element data of the top data 209 of the addition result are collectively referred to as "sum product operation for one element data of the top data 209". The multiplication unit 112, the addition unit 113, and the output data creation unit 114 correspond to an example of the “convolution calculation unit”.

ここで、従来のフィルタ定義３０２を用いた場合、畳込演算部１０１は、トップデータ２０９の１つの要素データに対する和積演算において、９回の乗算と９個の乗算結果の加算を行う。これに対して、新フィルタ定義３０１を用いた場合、畳込演算部１０１は、トップデータ２０９の１つの要素データに対する和積演算において、７回の乗算と７個の乗算結果の加算を行う。したがって、乗算数及び加算する値の数共に、新フィルタ定義３０１を用いた場合の方が、従来のフィルタ定義３０２を用いた場合に比べて小さくなる。すなわち、フォワード畳込演算の場合、新フィルタ定義３０１を用いた場合の方が、従来のフィルタ定義３０２を用いた場合に比べて、使用する記憶領域を小さくすることができ、計算効率も向上させることができる。 Here, when the conventional filter definition 302 is used, the convolution calculation unit 101 performs 9 multiplications and addition of 9 multiplication results in the sum product calculation for one element data of the top data 209. On the other hand, when the new filter definition 301 is used, the convolution calculation unit 101 performs 7 multiplications and addition of 7 multiplication results in the sum product calculation for one element data of the top data 209. Therefore, both the number of multiplications and the number of values to be added are smaller when the new filter definition 301 is used than when the conventional filter definition 302 is used. That is, in the case of the forward convolution operation, the storage area used can be smaller and the calculation efficiency can be improved when the new filter definition 301 is used as compared with the case where the conventional filter definition 302 is used. be able to.

図３に戻って説明を続ける。畳込演算部１０６は、活性化処理部１０５により逆正規化処理が施されたデータに対してバックワード畳込演算を行う。ここで、畳込演算部１０６によるバックワード畳込演算についてさらに詳細に説明する。まず、図１１を参照して、バックワード畳込ボトム差分演算について説明する。図１１は、新フィルタ定義を用いる場合のバックワード畳込ボトム差分演算を説明するための図である。 The explanation will be continued by returning to FIG. The convolution calculation unit 106 performs a backward convolution operation on the data that has been denormalized by the activation processing unit 105. Here, the backward convolution operation by the convolution operation unit 106 will be described in more detail. First, the backward convolution bottom difference calculation will be described with reference to FIG. FIG. 11 is a diagram for explaining the backward convolution bottom difference operation when the new filter definition is used.

ここでは、図１０で用いた８×８のボトムデータ２０１を変換したボトムデータ２１０と重みデータ２２１とを用いてフォワード畳込演算を行った場合で説明する。この場合、畳込演算部１０６は、フォワード畳込演算により求められた図１０のトップデータ２０９の配置形状と同じ配置形状、すなわち隔行で１つ要素分ずつずれたデータの配置形状の見た目を有するトップ差分データ２０３の入力を活性化処理部１０５から受ける。ここで、トップ差分データ２０３も、トップデータ２０９の配置形状と同じ配置形状を有する。また、バックワード畳込ボトム差分演算で算出されるボトム差分データ２０５は、ボトムデータ２０１と同じ配置形状を有する。トップ差分データ２０３は、要素データｔｄ００〜ｔｄ６３を有する。また、ボトム差分データ２０５は、要素データｂｄ００〜ｎｂｄ６３を有する。 Here, the case where the forward convolution operation is performed using the bottom data 210 obtained by converting the 8 × 8 bottom data 201 used in FIG. 10 and the weight data 221 will be described. In this case, the convolution calculation unit 106 has the same arrangement shape as the arrangement shape of the top data 209 of FIG. 10 obtained by the forward convolution operation, that is, the appearance of the arrangement shape of the data shifted by one element at intervals. The input of the top difference data 203 is received from the activation processing unit 105. Here, the top difference data 203 also has the same arrangement shape as the arrangement shape of the top data 209. Further, the bottom difference data 205 calculated by the backward convolution bottom difference calculation has the same arrangement shape as the bottom data 201. The top difference data 203 has element data td00 to td63. Further, the bottom difference data 205 has element data bd00 to nbd63.

畳込演算部１０６は、図１１に示すトップ差分データ２０３の入力を受ける。そして、畳込演算部１０６は、最初にトップ差分データ２０３の１列目に重みデータ２２１の一列目を一致させ、且つ、重みデータ２２１の各要素データがトップ差分データ２０３のより若い番号の要素データに重なるように重みデータ２２１を配置する。例えば、図１１の場合、畳込演算部１０６は、要素データｗ００が要素データｔｄ０１に一致し、要素データｗ０２が要素データｔｄ０９に一致し、要素データｗ０５が要素データｔｄ１７に一致するように重みデータ２２１を配置する。そして、畳込演算部１０６は、トップ差分データ２０３と重みデータ２２１との重なった各要素データ同士を乗算する。さらに、畳込演算部１０６は、乗算結果のそれぞれを加算し合計を算出する。そして、畳込演算部１０６は、算出した加算結果をボトム差分データ２０５の要素データｂｄ００とする。 The convolution calculation unit 106 receives the input of the top difference data 203 shown in FIG. Then, the convolution calculation unit 106 first matches the first column of the weight data 221 with the first column of the top difference data 203, and each element data of the weight data 221 is an element having a lower number than the top difference data 203. The weight data 221 is arranged so as to overlap the data. For example, in the case of FIG. 11, the convolution calculation unit 106 uses weight data such that the element data w00 matches the element data td01, the element data w02 matches the element data td09, and the element data w05 matches the element data td17. Place 221. Then, the convolution calculation unit 106 multiplies each overlapping element data of the top difference data 203 and the weight data 221. Further, the convolution calculation unit 106 adds each of the multiplication results to calculate the total. Then, the convolution calculation unit 106 sets the calculated addition result as the element data bd00 of the bottom difference data 205.

次に、畳込演算部１０６は、ストライド数である１つの要素データ分だけ重みデータ２２１をトップ差分データ２０３上で行方向に移動する。そして、畳込演算部１０６は、移動した位置で１つのボトム差分データ２０５に対する乗算を行い、乗算結果を加算して合計を算出する。このように、畳込演算部１０６は、計算完了後にストライド数ずつ行方向に重みデータ２２１を移動させ、乗算及び加算を繰返す。そして、重みデータ２２１が行方向の最後尾まで移動すると、次の計算では、畳込演算部１０６は、重みデータ２２１を列方向にストライド数である１つの要素データ分だけ移動させ、さらに、行方向の先頭の位置に重みデータ２２１を戻す。そして、畳込演算部１０６は、行方向に重みデータ２２１を移動させつつ乗算及び加算を繰返す。畳込演算部１０６は、重みデータ２２１の最下行がトップ差分データ２０３の最下行に一致し、且つ、重みデータ２２１がトップ差分データ２０３の最後尾に移動するまで、乗算及び加算を繰返す。そして、畳込演算部１０６は、乗算及び加算結果をボトム差分データ２０５の要素データｂ０１〜ｎｂ６３の番号順に割り当てていく。以下では、トップ差分データ２０３上の所定の位置に重みデータ２２１を配置した状態での、乗算及び加算、並びに、ボトム差分データ２０５の要素データｂ００〜ｎｂ６３に割り当てる演算を、まとめて「ボトム差分データ２０５の１つの要素データに対する和積演算」という。 Next, the convolution calculation unit 106 moves the weight data 221 in the row direction on the top difference data 203 by the amount of one element data which is the number of strides. Then, the convolution calculation unit 106 multiplies one bottom difference data 205 at the moved position, adds the multiplication results, and calculates the total. In this way, the convolution calculation unit 106 moves the weight data 221 in the row direction by the number of strides after the calculation is completed, and repeats multiplication and addition. Then, when the weight data 221 moves to the end in the row direction, in the next calculation, the convolution calculation unit 106 moves the weight data 221 in the column direction by one element data which is the number of strides, and further, the row The weight data 221 is returned to the position at the beginning of the direction. Then, the convolution calculation unit 106 repeats multiplication and addition while moving the weight data 221 in the row direction. The convolution calculation unit 106 repeats multiplication and addition until the bottom line of the weight data 221 matches the bottom line of the top difference data 203 and the weight data 221 moves to the end of the top difference data 203. Then, the convolution calculation unit 106 allocates the multiplication and addition results in the numerical order of the element data b01 to nb63 of the bottom difference data 205. In the following, multiplication and addition in a state where the weight data 221 is arranged at a predetermined position on the top difference data 203, and operations assigned to the element data b00 to nb63 of the bottom difference data 205 are collectively referred to as "bottom difference data". It is called "sum product operation for one element data of 205".

例えば、図１１におけるトップ差分データ２０３の太線枠で囲まれた位置に重みデータ２２１を配置して計算を行う場合を説明する。ここでは、各要素データの乗算を符号のみで表す。畳込演算部１０６は、ボトム差分データ２０５の１つの要素データに対する和積演算として、ｗ００×ｔｄ０９＋ｗ０１×ｔｄ１０＋ｗ０２×ｔｄ１７＋ｗ０３×ｔｄ１８＋ｗ０４×ｔｄ１９＋ｗ０５×ｔｄ２５＋ｗ０６×ｔｄ２６を要素データｂｄ１８とする。 For example, a case where the weight data 221 is arranged at the position surrounded by the thick line frame of the top difference data 203 in FIG. 11 and the calculation is performed will be described. Here, the multiplication of each element data is represented only by a sign. The convolution calculation unit 106 sets w00 × td09 + w01 × td10 + w02 × td17 + w03 × td18 + w04 × td19 + w05 × td25 + w06 × td26 as element data bd18 as a sum product operation for one element data of the bottom difference data 205.

ここで、従来のフィルタ定義３０２を用いた場合、畳込演算部１０１は、ボトム差分データ２０５の１つの要素データに対する和積演算において、９回の乗算と９個の乗算結果の加算を行う。これに対して、新フィルタ定義３０１を用いた場合、畳込演算部１０１は、ボトム差分データ２０５の１つの要素データに対する和積演算において、７回の乗算と７個の乗算結果の加算を行う。したがって、乗算数及び加算する値の数共に、新フィルタ定義３０１を用いた場合の方が、従来のフィルタ定義３０２を用いた場合に比べて小さくなる。すなわち、バックワード畳込ボトム差分演算の場合も、新フィルタ定義３０１を用いた場合の方が、従来のフィルタ定義３０２を用いた場合に比べて、使用する記憶領域を小さくすることができ、計算効率も向上させることができる。 Here, when the conventional filter definition 302 is used, the convolution calculation unit 101 performs 9 multiplications and addition of 9 multiplication results in the sum product calculation for one element data of the bottom difference data 205. On the other hand, when the new filter definition 301 is used, the convolution calculation unit 101 performs 7 multiplications and addition of 7 multiplication results in the sum product calculation for one element data of the bottom difference data 205. .. Therefore, both the number of multiplications and the number of values to be added are smaller when the new filter definition 301 is used than when the conventional filter definition 302 is used. That is, even in the case of the backward convolution bottom difference calculation, the storage area used can be smaller when the new filter definition 301 is used than when the conventional filter definition 302 is used, and the calculation can be performed. Efficiency can also be improved.

次に、図１２を参照して、バックワード畳込重み差分演算を説明する。図１２は、新フィルタ定義を用いる場合のバックワード畳込重み差分演算を説明するための図である。フォワード畳込重差分演算で算出される重み差分データ２０４は、重みデータ２２１の配置形状と同じ配置形状を有する。重み差分データ２０４は、要素データｗｄ００〜ｗｄ０７を有する。 Next, the backward convolution weight difference calculation will be described with reference to FIG. FIG. 12 is a diagram for explaining the backward convolution weight difference calculation when the new filter definition is used. The weight difference data 204 calculated by the forward convolution weight difference calculation has the same arrangement shape as the arrangement shape of the weight data 221. The weight difference data 204 has element data wd00 to wd07.

畳込演算部１０６は、フォワード畳込演算で使用したボトムデータ２１０を取得する。また、畳込演算部１０６は、図１２に示すトップ差分データ２０３の入力を受ける。次に、畳込演算部１０６は、ボトムデータ２１０が重み差分データ２０４を算出するサイズを有するか否かを判定する。サイズが小さい場合、畳込演算部１０６は、ボトムデータ２１０の周りに値が０の要素データ２１２を付加する。以下では、要素データ２１２を付加したボトムデータ２１０を単にボトムデータ２１０という。 The convolution calculation unit 106 acquires the bottom data 210 used in the forward convolution calculation. Further, the convolution calculation unit 106 receives the input of the top difference data 203 shown in FIG. Next, the convolution calculation unit 106 determines whether or not the bottom data 210 has a size for calculating the weight difference data 204. When the size is small, the convolution calculation unit 106 adds element data 212 having a value of 0 around the bottom data 210. In the following, the bottom data 210 to which the element data 212 is added is simply referred to as the bottom data 210.

次に、畳込演算部１０６は、最初にボトムデータ２１０の１列目にトップ差分データ２０３の一列目を一致させ、且つ、トップ差分データ２０３の各要素データがボトムデータ２１０のより若い番号の要素データに重なるようにトップ差分データ２０３を配置する。例えば、畳込演算部１０６は、ボトムデータ２１０の太線枠に一致するようにトップ差分データ２０３を配置する。そして、畳込演算部１０６は、ボトムデータ２１０とトップ差分データ２０３との重なった各要素データ同士を乗算する。さらに、畳込演算部１０６は、乗算結果のそれぞれを加算し合計を算出する。そして、畳込演算部１０６は、算出した加算結果を重み差分データ２０４の要素データｗ００とする。 Next, the convolution calculation unit 106 first matches the first column of the top difference data 203 with the first column of the bottom data 210, and each element data of the top difference data 203 has a lower number of the bottom data 210. The top difference data 203 is arranged so as to overlap the element data. For example, the convolution calculation unit 106 arranges the top difference data 203 so as to match the thick line frame of the bottom data 210. Then, the convolution calculation unit 106 multiplies each overlapping element data of the bottom data 210 and the top difference data 203. Further, the convolution calculation unit 106 adds each of the multiplication results to calculate the total. Then, the convolution calculation unit 106 sets the calculated addition result as the element data w00 of the weight difference data 204.

次に、畳込演算部１０６は、ストライド数である１つの要素データ分だけトップ差分データ２０３をボトムデータ２１０上で行方向に移動する。そして、畳込演算部１０６は、移動した位置で要素データ同士の乗算を行い、乗算結果を加算して合計を算出する。このように、畳込演算部１０６は、計算完了後にストライド数ずつ行方向にトップ差分データ２０３を移動させ、乗算及び加算を繰返す。そして、トップ差分データ２０３が行方向の最後尾まで移動すると、次の計算では、畳込演算部１０６は、トップ差分データ２０３を列方向にストライド数である１つの要素データ分だけ移動させ、さらに、行方向の先頭の位置にトップ差分データ２０３を戻す。そして、畳込演算部１０６は、行方向にトップ差分データ２０３を移動させつつ乗算及び加算を繰返す。畳込演算部１０６は、トップ差分データ２０３の最下行がボトムデータ２１０の最下行に一致し、且つ、トップ差分データ２０３がボトムデータ２１０の最後尾に移動するまで、乗算及び加算を繰返す。そして、畳込演算部１０６は、乗算及び加算結果を重み差分データ２０４の要素データｗ０１〜ｗ０７の番号順に割り当てていく。以下では、ボトムデータ２１０上の所定の位置にトップ差分データ２０３を配置した状態での、乗算及び加算、並びに、重み差分データ２０４の要素データｗ００〜ｗ０７に割り当てる演算を、まとめて「重み差分データ２０４の１つの要素データに対する和積演算」という。 Next, the convolution calculation unit 106 moves the top difference data 203 in the row direction on the bottom data 210 by one element data which is the number of strides. Then, the convolution calculation unit 106 multiplies the element data at the moved position, adds the multiplication results, and calculates the total. In this way, the convolution calculation unit 106 moves the top difference data 203 in the row direction by the number of strides after the calculation is completed, and repeats multiplication and addition. Then, when the top difference data 203 moves to the end in the row direction, in the next calculation, the convolution calculation unit 106 moves the top difference data 203 in the column direction by one element data which is the number of strides, and further. , The top difference data 203 is returned to the first position in the row direction. Then, the convolution calculation unit 106 repeats multiplication and addition while moving the top difference data 203 in the row direction. The convolution calculation unit 106 repeats multiplication and addition until the bottom line of the top difference data 203 matches the bottom line of the bottom data 210 and the top difference data 203 moves to the end of the bottom data 210. Then, the convolution calculation unit 106 allocates the multiplication and addition results in the numerical order of the element data w01 to w07 of the weight difference data 204. In the following, multiplication and addition, and operations assigned to the element data w00 to w07 of the weight difference data 204 in a state where the top difference data 203 is arranged at a predetermined position on the bottom data 210 are collectively referred to as "weight difference data". It is called "sum product operation for one element data of 204".

例えば、図１２におけるボトムデータ２１０の太線枠で囲まれた位置にトップ差分データ２０３を配置して計算を行う場合を説明する。ここでは、各要素データの乗算を符号のみで表す。畳込演算部１０６は、重み差分データ２０４の１つの要素データに対する和積演算として、以下の計算を行う。畳込演算部１０６は、ｔｄ００×０＋・・・＋ｔｄ０７×０＋ｔｄ０８×ｂ００＋・・・＋ｔｄ１５×ｂ０７＋ｔｄ１６×０＋ｔｄ１７×ｎｂ０８＋・・・＋ｔｄ２３×ｎｂ１４＋・・・＋ｔｄ５６×ｂ４８＋・・・ｔｄ６３×ｂ５５を算出する。そして、畳込演算部１０６は、演算結果を要素データｗｄ００とする。 For example, a case where the top difference data 203 is arranged at the position surrounded by the thick line frame of the bottom data 210 in FIG. 12 and the calculation is performed will be described. Here, the multiplication of each element data is represented only by a sign. The convolution calculation unit 106 performs the following calculation as a sum product calculation for one element data of the weight difference data 204. The convolution calculation unit 106 calculates td00 × 0 + ... + td07 × 0 + td08 × b00 + ... + td15 × b07 + td16 × 0 + td17 × nb08 + ・・・ + td23 × nb14 + ・・・ + td56 × b48 + ... Then, the convolution calculation unit 106 sets the calculation result as the element data wd00.

ここで、従来のフィルタ定義３０２を用いた場合、畳込演算部１０１は、重み差分データ２０４の１つの要素データに対する和積演算を９回行う。これに対して、新フィルタ定義３０１を用いた場合、畳込演算部１０１は、重み差分データ２０４の１つの要素データに対する和積演算は７回で済む。したがって、新フィルタ定義３０１を用いた場合の方が、従来のフィルタ定義３０２を用いた場合に比べて計算回数を減らすことができる。すなわち、バックワード畳込重み差分演算の場合も、新フィルタ定義３０１を用いた場合の方が、従来のフィルタ定義３０２を用いた場合に比べて、使用する記憶領域を小さくすることができ、計算効率も向上させることができる。 Here, when the conventional filter definition 302 is used, the convolution calculation unit 101 performs the sum product calculation on one element data of the weight difference data 204 nine times. On the other hand, when the new filter definition 301 is used, the convolution calculation unit 101 only needs to perform the sum product operation for one element data of the weight difference data 204 seven times. Therefore, the number of calculations can be reduced when the new filter definition 301 is used as compared with the case where the conventional filter definition 302 is used. That is, even in the case of the backward convolution weight difference calculation, the storage area used can be made smaller when the new filter definition 301 is used than when the conventional filter definition 302 is used, and the calculation can be performed. Efficiency can also be improved.

図３に戻って説明を続ける。活性化処理部１０２は、畳込演算部１０１から出力されたトップデータを正規化する。プーリング処理部１０３は、活性化処理部１０２により正規化されたトップデータに対して要素データの間引や統合を行うことで、微小な位置変化に対して応答を不変化する。このプーリング処理部１０３が行う処理をプーリング処理という。そして、プーリング処理部１０３は、プーリング処理を施したトップデータを次の段の演算処理層１０へ出力する。プーリング処理部１０３は、データに加えた処理を表すタグ１５１をプーリング処理部１０４へ出力する。 The explanation will be continued by returning to FIG. The activation processing unit 102 normalizes the top data output from the convolution calculation unit 101. The pooling processing unit 103 does not change the response to a minute position change by thinning out or integrating the element data with respect to the top data normalized by the activation processing unit 102. The process performed by the pooling process unit 103 is called a pooling process. Then, the pooling processing unit 103 outputs the top data subjected to the pooling processing to the arithmetic processing layer 10 of the next stage. The pooling processing unit 103 outputs a tag 151 representing the processing added to the data to the pooling processing unit 104.

プーリング処理部１０４は、実施した応答のプーリング処理を表すタグ１５１の入力をプーリング処理部１０３から受ける。また、プーリング処理部１０４は、後段の演算処理層１０からボトム差分データの入力を受ける。そして、プーリング処理部１０４は、取得したボトム差分データにタグ１５１により特定されるプーリング処理の逆処理を施す。このプーリング処理部１０４が行う処理を逆プーリング処理という。活性化処理部１０５は、プーリング処理部１０４により逆プーリング処理が施されたデータに対して活性化処理を施す。 The pooling processing unit 104 receives an input of a tag 151 representing the pooling processing of the executed response from the pooling processing unit 103. Further, the pooling processing unit 104 receives input of bottom difference data from the arithmetic processing layer 10 in the subsequent stage. Then, the pooling processing unit 104 performs the reverse processing of the pooling processing specified by the tag 151 on the acquired bottom difference data. The process performed by the pooling process unit 104 is called a reverse pooling process. The activation processing unit 105 performs activation processing on the data that has been reverse pooled by the pooling processing unit 104.

さらに、以上では、演算処理装置１の学習時の動作について説明したが、演算処理装置１は、学習により取得した重みデータ２０２を用いて入力データ２の認識を行う。そこで、各演算処理層１０における認識の処理について説明する。 Further, although the operation of the arithmetic processing unit 1 at the time of learning has been described above, the arithmetic processing unit 1 recognizes the input data 2 by using the weight data 202 acquired by the learning. Therefore, the recognition process in each arithmetic processing layer 10 will be described.

畳込演算部１０１は、ボトムデータの入力を受ける。そして、学習で取得した重みデータを使用してフォワード畳込演算を行う。そして、活性化処理部１０２及びプーリング処理部１０３は、トップデータに対して正規化などのプーリング処理を行う。その後、プーリング処理部１０３は、処理を施したトップデータを次の演算処理層１０へ出力する。このようなフォワード畳込演算を各演算処理層１０で繰返して、演算処理装置１は、最終的に認識用の出力データ３を取得する。 The convolution calculation unit 101 receives the input of bottom data. Then, the forward convolution operation is performed using the weight data acquired in the learning. Then, the activation processing unit 102 and the pooling processing unit 103 perform pooling processing such as normalization on the top data. After that, the pooling processing unit 103 outputs the processed top data to the next arithmetic processing layer 10. By repeating such a forward convolution operation in each arithmetic processing layer 10, the arithmetic processing unit 1 finally acquires the output data 3 for recognition.

次に、図１３を参照して、新フィルタ定義３０１を使用する場合の演算処理層における処理の流れについて説明する。図１３は、新フィルタ定義を使用する場合の演算処理層における処理のフローチャートである。 Next, with reference to FIG. 13, the processing flow in the arithmetic processing layer when the new filter definition 301 is used will be described. FIG. 13 is a flowchart of processing in the arithmetic processing layer when the new filter definition is used.

第１層の演算処理層１１における入力データ処理部１１１は、入力データ２に対して隔行の隣り合う要素データの平均を算出して一方の要素データとして、新フィルタ定義３０１に合うボトムデータ２１０を生成する（ステップＳ１）。そして、入力データ処理部１１１は、ボトムデータ２１０を乗算部１１２へ出力する。 The input data processing unit 111 in the arithmetic processing layer 11 of the first layer calculates the average of adjacent element data in intervals with respect to the input data 2, and uses the bottom data 210 that matches the new filter definition 301 as one element data. Generate (step S1). Then, the input data processing unit 111 outputs the bottom data 210 to the multiplication unit 112.

乗算部１１２は、ボトムデータ２１０の入力を入力データ処理部１１１から受ける。そして、乗算部１１２、加算部１１３及び出力データ作成部１１４は、トップデータ２０９の１つの要素データに対する和積演算を繰返してフォワード畳込演算を行う（ステップＳ２）。そして、出力データ作成部１１４は、演算結果であるトップデータ２０９を出力する。 The multiplication unit 112 receives the input of the bottom data 210 from the input data processing unit 111. Then, the multiplication unit 112, the addition unit 113, and the output data creation unit 114 repeat the sum product operation on one element data of the top data 209 to perform the forward convolution operation (step S2). Then, the output data creation unit 114 outputs the top data 209 which is the calculation result.

活性化処理部１０２及びプーリング処理部１０３は、出力データ作成部１１４から出力されたトップデータ２０９に対して正規化を施すプーリング処理といったフォワード他処理演算を行う（ステップＳ３）。そして、プーリング処理部１０３は、処理を施したデータを第２層の演算処理層１２へ出力する。 The activation processing unit 102 and the pooling processing unit 103 perform forward and other processing operations such as a pooling process for normalizing the top data 209 output from the output data creation unit 114 (step S3). Then, the pooling processing unit 103 outputs the processed data to the arithmetic processing layer 12 of the second layer.

第２〜第ｎ−１層の演算処理層１０及び第ｎ層の演算処理層１３は、フォワード畳込演算及びフォワード他処理演算を含む同様の処理を実行する（ステップＳ４）。 The arithmetic processing layer 10 of the second to second n-1 layers and the arithmetic processing layer 13 of the nth layer execute the same processing including the forward convolution operation and the forward other processing operation (step S4).

次に、第ｎ層の演算処理層１３は、出力データ３と期待値２０７とを比較する（ステップＳ５）。 Next, the arithmetic processing layer 13 of the nth layer compares the output data 3 with the expected value 207 (step S5).

次に、第ｎ層の演算処理層１３のプーリング処理部１０４及び活性化処理部１０５は、比較結果に対して、逆プーリング処理を含むバックワード他処理演算を行う（ステップＳ６）。そして、活性化処理部１０５は、処理を施したデータをトップ差分データ２０３として畳込演算部１０６へ出力する。 Next, the pooling processing unit 104 and the activation processing unit 105 of the arithmetic processing layer 13 of the nth layer perform backward other processing operations including the reverse pooling processing on the comparison result (step S6). Then, the activation processing unit 105 outputs the processed data as the top difference data 203 to the convolution calculation unit 106.

次に、第ｎ層の演算処理層１３の畳込演算部１０６は、トップ差分データ２０３の入力を活性化処理部１０５から受ける。そして、畳込演算部１０６は、トップ差分データ２０３、重みデータ２０２及びボトムデータ２１０を用いてバックワード畳込演算を行う（ステップＳ７）。畳込演算部１０６は、重みデータ２０２を更新する。さらに、畳込演算部１０６は、求めたボトム差分データ２０５を第ｎ−１層の演算処理層１０へ出力する。 Next, the convolution calculation unit 106 of the calculation processing layer 13 of the nth layer receives the input of the top difference data 203 from the activation processing unit 105. Then, the convolution calculation unit 106 performs a backward convolution calculation using the top difference data 203, the weight data 202, and the bottom data 210 (step S7). The convolution calculation unit 106 updates the weight data 202. Further, the convolution calculation unit 106 outputs the obtained bottom difference data 205 to the calculation processing layer 10 of the n-1th layer.

第ｎ−１〜３層の演算処理層１０、第２層の演算処理層１２及び第１層の演算処理層１１は、バックワード他処理演算及びバックワード畳込演算を含む同様の処理を実行する（ステップＳ８）。これにより、第ｎ−１〜３層の演算処理層１０、第２層の演算処理層１２及び第１層の演算処理層１１の重みデータ２０２が更新される。 The arithmetic processing layer 10 of the first to third layers, the arithmetic processing layer 12 of the second layer, and the arithmetic processing layer 11 of the first layer execute the same processing including the backward other processing arithmetic and the backward convolution operation. (Step S8). As a result, the weight data 202 of the arithmetic processing layer 10 of the first to third layers, the arithmetic processing layer 12 of the second layer, and the arithmetic processing layer 11 of the first layer is updated.

次に、図１４を参照して、畳込演算部１０１によるフォワード畳込演算の流れについて説明する。図１４は、実施例１に係る畳込演算部によるフォワード畳込演算のフローチャートである。 Next, with reference to FIG. 14, the flow of the forward convolution calculation by the convolution calculation unit 101 will be described. FIG. 14 is a flowchart of the forward convolution calculation by the convolution calculation unit according to the first embodiment.

入力データ処理部１１１は、新フィルタ定義３０１を使用するか否かを判定する（ステップＳ１０１）。新フィルタ定義３０１を使用しない場合（ステップＳ１０１：否定）、入力データ処理部１１１は、入力データをそのままボトムデータ２０１として乗算部１１２へ出力する。乗算部１１２、加算部１１３及び出力データ作成部１１４は、通常のフォワード畳込演算を実行し（ステップＳ１０２）、フォワード畳込演算を終了する。 The input data processing unit 111 determines whether or not to use the new filter definition 301 (step S101). When the new filter definition 301 is not used (step S101: negation), the input data processing unit 111 outputs the input data as it is to the multiplication unit 112 as bottom data 201. The multiplication unit 112, the addition unit 113, and the output data creation unit 114 execute a normal forward convolution operation (step S102), and end the forward convolution operation.

これに対して、新フィルタ定義３０１を使用する場合（ステップＳ１０１：肯定）、入力データ処理部１１１は、新フィルタ定義３０１に対応する重みデータ２２１を重みデータ記憶部１１５から取得する（ステップＳ１０３）。 On the other hand, when the new filter definition 301 is used (step S101: affirmative), the input data processing unit 111 acquires the weight data 221 corresponding to the new filter definition 301 from the weight data storage unit 115 (step S103). ..

次に、入力データ処理部１１１は、入力データが新フィルタ定義３０１に対応するか否かを判定する（ステップＳ１０４）。入力データが新フィルタ定義３０１に対応しない場合（ステップＳ１０４：否定）、入力データ処理部１１１は、前層における処理結果の入力データを隔行で平均化したデータを演算に使用するボトムデータ２０１とする（ステップＳ１０５）。 Next, the input data processing unit 111 determines whether or not the input data corresponds to the new filter definition 301 (step S104). When the input data does not correspond to the new filter definition 301 (step S104: negation), the input data processing unit 111 uses the data obtained by averaging the input data of the processing results in the previous layer at intervals as the bottom data 201 used for the calculation. (Step S105).

入力データが新フィルタ定義３０１に対応する場合（ステップＳ１０４：肯定）、入力データ処理部１１１は、前層における処理結果の入力データをそのまま演算に使用するボトムデータ２０１とする（ステップＳ１０６）。 When the input data corresponds to the new filter definition 301 (step S104: affirmative), the input data processing unit 111 uses the input data of the processing result in the previous layer as the bottom data 201 (step S106).

入力データ処理部１１１は、新フィルタ定義３０１に対応するボトムデータ２０１を乗算部１１２へ出力する。乗算部１１２、加算部１１３及び出力データ作成部１１４は、入力されたボトムデータ２０１と新フィルタ定義３０１に対応した重みデータ２２１とを用いてフォワード畳込演算を実行する（ステップＳ１０７）。 The input data processing unit 111 outputs the bottom data 201 corresponding to the new filter definition 301 to the multiplication unit 112. The multiplication unit 112, the addition unit 113, and the output data creation unit 114 execute a forward convolution operation using the input bottom data 201 and the weight data 221 corresponding to the new filter definition 301 (step S107).

次に、図１５を参照して、畳込演算部１０６によるバックワード畳込演算の流れについて説明する。図１５は、実施例１に係る畳込演算部によるバックワード畳込演算のフローチャートである。 Next, with reference to FIG. 15, the flow of the backward convolution operation by the convolution operation unit 106 will be described. FIG. 15 is a flowchart of a backward convolution calculation by the convolution calculation unit according to the first embodiment.

畳込演算部１０６は、新フィルタ定義３０１を使用するか否かを判定する（ステップＳ２０１）。新フィルタ定義３０１を使用しない場合（ステップＳ２０１：否定）、畳込演算部１０６は、入力データをそのままボトムデータ２０１として、通常のバックワード畳込演算を実行し（ステップＳ２０２）、フォワード畳込演算を終了する。 The convolution calculation unit 106 determines whether or not to use the new filter definition 301 (step S201). When the new filter definition 301 is not used (step S201: negation), the convolution calculation unit 106 executes a normal backward convolution operation (step S202) using the input data as the bottom data 201 as it is, and performs a forward convolution operation. To finish.

これに対して、新フィルタ定義３０１を使用する場合（ステップＳ２０１：肯定）、畳込演算部１０６は、逆伝播方向の最初の層か否かを判定する（ステップＳ２０３）。逆伝播方向の最初の層の場合（ステップＳ２０３：肯定）、畳込演算部１０６は、フォワード演算による出力データ３と期待値２０７との差分に対してバックワード他処理が施されたデータをトップ差分データ２０３として取得する（ステップＳ２０４）。 On the other hand, when the new filter definition 301 is used (step S201: affirmative), the convolution calculation unit 106 determines whether or not it is the first layer in the back propagation direction (step S203). In the case of the first layer in the back propagation direction (step S203: affirmative), the convolution calculation unit 106 tops the data to which backward processing or the like is applied to the difference between the output data 3 by the forward calculation and the expected value 207. Acquired as difference data 203 (step S204).

これに対して、逆伝播方向の最初の層以外の層の場合（ステップＳ２０３：否定）、畳込演算部１０６は、前層から出力されたボトム差分データ２０５に対してバックワード他処理が施されたデータをトップ差分データ２０３として取得する（ステップＳ２０５）。 On the other hand, in the case of a layer other than the first layer in the back propagation direction (step S203: negation), the convolution calculation unit 106 performs backward and other processing on the bottom difference data 205 output from the previous layer. The obtained data is acquired as the top difference data 203 (step S205).

そして、畳込演算部１０６は、ボトムデータ２０１、新フィルタ定義３０１を使用した重みデータ２２１及びトップ差分データ２０３を用いてバックワード重み差分演算及びバックワードボトム差分演算を実行する（ステップＳ２０６）。 Then, the convolution calculation unit 106 executes the backward weight difference calculation and the backward bottom difference calculation using the bottom data 201, the weight data 221 using the new filter definition 301, and the top difference data 203 (step S206).

以上に説明したように、本実施例に係る演算処理装置は、従来の正方形の行列のフィルタ定義よりも要素データの数が少ない新フィルタ定義を用いてフォワード畳込演算及びバックワード畳込演算を行う。次の表はフィルタサイズに応じた従来のフィルタ定義と新フィルタ定義との演算量の比を表す表である。ここで、新フィルタ定義は、中央の行から端の行に向かって１つずつ要素データを減らし、且つ、各行の半分の位置が要素データを減らす前の半分の位置に一致するようにずらすことで生成される定義である。 As described above, the arithmetic processing apparatus according to the present embodiment performs forward convolution operations and backward convolution operations using a new filter definition in which the number of element data is smaller than that of the conventional filter definition of a square matrix. Do. The following table shows the ratio of the amount of calculation between the conventional filter definition and the new filter definition according to the filter size. Here, the new filter definition is to reduce the element data one by one from the center row to the edge row, and shift the half position of each row so that it matches the half position before reducing the element data. Is the definition generated by.

このように、本実施例に係る演算処理装置は、フォワード畳込演算及びバックワード畳込演算における演算量を削減することができる。ここで、本実施例に係る演算処理装置は、入力データを変換する演算を行うが、入力データを変換する演算数は、畳込演算において削減される演算数より少ないため、演算量を低減することができる。また、本実施例に係る演算処理装置は、データ量削減によってメモリスループットの削減にも寄与することができる。高速フーリエ変換による高速化手法を用いるための条件を満たさないフィルタを用いる場合でも、本実施例に係る演算処理装置は、フォワード畳込演算及びバックワード畳込演算における演算量を削減することができる。したがって、本実施例に係る演算処理装置は、深層学習の演算において、使用する記憶装置の容量を抑えつつ演算効率を向上させることができる。 As described above, the arithmetic processing unit according to the present embodiment can reduce the amount of arithmetic operations in the forward convolution operation and the backward convolution operation. Here, the arithmetic processing apparatus according to the present embodiment performs an operation for converting the input data, but the number of operations for converting the input data is smaller than the number of operations reduced in the convolution operation, so that the amount of calculation is reduced. be able to. In addition, the arithmetic processing unit according to this embodiment can also contribute to the reduction of memory throughput by reducing the amount of data. Even when a filter that does not satisfy the conditions for using the high-speed method by the fast Fourier transform is used, the arithmetic processing unit according to the present embodiment can reduce the amount of arithmetic operations in the forward convolution operation and the backward convolution operation. .. Therefore, the arithmetic processing unit according to the present embodiment can improve the arithmetic efficiency while suppressing the capacity of the storage device used in the arithmetic of deep learning.

特に、３×３のサイズのフィルタは、深層学習では多用されるフィルタであり、その３×３のサイズのフィルタにおいても実施例で説明したように演算数が削減される。 In particular, a filter having a size of 3 × 3 is a filter often used in deep learning, and a filter having a size of 3 × 3 also reduces the number of operations as described in the embodiment.

また、本実施例に係る演算処理装置では、重みデータを小さくすることができ、フォワード畳込演算及びバックワード畳込演算におけるデータ量を少なく抑えることができる。 Further, in the arithmetic processing unit according to the present embodiment, the weight data can be reduced, and the amount of data in the forward convolution operation and the backward convolution operation can be suppressed to be small.

次に、実施例２について説明する。本実施例に係る演算処理装置は、新フィルタ定義に合わせてボトムデータを変換した場合に、そのボトムデータを用いて算出されたデータをそのまま使用してプーリング処理を行う。本実例に係る演算処理装置も、図１及び２で表される。以下では、実施例１と同様の各部の機能については説明を省略する。 Next, Example 2 will be described. When the bottom data is converted according to the new filter definition, the arithmetic processing unit according to the present embodiment performs the pooling process by using the data calculated using the bottom data as it is. The arithmetic processing unit according to this example is also shown in FIGS. 1 and 2. In the following, description of the functions of the same parts as in the first embodiment will be omitted.

図１６は、実施例２に係るプーリング処理部によるストライド数が２の場合のプーリング処理を説明するための図である。 FIG. 16 is a diagram for explaining a pooling process when the number of strides by the pooling process unit according to the second embodiment is 2.

プーリング処理部１０３は、畳込演算部１０１が出力したトップデータ２０９に対して活性化処理部１０２により正規化されたデータの入力を受ける。ここでは、８×８のボトムデータ２０１及び新フィルタ定義３０１を用いてフォワード畳込演算が行われた場合で説明する。すなわち、プーリング処理部１０３は、図１６に示すデータ４０１の入力を受ける。ここでは、データ４０１は、要素データｉ００〜ｉ６３を有する。要素データｉ００〜ｉ６３は、それぞれトップデータ２０９の要素データｔ００〜ｔ６３に対応する。 The pooling processing unit 103 receives the input of the data normalized by the activation processing unit 102 to the top data 209 output by the convolution calculation unit 101. Here, the case where the forward convolution operation is performed using the 8 × 8 bottom data 201 and the new filter definition 301 will be described. That is, the pooling processing unit 103 receives the input of the data 401 shown in FIG. Here, the data 401 has element data i00 to i63. The element data i00 to i63 correspond to the element data t00 to t63 of the top data 209, respectively.

プーリング処理部１０３は、図１６のデータ４０１上に示した太線枠４１１をプーリングサイズとして記憶する。そして、プーリング処理部１０３は、最初に、太線枠４１１の上の行がデータ４０１の１行目の最も若番の要素データに一致するように配置する。そして、太線枠４１１に含まれる要素データｉ００，ｉ０１，ｉ０８及びｉ０９を取得し、取得した要素データの平均や最大値の選択などのプーリング処理を行い値を取得する。そして、プーリング処理部１０３は、取得した値を出力するデータ４０２の要素データｐ００とする。 The pooling processing unit 103 stores the thick line frame 411 shown on the data 401 in FIG. 16 as the pooling size. Then, the pooling processing unit 103 is first arranged so that the line above the thick line frame 411 matches the youngest element data in the first line of the data 401. Then, the element data i00, i01, i08 and i09 included in the thick line frame 411 are acquired, and the pooling process such as selection of the average or maximum value of the acquired element data is performed to acquire the value. Then, the pooling processing unit 103 sets the element data p00 of the data 402 that outputs the acquired value.

次に、プーリング処理部１０３は、太線枠４１１を要素データ２つ分だけ行方向に進めながら、プーリング処理を行い値を取得していく。そして、太線枠４１１がデータ４１０の行の最後尾に達すると、プーリング処理部１０３は、要素データ２つ分だけ列方向に太線枠４１１を移動し、且つ、行の先頭に太線枠４１１を戻す。その後、プーリング処理部１０３は、太線枠４１１を要素データ２つ分だけ行方向に進めながら、同様のプーリング処理を繰返して値を取得する処理を、太線枠４１１の下の行がデータ４０１の一番下の行の最後尾に達するまで繰返す。そして、プーリング処理部１０３は、取得した値をそれぞれ出力するデータ４０２の要素データｐ０１〜ｐ１５としていく。 Next, the pooling processing unit 103 performs pooling processing and acquires a value while advancing the thick line frame 411 in the row direction by two element data. Then, when the thick line frame 411 reaches the end of the row of the data 410, the pooling processing unit 103 moves the thick line frame 411 in the column direction by two element data and returns the thick line frame 411 to the beginning of the row. .. After that, the pooling processing unit 103 repeats the same pooling process while advancing the thick line frame 411 in the row direction by two element data, and the line below the thick line frame 411 is one of the data 401. Repeat until the end of the bottom line is reached. Then, the pooling processing unit 103 sets the acquired values as the element data p01 to p15 of the data 402 for outputting each.

例えば、太線枠４１１が図１６で示すデータ４０１上の位置に配置された場合、プーリング処理部１０３は、要素データｉ１８，ｉ１９，ｉ２６及びｉ２７を取得する。そして、プーリング処理部１０３は、要素データｉ１８，ｉ１９，ｉ２６及びｉ２７を用いてプーリング処理を行い値を取得する。その後、プーリング処理部１０３は、取得した値をデータ４０２の要素データｐ０５とする。 For example, when the thick line frame 411 is arranged at the position on the data 401 shown in FIG. 16, the pooling processing unit 103 acquires the element data i18, i19, i26 and i27. Then, the pooling processing unit 103 performs pooling processing using the element data i18, i19, i26 and i27, and acquires a value. After that, the pooling processing unit 103 sets the acquired value as the element data p05 of the data 402.

プーリング処理部１０３は、要素データｐ００〜ｐ１５を取得してデータ４０２を完成させる。その後、プーリング処理部１０３は、データ４０２を次の演算処理層１０へ出力する。 The pooling processing unit 103 acquires the element data p00 to p15 and completes the data 402. After that, the pooling processing unit 103 outputs the data 402 to the next arithmetic processing layer 10.

図１７は、実施例２に係るプーリング処理部によるストライド数が１の場合のプーリング処理を説明するための図である。ストライド数が１の場合、プーリングの対象が１行ずつ下がるため、データ４０１のような配置形状の場合、２つの異なるプーリングサイズを用いる。 FIG. 17 is a diagram for explaining a pooling process when the number of strides by the pooling process unit according to the second embodiment is 1. When the number of strides is 1, the target of pooling is lowered line by line. Therefore, in the case of an arrangement shape such as data 401, two different pooling sizes are used.

プーリング処理部１０３は、図１７のデータ４０１上に示した太線枠４１２及び４１３をプーリングサイズとして記憶する。そして、プーリング処理部１０３は、データ４０２の奇数行の要素データを算出する場合、太線枠４１３のプーリングサイズを用いる。また、データ４０２の奇数行の要素データを算出する場合、太線枠４１２のプーリングサイズを用いる。 The pooling processing unit 103 stores the thick line frames 412 and 413 shown on the data 401 in FIG. 17 as the pooling size. Then, the pooling processing unit 103 uses the pooling size of the thick line frame 413 when calculating the element data of the odd-numbered rows of the data 402. Further, when calculating the element data of the odd-numbered rows of the data 402, the pooling size of the thick line frame 412 is used.

具体定には、プーリング処理部１０３は、最初に、太線枠４１３の上の行がデータ４０１の１行目の最も若番の要素データに一致するように配置する。そして、太線枠４１３に含まれる要素データｉ００，ｉ０１，ｉ０８及びｉ０９を取得し、取得した要素データの平均や最大値の選択などによるプーリング処理を行い値を取得する。そして、プーリング処理部１０３は、取得した値を出力するデータ４０２の要素データｐ００とする。その後、プーリング処理部１０３は、太線枠４１３がデータ４０１の行の最後尾に達するまで、太線枠４１３を要素データ分ずつ行方向に進めながら、プーリング処理による値の取得を行う。そして、プーリング処理部１０３は、取得した値をそれぞれ出力するデータ４０２の要素データｐ０１〜ｐ０６とする。 Specifically, the pooling processing unit 103 is first arranged so that the line above the thick line frame 413 matches the youngest element data in the first line of the data 401. Then, the element data i00, i01, i08 and i09 included in the thick line frame 413 are acquired, and the pooling process is performed by selecting the average or maximum value of the acquired element data to acquire the value. Then, the pooling processing unit 103 sets the element data p00 of the data 402 that outputs the acquired value. After that, the pooling processing unit 103 acquires the value by the pooling process while advancing the thick line frame 413 in the row direction by the element data until the thick line frame 413 reaches the end of the line of the data 401. Then, the pooling processing unit 103 sets the element data p01 to p06 of the data 402 to output the acquired values, respectively.

次に、プーリング処理部１０３は、太線枠４１２の上の行がデータ４０１の２行目の最も若番の要素データに一致するように配置する。そして、太線枠４１１に含まれる要素データｉ０８，ｉ０９，ｉ１６及びｉ１７を取得し、取得した要素データの平均や最大値の選択などによるプーリング処理を行い値を取得する。そして、プーリング処理部１０３は、取得した値を出力するデータ４０２の要素データｐ０７とする。その後、プーリング処理部１０３は、太線枠４１２がデータ４０１の行の最後尾に達するまで、太線枠４１２を要素データ分ずつ行方向に進めながら、プーリング処理による値の取得を行う。そして、プーリング処理部１０３は、取得した値をそれぞれ出力するデータ４０２の要素データｐ０８〜ｐ１３とする。 Next, the pooling processing unit 103 is arranged so that the line above the thick line frame 412 matches the element data of the youngest number in the second line of the data 401. Then, the element data i08, i09, i16 and i17 included in the thick line frame 411 are acquired, and the pooling process is performed by selecting the average or maximum value of the acquired element data to acquire the value. Then, the pooling processing unit 103 sets the element data p07 of the data 402 that outputs the acquired value. After that, the pooling processing unit 103 acquires the value by the pooling process while advancing the thick line frame 412 in the row direction by the element data until the thick line frame 412 reaches the end of the line of the data 401. Then, the pooling processing unit 103 sets the element data p08 to p13 of the data 402 to output the acquired values, respectively.

プーリング処理部１０３は、１行ずつ対象とする行を下げつつ、プーリングサイズを交互に用いてプーリング処理による値の取得を繰返す。そして、プーリング処理部１０３は、取得した値をそれぞれ出力するデータ４０２の要素データｐ１４〜ｐ４８としていく。 The pooling processing unit 103 repeats the acquisition of the value by the pooling processing by alternately using the pooling size while lowering the target line one by one. Then, the pooling processing unit 103 sets the acquired values as element data p14 to p48 of the data 402 for outputting each of them.

例えば、太線枠４１２が図１７で示すデータ４０１上の位置に配置された場合、プーリング処理部１０３は、要素データｉ０９，ｉ１０，ｉ１７及びｉ１８を取得する。そして、プーリング処理部１０３は、要素データｉ０９，ｉ１０，ｉ１７及びｉ１８を用いてプーリング処理を行い値を取得する。その後、プーリング処理部１０３は、取得した値をデータ４０２の要素データｐ０８とする。 For example, when the thick line frame 412 is arranged at the position on the data 401 shown in FIG. 17, the pooling processing unit 103 acquires the element data i09, i10, i17 and i18. Then, the pooling processing unit 103 performs pooling processing using the element data i09, i10, i17 and i18, and acquires a value. After that, the pooling processing unit 103 sets the acquired value as the element data p08 of the data 402.

また、太線枠４１３が図１７で示すデータ４０１上の位置に配置された場合、プーリング処理部１０３は、要素データｉ３２，ｉ３３，ｉ４０及びｉ４１を取得する。そして、プーリング処理部１０３は、要素データｉ３２，ｉ３３，ｉ４０及びｉ４１を用いてプーリング処理を行い値を取得する。その後、プーリング処理部１０３は、取得した値をデータ４０２の要素データｐ２８とする。 Further, when the thick line frame 413 is arranged at the position on the data 401 shown in FIG. 17, the pooling processing unit 103 acquires the element data i32, i33, i40 and i41. Then, the pooling processing unit 103 performs pooling processing using the element data i32, i33, i40 and i41, and acquires a value. After that, the pooling processing unit 103 sets the acquired value as the element data p28 of the data 402.

プーリング処理部１０３は、要素データｐ００〜ｐ４８を取得してデータ４０２を完成させる。その後、プーリング処理部１０３は、データ４０２を次の演算処理層１０へ出力する。 The pooling processing unit 103 acquires the element data p00 to p48 and completes the data 402. After that, the pooling processing unit 103 outputs the data 402 to the next arithmetic processing layer 10.

以上に説明したように、本実施例に係る演算処理装置は、フォワード畳込演算の演算結果であるトップデータをそのまま用いてプーリング処理を行うことができる。したがって、新フィルタ定義に合わせてボトムデータを変換してフォワード畳込演算を行った場合でも、処理を増やさずにプーリング処理を行うことができ、ネットワーク全体として演算処理の効率を向上させることができる。 As described above, the arithmetic processing unit according to the present embodiment can perform the pooling processing by using the top data which is the arithmetic result of the forward convolution operation as it is. Therefore, even when the bottom data is converted according to the new filter definition and the forward convolution operation is performed, the pooling process can be performed without increasing the process, and the efficiency of the operation process can be improved for the entire network. ..

次に、実施例３について説明する。本実施例に係る演算処理装置は、新フィルタ定義に合わせてボトムデータを変換した場合に、入力されるデータと出力するデータとを同じ大きさにするパディングを行う。本実例に係る演算処理装置も、図１〜４で表される。以下では、実施例１と同様の各部の機能については説明を省略する。 Next, Example 3 will be described. The arithmetic processing unit according to this embodiment performs padding to make the input data and the output data the same size when the bottom data is converted according to the new filter definition. The arithmetic processing unit according to this example is also represented by FIGS. 1 to 4. In the following, description of the functions of the same parts as in the first embodiment will be omitted.

図１８は、実施例３に係る畳込演算部によるフォワード畳込演算を説明するための図である。ここでは、８×８のボトムデータ２０１及び新フィルタ定義３０１を使用した重みデータ２２１を用いてフォワード畳込演算を行う場合で説明する。 FIG. 18 is a diagram for explaining a forward convolution operation by the convolution operation unit according to the third embodiment. Here, a case where the forward convolution operation is performed using the 8 × 8 bottom data 201 and the weight data 221 using the new filter definition 301 will be described.

入力データ処理部１１１は、ボトムデータ２０１の入力を受ける。そして、入力データ処理部１１１は、ボトムデータ２０１を新フィルタ定義２２１に合わせて変換しボトムデータ２１０とする。 The input data processing unit 111 receives the input of the bottom data 201. Then, the input data processing unit 111 converts the bottom data 201 according to the new filter definition 221 to obtain the bottom data 210.

そして、入力データ処理部１１１は、ボトムデータ２１０の周りに図１８に示すように値が０である要素データ２１３を付加し、ボトムデータ２１４を生成する。このボトムデータ２１０の周りに値が０である要素データ２１３を付加する処理が０パディングである。これにより、入力データ処理部１１１は、トップ差分データ２０３のサイズをボトムデータ２１０のサイズと一致させる。そして、入力データ処理部１１１は、ボトムデータ２１４を乗算部１１２へ出力する。 Then, the input data processing unit 111 adds element data 213 having a value of 0 around the bottom data 210 as shown in FIG. 18 to generate the bottom data 214. The process of adding the element data 213 having a value of 0 around the bottom data 210 is 0 padding. As a result, the input data processing unit 111 matches the size of the top difference data 203 with the size of the bottom data 210. Then, the input data processing unit 111 outputs the bottom data 214 to the multiplication unit 112.

乗算部１１２は、ボトムデータ２１４の入力を入力データ処理部１１１から受ける。そして、乗算部１１２は、ボトムデータ２１４に対して、重みデータ２２１を用いてフォワード畳込演算を実行する。これにより、乗算部１１２は、ボトムデータ２１０の要素データｂ００〜ｎｂ６３と同数のトップデータ２０９の要素データｔ００〜ｔ６３を算出する。 The multiplication unit 112 receives the input of the bottom data 214 from the input data processing unit 111. Then, the multiplication unit 112 executes a forward convolution operation on the bottom data 214 using the weight data 221. As a result, the multiplication unit 112 calculates the same number of element data t00 to t63 of the top data 209 as the element data b00 to nb63 of the bottom data 210.

ここで、８×８の行列のボトムデータ２０１の場合、０パディングを行うには３６個の要素データ２１３を用いる。これに対して、ボトムデータ２１０の場合、０パディングを行うには３４個の要素データ２１３を用いる。すなわち、ボトムデータ２１０を用いた方が、変換前のボトムデータ２０１に比べて、０パディングに用いる要素データ２１３が少なくて済む。 Here, in the case of the bottom data 201 of the 8 × 8 matrix, 36 element data 213 are used to perform 0 padding. On the other hand, in the case of the bottom data 210, 34 element data 213 are used to perform 0 padding. That is, when the bottom data 210 is used, the number of element data 213 used for 0 padding is smaller than that of the bottom data 201 before conversion.

以上に説明したように、本実施例に係る演算処理装置は、新フィルタ定義に合わせた変換後のボトムデータに対して０パディングを行いフォワード畳込演算を行う。この場合、変換前のボトムデータに対して０パディングを行うよりも少ない数の要素データの付加で済み、データ容量を小さくできるとともに演算効率を向上させることができる。 As described above, the arithmetic processing unit according to the present embodiment performs 0 padding on the converted bottom data according to the new filter definition and performs the forward convolution operation. In this case, it is sufficient to add a smaller number of element data than performing 0 padding on the bottom data before conversion, and the data capacity can be reduced and the calculation efficiency can be improved.

次に、実施例４について説明する。本実施例に係る演算処理装置は、３次元データに対して新フィルタ定義を用いてフォワード畳込演算及びバックワード畳込演算を行う。本実例に係る演算処理装置も、図１〜４で表される。本実施例に係る各部は、同様の符号を有する実施例１の各部と同様の処理を３次元データに対して実行する機能を有する。 Next, Example 4 will be described. The arithmetic processing unit according to this embodiment performs a forward convolution operation and a backward convolution operation on three-dimensional data using a new filter definition. The arithmetic processing unit according to this example is also represented by FIGS. 1 to 4. Each part according to the present embodiment has a function of executing the same processing as each part of the first embodiment having the same reference numerals to the three-dimensional data.

図１９は、実施例４に係る畳込演算部による新フィルタ定義を用いたフォワード畳込演算の一例を説明するための図である。重みデータ記憶部１１５は、３次元の新フィルタ定義を使用した重みデータ２２２を記憶する。ここで、重みデータ２２２に対応する従来のフィルタ定義は、３×３×３に要素データが並んだ立方体である。重みデータ２２２は、ｘ〜ｚ方向の正面図が実施例１の新フィルタ定義３０１と同様のデータの配置形状を有する。 FIG. 19 is a diagram for explaining an example of a forward convolution operation using a new filter definition by the convolution operation unit according to the fourth embodiment. The weight data storage unit 115 stores weight data 222 using the new three-dimensional filter definition. Here, the conventional filter definition corresponding to the weight data 222 is a cube in which element data are arranged in 3 × 3 × 3. The weight data 222 has the same data arrangement shape as the new filter definition 301 of the first embodiment in the front view in the x to z directions.

入力データ処理部１１１は、８×８×８の立方体であるボトムデータ２０１の入力を受ける。そして、入力データ処理部１１１は、ボトムデータ２０１の図１９の座標に対応するｙ軸方向及びｚ軸方向に並ぶ隣合う要素データを平均化する。これにより、入力データ処理部１１１は、ボトムデータ２０１のｙ軸方向及びｚ軸方向に隔行ずつ要素データの半分だけずらした見た目を有するボトムデータ２１０を生成する。そして、入力データ処理部１１１は、生成したボトムデータ２１０を乗算部１１２へ出力する。 The input data processing unit 111 receives the input of the bottom data 201, which is an 8 × 8 × 8 cube. Then, the input data processing unit 111 averages the adjacent element data arranged in the y-axis direction and the z-axis direction corresponding to the coordinates of FIG. 19 of the bottom data 201. As a result, the input data processing unit 111 generates the bottom data 210 having the appearance of the bottom data 201 shifted by half of the element data by intervals in the y-axis direction and the z-axis direction. Then, the input data processing unit 111 outputs the generated bottom data 210 to the multiplication unit 112.

乗算部１１２は、ボトムデータ２１０の入力を受ける。そして、乗算部１１２は、新フィルタ定義を使用した重みデータ２２２をボトムデータ２１０に対して用いて、フォワード畳込演算を行う。 The multiplication unit 112 receives the input of the bottom data 210. Then, the multiplication unit 112 uses the weight data 222 using the new filter definition for the bottom data 210 to perform a forward convolution operation.

また、畳込演算部１０６は、ボトムデータ２１０と重みデータ２２２とを用いたフォワード畳込演算で算出されたトップデータ２０９の配置形状と同様の配置形状を有するトップ差分データ２０３の入力を受ける。そして、畳込演算部１０６は、ボトムデータ２１０、重みデータ２２２及び取得したトップ差分データ２０３を用いてバックワード畳込演算を実行する。 Further, the convolution calculation unit 106 receives the input of the top difference data 203 having the same arrangement shape as the arrangement shape of the top data 209 calculated by the forward convolution operation using the bottom data 210 and the weight data 222. Then, the convolution calculation unit 106 executes a backward convolution operation using the bottom data 210, the weight data 222, and the acquired top difference data 203.

次に、図２０を参照して、実施例４に係る畳込演算部１０６による新フィルタ定義を用いたフォワード畳込演算の他の例を説明する。図２０は、実施例４に係る畳込演算部による新フィルタ定義を用いたフォワード畳込演算の他の例を説明するための図である。重みデータ記憶部１１５は、３次元の新フィルタ定義を使用した重みデータ２２３を記憶する。この、重みデータ２２３も、３×３×３に要素データが並んだ立方体に対応する新フィルタ定義である。重みデータ２２３も、ｘ〜ｚ方向の正面図が実施例１の新フィルタ定義３０１と同様のデータの配置形状を有する。 Next, with reference to FIG. 20, another example of the forward convolution operation using the new filter definition by the convolution operation unit 106 according to the fourth embodiment will be described. FIG. 20 is a diagram for explaining another example of the forward convolution operation using the new filter definition by the convolution operation unit according to the fourth embodiment. The weight data storage unit 115 stores weight data 223 using the new three-dimensional filter definition. This weight data 223 is also a new filter definition corresponding to a cube in which element data are arranged in 3 × 3 × 3. The weight data 223 also has the same data arrangement shape as the new filter definition 301 of the first embodiment in the front view in the x to z directions.

入力データ処理部１１１は、図１９の場合と同様にボトムデータ２１０を生成する。乗算部１１２は、新フィルタ定義を使用した重みデータ２２３をボトムデータ２１０に対して用いて、フォワード畳込演算を行う。また、畳込演算部１０６は、ボトムデータ２１０と重みデータ２２３とを用いたフォワード畳込演算で算出されたトップデータ２０９と同様のデータの配置形状を有するトップ差分データ２０３を用いてバックワード畳込演算を行う。 The input data processing unit 111 generates the bottom data 210 as in the case of FIG. The multiplication unit 112 uses the weight data 223 using the new filter definition for the bottom data 210 to perform a forward convolution operation. Further, the convolution calculation unit 106 uses the back word tatami using the top difference data 203 having the same data arrangement shape as the top data 209 calculated by the forward convolution calculation using the bottom data 210 and the weight data 223. Performs inclusive operations.

以上に説明したように、本実施例に係る演算処理装置は、３次元データに対しても従来よりも要素データの少ない新フィルタ定義を使用してフォワード畳込演算及びバックワード畳込演算を行う。したがって、本実施例に係る演算処理装置は、３次元データを用いた深層学習の演算において、使用する記憶装置の容量を抑えつつ演算効率を向上させることができる。 As described above, the arithmetic processing unit according to the present embodiment performs forward convolution operation and backward convolution operation for three-dimensional data by using a new filter definition having less element data than before. .. Therefore, the arithmetic processing unit according to the present embodiment can improve the arithmetic efficiency while suppressing the capacity of the storage apparatus used in the deep learning arithmetic using the three-dimensional data.

（プログラムの記述例）
図２１は、フォワード畳込演算のプログラムの記述例を説明するための図である。フォワード畳込演算は、図２１に示すようにボトムデータ２０１（ｂｏｔｔｏｍ＿ｙ）とトップデータ２０９（ｔｏｐ＿ｘ）とを用いた演算は掛け算と足し算で表現できる。フォワード畳込演算は、ボトムデータ２０１のデータ数Ｃｉ、トップ差分データ２０３のデータ数Ｃｏ、バッチ数ｍｂ、ストライド数Ｗ及びトップサイズを調節するためのパラメータとなるパッド数ｐａｄを指定して行なわれる。ここで、トップサイズの調整とは、トップサイズの水増しにあたる。 (Program description example)
FIG. 21 is a diagram for explaining a description example of a program for forward convolution operation. As shown in FIG. 21, the forward convolution operation can be expressed by multiplication and addition in the operation using the bottom data 201 (bottom_y) and the top data 209 (top_x). The forward convolution operation is performed by designating the number of data Ci of the bottom data 201, the number of data Co of the top difference data 203, the number of batches mb, the number of strides W, and the number of pads as parameters for adjusting the top size. .. Here, adjusting the top size corresponds to inflating the top size.

図２２は、バックワード畳込重み差分演算のプログラムの記述例を説明するための図である。バックワード畳込重み差分演算は、図２２に示すようにボトムデータ２０１（ｂｏｔｔｏｍ＿ｙ）とトップ差分データ２０３（ｔｏｐ＿ｘ）とを用いた演算は掛け算と足し算で表現できる。この場合、重み差分データ（ｅｗ）が算出される。バックワード畳込重み差分演算は、ボトムデータ２０１のデータ数Ｃｉ、トップ差分データ２０３のデータ数Ｃｏ、バッチ数ｍｂ、ストライド数Ｗ及びトップサイズを調節するためのパラメータとなるパッド数ｐａｄを指定して行なわれる。ここで、トップサイズの調整とは、トップサイズの水増しにあたる。 FIG. 22 is a diagram for explaining a description example of a program for backward convolution weight difference calculation. As shown in FIG. 22, the backward convolution weight difference calculation can be expressed by multiplication and addition in the calculation using the bottom data 201 (bottom_y) and the top difference data 203 (top_x). In this case, the weight difference data (ew) is calculated. In the backward convolution weight difference calculation, the number of data Ci of the bottom data 201, the number of data Co of the top difference data 203, the number of batches mb, the number of strides W, and the number of pads as parameters for adjusting the top size are specified. Is done. Here, adjusting the top size corresponds to inflating the top size.

図２３は、バックワード畳込ボトム差分演算のプログラムの記述例を説明するための図である。バックワード畳込ボトム差分演算は、図２３に示すようにボトムデータ２０１（ｂｏｔｔｏｍ＿ｙ）とトップ差分データ２０３（ｔｏｐ＿ｘ）と用いた演算は掛け算と足し算で表現できる。この場合、ボトム差分データ２０５（ｂｏｔｔｏｍ＿ｅｙ）が算出される。バックワード畳込ボトム差分演算は、ボトムデータ２０１のデータ数Ｃｉ、トップ差分データ２０３のデータ数Ｃｏ、バッチ数ｍｂ、ストライド数Ｗ及びトップサイズを調節するためのパラメータとなるパッド数ｐａｄを指定して行なわれる。ここで、トップサイズの調整とは、トップサイズの水増しにあたる。 FIG. 23 is a diagram for explaining a description example of a program for backward convolution bottom difference calculation. As shown in FIG. 23, the backward convolution bottom difference operation can be expressed by multiplication and addition of the operation using the bottom data 201 (bottom_y) and the top difference data 203 (top_x). In this case, the bottom difference data 205 (bottom_ey) is calculated. In the backward convolution bottom difference calculation, the number of data Ci of the bottom data 201, the number of data Co of the top difference data 203, the number of batches mb, the number of strides W, and the number of pads as parameters for adjusting the top size are specified. Is done. Here, adjusting the top size corresponds to inflating the top size.

（ハードウェア構成）
図２５は、演算処理装置のハードウェア構成図である。演算処理装置１は、ＣＰＵ（Central Processing Unit）９１、メモリ９２、アクセラレータ９３及びメモリ９４を有する。メモリ９２は、ＣＰＵ９１専用のメモリであり、ＣＰＵ９１に含まれてもよい。また、メモリ９４は、アクセラレータ９３のメモリであり、アクセラレータ９３に含まれてもよい。 (Hardware configuration)
FIG. 25 is a hardware configuration diagram of the arithmetic processing unit. The arithmetic processing unit 1 includes a CPU (Central Processing Unit) 91, a memory 92, an accelerator 93, and a memory 94. The memory 92 is a memory dedicated to the CPU 91 and may be included in the CPU 91. Further, the memory 94 is a memory of the accelerator 93 and may be included in the accelerator 93.

メモリ９２は、ＯＳ（Operating System）及び各演算処理層１０で使用される学習プログラムを含む各種プログラムを記憶する。また、メモリ９２は、入力データ２及び期待値２０７を記憶する。 The memory 92 stores various programs including an OS (Operating System) and a learning program used in each arithmetic processing layer 10. Further, the memory 92 stores the input data 2 and the expected value 207.

ＣＰＵ９１は、メモリ９２に格納されたＯＳを実行する。さらに、ＣＰＵ９１は、メモリ９２が有する学習プログラムを含む各種プログラム、並びに、入力データ２、重みデータ２０２及び期待値２０７を含む各種データをアクセラレータ９３へ出力する。重みデータ２０２には、使用する新フィルタ定義に応じて重みデータ２２１などを含む。そして、ＣＰＵ９１は、深層学習の処理実行をアクセラレータ９３に指示する。その後、ＣＰＵ９１は、学習後の重みデータ２０２をアクセラレータ９３から取得し、メモリ９２に格納された重みデータ２０２を更新する。 The CPU 91 executes the OS stored in the memory 92. Further, the CPU 91 outputs various programs including the learning program included in the memory 92, and various data including the input data 2, the weight data 202, and the expected value 207 to the accelerator 93. The weight data 202 includes weight data 221 and the like according to the new filter definition to be used. Then, the CPU 91 instructs the accelerator 93 to execute the deep learning process. After that, the CPU 91 acquires the trained weight data 202 from the accelerator 93 and updates the weight data 202 stored in the memory 92.

アクセラレータ９３は、例えば、ＧＰＵやＦＰＧＡ（Field Programmable Gate Array）などである。アクセラレータ９３は、ＣＰＵ９１から入力された学習プログラムを含む各種プログラム、並びに、入力データ２及び期待値２０７を含む各種データをメモリ９４に格納する。そして、アクセラレータ９３は、メモリ９４に格納した学習プログラムを含む各種プログラム及び各種データを用いて深層学習の処理を実行する。これにより、アクセラレータ９３は、図２で例示した演算処理層１０の畳込演算部１０１、活性化処理部１０２、プーリング処理部１０３、プーリング処理部１０４、活性化処理部１０５及び畳込演算部１０６の各機能を実現する。アクセラレータ９３は、各演算処理層１０における学習結果である重みデータ２０２をＣＰＵ９１へ出力する。アクセラレータ９３は、全ての演算処理層１０について同様に処理を実行する。ここで、アクセラレータ９３は、各演算処理層１０の処理毎にＣＰＵ９１からデータを取得してもよいし、各演算処理層１０の処理に使用するデータをまとめて取得してもよい。 The accelerator 93 is, for example, a GPU, an FPGA (Field Programmable Gate Array), or the like. The accelerator 93 stores various programs including the learning program input from the CPU 91, and various data including the input data 2 and the expected value 207 in the memory 94. Then, the accelerator 93 executes the deep learning process using various programs including the learning program stored in the memory 94 and various data. As a result, the accelerator 93 includes the convolution calculation unit 101, the activation processing unit 102, the pooling processing unit 103, the pooling processing unit 104, the activation processing unit 105, and the convolution calculation unit 106 of the arithmetic processing layer 10 illustrated in FIG. Realize each function of. The accelerator 93 outputs weight data 202, which is a learning result in each arithmetic processing layer 10, to the CPU 91. The accelerator 93 executes processing in the same manner for all the arithmetic processing layers 10. Here, the accelerator 93 may acquire data from the CPU 91 for each processing of each arithmetic processing layer 10, or may collectively acquire data used for processing of each arithmetic processing layer 10.

１演算処理装置
２入力データ
３出力データ
１０〜１４演算処理層
１０１，１０６畳込演算部
１０２，１０５活性化処理部
１０３，１０４プーリング処理部
１１１入力データ処理部
１１２乗算部
１１３加算部
１１４出力データ作成部
１１５重みデータ記憶部
２０１，２１０，２１１ボトムデータ
２０２，２２１，２２２，２２３重みデータ
２０３トップ差分データ
２０４重み差分データ
２０５ボトム差分データ
２０７期待値
２０９トップデータ 1 Arithmetic processing device 2 Input data 3 Output data 10 to 14 Arithmetic processing layer 101, 106 Convolution calculation unit 102, 105 Activation processing unit 103, 104 Pooling processing unit 111 Input data processing unit 112 Multiplying unit 113 Addition unit 114 Output data Creation unit 115 Weight data storage unit 2011,210,211 Bottom data 202,221,222,223 Weight data 203 Top difference data 204 Weight difference data 205 Bottom difference data 207 Expected value 209 Top data

Claims

A storage unit that stores the first data having the element data forming the matrix and the second data having the arrangement shape obtained by excluding a predetermined number of element data from the element data forming the matrix.
A conversion unit that converts the first data based on the arrangement shape of the second data, and
A calculation processing unit including a convolution calculation unit that performs a convolution calculation using the second data as a filter for the first data converted by the conversion unit.

The arithmetic processing unit according to claim 1, wherein the second data has a symmetrical arrangement shape in the vertical, horizontal, and diagonal directions.

The first data has the same number of element data in the row direction and the column direction.
In the second data, the element data including the row was removed one by one according to the distance from the middle row from the state where the same number of element data was arranged in the row direction and the column direction, and the element data was excluded. It has an arrangement shape in which the element data is arranged so that the position of half of the row and the position of half of the previous row excluding the element data match.
The arithmetic processing unit according to claim 1 or 2, wherein the conversion unit performs conversion for averaging adjacent element data in intervals of the first data.

The operation according to any one of claims 1 to 3, further comprising a pooling processing unit that executes a pooling process by using the value of element data included in the calculation result by the convolution calculation unit as it is. Processing equipment.

The convolution calculation unit adds element data having a value of 0 so as to surround the first data with a minimum number, and the convolution calculation unit adds the element data having a value of 0 to the first data. The arithmetic processing apparatus according to any one of claims 1 to 4, wherein the convolution operation is performed using the second data as a filter, and an arithmetic result having the same number of element data as the first data is acquired. ..

A control method for an arithmetic processing unit that stores first data having element data forming a matrix and second data having an arrangement shape obtained by removing a predetermined number of element data from the element data forming a matrix.
The first data is converted based on the arrangement shape of the second data.
A control method of an arithmetic processing unit, characterized in that a convolution operation is performed on the converted first data by using the second data as a filter.