Function arm_concatenation_s8_y¶
Defined in File arm_nnfunctions.h
Function Documentation¶
-
void arm_concatenation_s8_y(const int8_t *input, const uint16_t input_x, const uint16_t input_y, const uint16_t input_z, const uint16_t input_w, int8_t *output, const uint16_t output_y, const uint32_t offset_y)¶
int8/uint8 concatenation function to be used for concatenating N-tensors along the Y axis This function should be called for each input tensor to concatenate. The argument offset_y will be used to store the input tensor in the correct position in the output tensor
i.e. offset_y = 0 for(i = 0 i < num_input_tensors; ++i) { arm_concatenation_s8_y(&input[i], …, &output, …, …, offset_y) offset_y += input_y[i] }
This function assumes that the output tensor has:
The same width of the input tensor
The same number of channels of the input tensor
The same batch size of the input tensor
Unless specified otherwise, arguments are mandatory.
Input constraints offset_y is less than output_y
Note
This function, data layout independent, can be used to concatenate either int8 or uint8 tensors because it does not involve any arithmetic operation
- Parameters:
input – [in] Pointer to input tensor. Input tensor must not overlap with the output tensor.
input_x – [in] Width of input tensor
input_y – [in] Height of input tensor
input_z – [in] Channels in input tensor
input_w – [in] Batch size in input tensor
output – [out] Pointer to output tensor. Expected to be at least (input_z * input_w * input_x * input_y) + offset_y bytes.
output_y – [in] Height of output tensor
offset_y – [in] The offset on the Y axis to start concatenating the input tensor It is user responsibility to provide the correct value