Function arm_quantize_f32_s8

Function Documentation

arm_cmsis_nn_status arm_quantize_f32_s8(const float *input, int8_t *output, int32_t size, int32_t zero_point, float scale)

Quantize a floating-point array into int8_t format.

Parameters:
  • input[in] Pointer to the input float array.

  • output[out] Pointer to the output int8_t array.

  • size[in] Number of elements in the arrays.

  • zero_point[in] Zero point (offset) to apply during quantization.

  • scale[in] Scale factor to apply during quantization.

Returns:

The function returns ARM_CMSIS_NN_SUCCESS