- model
- Float module to be quantized.
- input_signature
- Input tensor with the same shape as the real input of the float module to be quantized. The values can be random numbers.
- quant_mode
- A string that indicates which quantization mode the process is using:
  - calib: for calibration of quantization
  - test: for evaluation of the quantized model
- output_dir
- Directory for quantization results and intermediate files. The default value is quantize_result.
- bitwidth
- Global quantization bit width. The default value is 8.
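
The parameters above can be collected and checked before they are handed to the quantizer. The following is a minimal sketch only: `QuantizerArgs` and `VALID_QUANT_MODES` are hypothetical names introduced for illustration and are not part of any library API, and the quantizer call itself is left out. It encodes just the documented facts: `quant_mode` is either `calib` or `test`, `output_dir` defaults to `quantize_result`, and `bitwidth` defaults to 8.

```python
from dataclasses import dataclass
from typing import Any

# The two quantization modes listed in the parameter description.
VALID_QUANT_MODES = ("calib", "test")


@dataclass
class QuantizerArgs:
    """Hypothetical container mirroring the documented parameters."""

    model: Any                           # float module to be quantized
    input_signature: Any                 # same shape as the real input; values may be random
    quant_mode: str                      # "calib" or "test"
    output_dir: str = "quantize_result"  # documented default
    bitwidth: int = 8                    # documented default

    def __post_init__(self) -> None:
        # Reject modes other than the two documented ones.
        if self.quant_mode not in VALID_QUANT_MODES:
            raise ValueError(
                f"quant_mode must be one of {VALID_QUANT_MODES}, got {self.quant_mode!r}"
            )
        # Assumption: a bit width only makes sense as a positive integer.
        if not isinstance(self.bitwidth, int) or self.bitwidth <= 0:
            raise ValueError(f"bitwidth must be a positive integer, got {self.bitwidth!r}")


# Usage: a placeholder object and a dummy input stand in for the real
# float module and its input tensor.
args = QuantizerArgs(model=object(), input_signature=[0.0, 0.0, 0.0], quant_mode="calib")
print(args.output_dir, args.bitwidth)
```

A typical flow would presumably build one such set of arguments with `quant_mode="calib"` for the calibration pass, then a second with `quant_mode="test"` to evaluate the quantized model, keeping `output_dir` the same so that the second pass can find the files written by the first.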