Convolutional Neural Network with INT4 Optimization on Xilinx Devices (WP521) - INT8 provides better performance with comparable precision than floating point for AI inference. But when INT8 is unable to meet the desired performance with limited resources, INT4 optimization is the answer. With INT4 optimization, Xilinx can achieve up to a 77% performance boost on real hardware in comparison with the current INT8 solution. - WP521

wp521-4bit-optimization.pdf

Document ID
WP521
Release Date
2020-06-24
Revision
1.0.1 English