Perform row wise the reduce add of a matrix of the columns. Because for every row returns a number, the result is a vector of values that represent the magnitude of the reduce add operation for every row.
- Template params:
- T: type of the operation;
- LEN: number of elements to be processed in the kernel per iteration;
- INCREMENT: parameter that indicates how much iterations have been performed by the SIMD with respect to the intended total length;
- VECDIM: dimension of the SIMD to be performed. Addressed in the Xilinx UG1076, it depends on the type chosen;
- Function params:
- in1: elements of the vector to be passed to the kernel.
- out: elements of the result of the operation (vector) to be passed from the kernel.