template <typename t_DataType, unsigned int t_ParEntries, typename t_IndexType = unsigned int> void axpy(unsigned int p_n, const t_DataType p_alpha, hls::stream<WideType<t_DataType, t_ParEntries>>& p_x, hls::stream<WideType<t_DataType, t_ParEntries>>& p_y, hls::stream<WideType<t_DataType, t_ParEntries>>& p_r)
axpy function that compute Y = alpha*X + Y.
Parameters:
t_DataType | the data type of the vector entries |
t_LogParEntries | log2 of the number of parallelly processed entries in the input vector |
t_IndexType | the datatype of the index |
p_n | the number of entries in the input vector p_x, p_n % t_ParEntries == 0 |
p_x | the input stream of packed entries of vector X |
p_y | the input stream of packed entries of vector Y |
p_r | the output stream of packed entries of result vector Y |