template <typename t_DataType, unsigned int t_ParEntries, typename t_IndexType = unsigned int>
void axpy(unsigned int p_n,
          const t_DataType p_alpha,
          hls::stream<WideType<t_DataType, t_ParEntries>>& p_x,
          hls::stream<WideType<t_DataType, t_ParEntries>>& p_y,
          hls::stream<WideType<t_DataType, t_ParEntries>>& p_r)
The axpy function computes Y = alpha*X + Y.
Parameters:
t_DataType | The data type of the vector entries. |
t_ParEntries | The number of entries of the input vector that are processed in parallel. |
t_IndexType | The data type of the index. |
p_n | The number of entries in the input vector p_x. Must be a multiple of t_ParEntries (p_n % t_ParEntries == 0). |
p_alpha | The scalar alpha that multiplies vector X. |
p_x | The input stream of packed entries of vector X. |
p_y | The input stream of packed entries of vector Y. |
p_r | The output stream of packed entries of result vector Y. |
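
The parameters above can be illustrated with a small software model. This is only a sketch, not the library implementation: it assumes std::queue as a stand-in for hls::stream and std::array as a stand-in for WideType, so that it compiles without the HLS headers. The names StreamModel, PackedModel, axpyModel and runAxpyModel are hypothetical and do not exist in the library.

```cpp
#include <array>
#include <cassert>
#include <queue>
#include <vector>

// Hypothetical stand-ins (assumptions, not library types):
// PackedModel plays the role of WideType, StreamModel the role of hls::stream.
template <typename T, unsigned int N>
using PackedModel = std::array<T, N>;

template <typename T>
using StreamModel = std::queue<T>;

// Software model of axpy: each packed word of p_r holds alpha * x + y
// for t_ParEntries consecutive entries.
template <typename t_DataType, unsigned int t_ParEntries>
void axpyModel(unsigned int p_n,
               const t_DataType p_alpha,
               StreamModel<PackedModel<t_DataType, t_ParEntries>>& p_x,
               StreamModel<PackedModel<t_DataType, t_ParEntries>>& p_y,
               StreamModel<PackedModel<t_DataType, t_ParEntries>>& p_r) {
    // Documented precondition: p_n is a multiple of t_ParEntries.
    assert(p_n % t_ParEntries == 0);
    const unsigned int l_words = p_n / t_ParEntries;
    for (unsigned int i = 0; i < l_words; ++i) {
        // Read one packed word from each input stream.
        PackedModel<t_DataType, t_ParEntries> l_x = p_x.front();
        p_x.pop();
        PackedModel<t_DataType, t_ParEntries> l_y = p_y.front();
        p_y.pop();
        PackedModel<t_DataType, t_ParEntries> l_r{};
        // In hardware these entries would be processed in parallel.
        for (unsigned int j = 0; j < t_ParEntries; ++j) {
            l_r[j] = p_alpha * l_x[j] + l_y[j];
        }
        p_r.push(l_r);
    }
}

// Hypothetical helper: packs two plain vectors into streams, runs the
// model, and unpacks the result stream into a plain vector.
template <typename t_DataType, unsigned int t_ParEntries>
std::vector<t_DataType> runAxpyModel(t_DataType p_alpha,
                                     const std::vector<t_DataType>& p_xVec,
                                     const std::vector<t_DataType>& p_yVec) {
    assert(p_xVec.size() == p_yVec.size());
    const unsigned int l_n = static_cast<unsigned int>(p_xVec.size());
    assert(l_n % t_ParEntries == 0);
    StreamModel<PackedModel<t_DataType, t_ParEntries>> l_x, l_y, l_r;
    for (unsigned int i = 0; i < l_n; i += t_ParEntries) {
        PackedModel<t_DataType, t_ParEntries> l_wx{}, l_wy{};
        for (unsigned int j = 0; j < t_ParEntries; ++j) {
            l_wx[j] = p_xVec[i + j];
            l_wy[j] = p_yVec[i + j];
        }
        l_x.push(l_wx);
        l_y.push(l_wy);
    }
    axpyModel<t_DataType, t_ParEntries>(l_n, p_alpha, l_x, l_y, l_r);
    std::vector<t_DataType> l_out;
    while (!l_r.empty()) {
        for (unsigned int j = 0; j < t_ParEntries; ++j) {
            l_out.push_back(l_r.front()[j]);
        }
        l_r.pop();
    }
    return l_out;
}
```

With t_ParEntries = 4 and p_n = 8, the model consumes two packed words from each of p_x and p_y and produces two packed words on p_r. The real function would be called from HLS code with hls::stream arguments; the model only mirrors the data flow described by the parameter list.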