The input should be a set of integer vertex list with known size. The result returns a vertex list with each vertex corresponding to a similarity value. To optimize the calculation of dense and integer value, the design disables neither float datatype nor sparse mode. Furthermore, the support for Jaccard Similarity is also removed in the kernel so that it saves lots of hardware resources and realizes a design of 2-CU instantiation to get the best performance on the platform of U55C. The design support additional coefficients for each column of weight for better software flexibility.