The kernel is validated on Alveo U55C board at 220MHz frequency. The hardware resource utilization and benchmark results are shown in the two table below.
Name | LUT | Register | BRAM | URAM | DSP |
denseSimilarityKernel (int + 2CU + Coeffs) | 262317 | 233100 | 794 | 48 | 9 |
Datasets | Vertex | Edges | Similarity Type | FPGA Time / ms | TigerGraph (32 core 512 GB) | |
Time / ms | Speed up | |||||
Patients(1GB/CU) | 1250000 | 200 | Cosine | 7.0 | 585.7 | 83.5 |
Note
1. Tigergraph running on platform with Intel(R) Xeon(R) CPU E5-2640 v3 @2.600GHz, 32 Threads (16 Core(s)).
2. The uint + float version and integer version have relatively similar performance.
3. Time unit: ms.