The Dense Similarity Kernel is validated on Alveo U50 board at 260MHz frequency. The hardware resource utilization and benchmark results are shown in the two table below.
Name | LUT | Register | BRAM | URAM | DSP |
|
221256 | 329187 | 402 | 16 | 1273 |
|
134446 | 160671 | 402 | 16 | 807 |
|
272521 | 333259 | 618 | 48 | 2364 |
Datasets | Vertex | Edges | Similarity Type | FPGA Time / ms | TigerGraph (32 core 512 GB) | |
Time / ms | Speed up | |||||
Patients(1 GB) | 1250000 | 200 | Cosine | 11.2 | 585.7 | 52.3 |
Note
1. Tigergraph running on platform with Intel(R) Xeon(R) CPU E5-2640 v3 @2.600GHz, 32 Threads (16 Core(s)).
2. The uint + float version and integer version have relatively similar performance.
3. Time unit: ms.