The xclbin can be built at 300 MHz. The following two tables show the hardware resource utilization and the benchmark results.
Table 1 Hardware Resources
Name | LUT | BRAM | URAM | DSP | FF |
---|---|---|---|---|---|
blasKernel | 198418 | 66 | 24 | 1235 | 383276 |

Table 2 gemm_1CU Benchmark Results

M | N | K | Kernel Execution Time [ms] | API Execution Time [ms] |
---|---|---|---|---|
64 | 64 | 64 | 0.098555 | 305.583425 |
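
The timings in Table 2 can be turned into a rough throughput figure. The sketch below is an illustration only, not part of the benchmark. It assumes the common convention of counting a GEMM as 2·M·N·K operations (one multiply and one add per inner-product term), and the helper names `gemm_ops`, `throughput_gops`, and `overhead_ratio` are hypothetical. The source does not state the data type or what the API execution time includes, so the results should be read as estimates.

```python
def gemm_ops(m: int, n: int, k: int) -> int:
    """Operation count for an M x K by K x N product.

    Assumption: one multiply and one add per inner-product term,
    i.e. 2 * M * N * K operations in total.
    """
    return 2 * m * n * k


def throughput_gops(m: int, n: int, k: int, kernel_ms: float) -> float:
    """Estimated throughput in giga-operations per second."""
    seconds = kernel_ms * 1e-3
    return gemm_ops(m, n, k) / seconds / 1e9


def overhead_ratio(api_ms: float, kernel_ms: float) -> float:
    """How many times longer the API call takes than the kernel itself."""
    return api_ms / kernel_ms


if __name__ == "__main__":
    # Values copied from the single row of Table 2.
    m, n, k = 64, 64, 64
    kernel_ms = 0.098555
    api_ms = 305.583425

    print(f"operations:      {gemm_ops(m, n, k)}")
    print(f"throughput:      {throughput_gops(m, n, k, kernel_ms):.2f} Gop/s")
    print(f"API/kernel time: {overhead_ratio(api_ms, kernel_ms):.0f}x")
```

For this 64 x 64 x 64 case the estimate comes out at roughly 5.3 Gop/s, and the API time is about three orders of magnitude larger than the kernel time. For a problem this small, the kernel execution time is therefore the more meaningful measure of the hardware itself.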