The xclbin could be built in 242 MHz. The hardware resource utilization and benchmark results are shown in the following two tables.
Table 1 Hardware Resources
Name | LUT | BRAM | URAM | DSP | FF |
---|---|---|---|---|---|
blasKernel | 250679 | 94 | 24 | 1224 | 430512 |
Table 2 Benchmark Results
M | N | K | Kernel Execution Time [ms] | API Execution Time [ms] | Kernel Eff [%] |
---|---|---|---|---|---|
64 | 64 | 64 | 0.010905 | 1.750123 | 38.802577 |
128 | 128 | 128 | 0.048517 | 13.802416 | 69.772592 |
256 | 256 | 256 | 0.328314 | 14.645931 | 82.485022 |
512 | 512 | 512 | 3.213388 | 18.199255 | 67.420400 |
1024 | 1024 | 1024 | 24.113855 | 45.519852 | 71.875005 |
2048 | 2048 | 2048 | 186.688153 | 264.195138 | 74.270743 |
4096 | 4096 | 4096 | 1469.773731 | 1708.938204 | 75.469945 |