Performance - Performance - 1.0 English - PG427

Cached DRAM Binary CAM LogiCORE IP Product Guide (PG427)

Document ID
PG427
Release Date
2023-10-18
Version
1.0 English

Lookup Performance

High capacity CDBCAM systems consists of a high capacity DBCAM and a lower capacity BCAM Cache Subsystem (BCS). The DBCAM uses DRAM for storage and has a higher latency and has a generally lower lookup rate than the on-chip BCAM. The BCS provides lower latency and a high lookup rate on-chip cache BCAM. The resulting lookup rate of such a system is inversely proportional to the Cache Miss Rate (CMR), but it cannot go beyond the maximum lookup rate of the BCS, hence the resulting lookup rate can be expressed by the formula:

  • LR = CLR, for CLR*CMR < DLR.
    Note: To achieve this rate, the AXI4-Stream rate must be higher than LR.
  • LR = DLR/CMR, for CLR*CMR > DLR.

where:

  • LR is a resulting lookup rate;
  • CLR is a cache (BCS) lookup rate;
  • DLR is a DBCAM lookup rate;
  • CMR is a cache (BCS) miss rate. For example, for 1 in 10 lookups being a miss, the CMR would be 0.1.

The expected performance is the function of 3 variables, where one of them is lookup flow content dependent. To visualize the expected performance, the graph below illustrates a typical case with the following characteristics as a function of the cache (BCS) miss rate:

  • CLR = 150 M lookups/s
  • DLR = 39 M lookups/s
Figure 1. Achievable Lookup Throughput Performance Generated by Your Tool

DBCAM Lookup Throughput Performance

Because the resulting lookup performance depends on DBCAM lookup performance, below are some examples of achievable DBCAM performance.

Table 1. DBCAM Lookup Throughput Performance and Latency
DRAM Type DBCAM Entry Size 1 [B] Mean DLR [Mlookups/s] Mean Lookup Latency [us]
DDR4 64 28 0.68
DDR4 128 23 0.75
HBM 64 39 0.75
HBM 128 28 1.17
  1. For DBCAM, the entry size is calculated as follows:
    • (KEY_WIDTH + RESPONSE_WIDTH) < 64B: 64B
    • (KEY_WIDTH + RESPONSE_WIDTH) >= 64B: 128B
  2. The HBM results reflect HBM clocking at 2400 MHz, and higher rates are possible at 3200 MHz clock frequency.
  3. Higher lookup rates can be achieved when using more than one HBM pseudo-channel. Lookup performance is capped at 50 Mlps in this release