The final size of the stage 1 bitstream varies based on many factors, including:
- IP: The size and shape of the first-stage Pblocks determine the number of frames required for stage 1. x8 and x16 configurations will require more GT quads in the stage 1 floorplan, which will lead to a larger stage 1 bitstream.
- Device: Wider devices require more routing frames to connect the IP to clocking resources.
- Design: Location of the reset pin is one of many factors introduced by the addition of the user application.
- GT Locations: The selection of the GT quads used affects the size of the stage 1 bitstream. For the most efficient use of resources, the GT quad adjacent to the PCI Express hard block should be used.
- Compression: As the device utilization increases, the effectiveness of compression decreases.
As a baseline, here are some sample bitstream sizes and configuration times for the example (PIO) design generated along with the PCIe IP.
Device | Full Bitstream | Full: BPI16 at 50 MHz |
Tandem Stage 1 2 | Tandem: BPI16 at 50 MHz |
---|---|---|---|---|
AU25P | 117.7 Mb | 147.2 ms | 14.4 Mb | 18.1 ms |
KU15P | 277.3 Mb | 346.6 ms | 17.6 Mb | 22.0 ms |
VU9P | 611.6 Mb | 764.5 ms | 17.5 Mb | 21.8 ms |
|
The amount of time it takes to load the stage 2 bitstream using the Tandem PCIe methodology depends on three additional factors:
- The width and speed of PCI Express link.
- The frequency of the clock used to program the MCAP.
- The efficiency at which the Root Port host can deliver the bitstream to the Endpoint FPGA design. For most designs this is the limiting factor.
The lower bandwidth of these three factors determines how fast the stage 2 bitstream is loaded.