As AI Engine designs grow larger, tracking the events produced while running the design can be useful to identify performance bottlenecks as well as understanding how the overall AI Engine is operating for the design. Of course, with larger designs more and more events will be produced causing a bottleneck of the events being recorded by the trace IP being used. To capture all this data effectively, and quickly, you should consider instantiating multiple event trace streams. These streams will spread out the event data coming from the AI Engine, letting it store them correctly and in a timely manner.
To increase the trace streams in a design, use the --num-trace-streams
option when you compile, which can have a value in the
range of 1 to 16. The following table provides guidance on the number of trace streams
to use, depending on the size of the design.
Number of AI Engines | Recommended Number of Streams |
---|---|
Less than 10 | 1 |
Between 10 and 20 | 2 |
Between 20 and 40 | 4 |
Between 40 and 80 | 8 |
Larger than 80 | 16 |
|
After the change to the AI Engine compiler option, recompile and re-link the XCLBIN file and libadf.a using the Vitis compiler with a config file as described in Linking the System in Vitis Unified Software Platform Documentation: Application Acceleration Development (UG1393).
v++ -l --config system.cfg ...