The TDM FIR uses 32 AI Engine tiles with 32 IO streams.
The 4k-pt IFFT is implemented using a 2D architecture, with resources split between 16 AI Engine tiles (compute) and the PL (data transpose).
From a bandwidth perspective, the design requires 2 input and 4 output streams.
Custom HLS blocks (merge and split) are built to manage connectivity between the IPs.
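
To illustrate why a 2D IFFT architecture needs a transpose between its two compute passes, here is a minimal Python sketch of the standard two-pass decomposition of a length n1*n2 inverse DFT. It is not the design's actual kernel code. The function names `idft` and `ifft_2d` are hypothetical, the naive `idft` merely stands in for the short transforms that would run on the AI Engine tiles, and the 64 x 64 split used in the usage note is an assumption (the text above only states the 4k total size).

```python
import cmath


def idft(seq):
    """Naive inverse DFT with 1/N scaling (O(N^2) stand-in for a short IFFT kernel)."""
    n = len(seq)
    return [
        sum(seq[j] * cmath.exp(2j * cmath.pi * j * k / n) for j in range(n)) / n
        for k in range(n)
    ]


def ifft_2d(x, n1, n2):
    """Inverse DFT of length n1*n2 built from two passes of short transforms."""
    n = n1 * n2
    if len(x) != n:
        raise ValueError("len(x) must equal n1 * n2")

    # Pass 1: for each j1, an n2-point IDFT over the samples x[j1 + n1*j2].
    # Result layout: cols[j1][k2].
    cols = [idft([x[j1 + n1 * j2] for j2 in range(n2)]) for j1 in range(n1)]

    # Twiddle multiplication between the passes: exp(+2*pi*i * j1*k2 / n).
    for j1 in range(n1):
        for k2 in range(n2):
            cols[j1][k2] *= cmath.exp(2j * cmath.pi * j1 * k2 / n)

    # Transpose so that pass 2 reads contiguous rows: rows[k2][j1].
    # Conceptually, this is the data-reordering step between the two passes.
    rows = [[cols[j1][k2] for j1 in range(n1)] for k2 in range(n2)]

    # Pass 2: for each k2, an n1-point IDFT over j1. Result layout: rows[k2][k1].
    rows = [idft(r) for r in rows]

    # Output sample k = n2*k1 + k2.
    out = [0j] * n
    for k2 in range(n2):
        for k1 in range(n1):
            out[n2 * k1 + k2] = rows[k2][k1]
    return out
```

Under the assumed 64 x 64 factorization, `ifft_2d(x, 64, 64)` computes a 4096-point inverse transform using 128 transforms of length 64. The two passes are pure compute, while the step between them is pure data movement, which is consistent with the compute/transpose split between AI Engine tiles and PL described above.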