AI Engines can directly communicate through the AXI4-Stream interconnect without any DMA and memory interaction. Data can transfer from one AI Engine to another or broadcast through the streaming interface. The data bandwidth of a streaming connection is 32-bit per cycle and built-in handshake and backpressure mechanisms are available.
The stream connection can be unicast or multicast.
Note: In multicast communication, data is sent to all the destination
ports at the same time and only when all destinations are ready to receive data.