This section showcases several Fast Fourier Transform (FFT) designs for AMD Versal™ AI Engine, starting with a single-core design built with the AI Engine API. After establishing a single-core throughput and latency baseline using the AMD Vitis™ AI Engine software simulation tools, it presents FFT design optimization techniques that improve on that baseline. The Vitis DSP library uses these techniques to deliver high-performance, scalable FFT IP spanning single-core to multicore designs.