The DFT library element is able to split the workload into multiple kernels connected in a chain with the use of template parameter TP_CASC_LEN
. Required computation as well as input data buffer and twiddle table will be split and distributed between number of kernels determined by TP_CASC_LEN
. Using multiple kernels, support for TP_POINT_SIZE
is extended beyond what can be stored on a single kernel.