Wide multipliers, where at least one port exceeds the maximum width supported by the DSP slice in the target architecture, require additional pipeline stages to achieve the maximum operating frequency of the DSP slice. The required number of stages depends on the multiplier width.
Adding extra stages to the output of wide multipliers in the RTL allows the synthesis tool to move them to optimal positions, making recoding straightforward.