In this example, 8 K divided by 4 K is two, so there are two rows. The data space does not matter for this calculation, so the matrix is two rows and 1 column resulting in three pipeline registers again.
Note: The tool reproduces the whole matrix to obtain the
extra 8 bits of data space needed to create the RAM. However, that reproduction does not
affect the calculation of pipeline registers.