Typical AOCL-DLP usage follows these layers:
Prepare data: Set up matrix layouts (row-major or column-major) and leading dimensions
Optional reordering: Optimize data layout for repeated use
Configure metadata: Set up
dlp_metadata_tfor fused post-operationsCall GEMM or element-wise: Execute the main computation
Optional unreordering: Convert outputs back to desired format