AOCL-Data Analytics - 5.0 English

AOCL Performance Tuning Guide (63859)

Document ID: 63859
Release Date: 2024-10-10
Version: 5.0 English

11. AOCL-Data Analytics

The prebuilt AOCL-Data Analytics (AOCL-DA) binaries available at https://www.amd.com/en/developer/aocl.html use AVX2 vector instructions. For Zen4 (Genoa/Bergamo/Siena) and Zen5 (Turin) machines, we recommend building AOCL-DA from source with AVX512 instructions enabled. To do this, pass the CMake option -DARCH=znver4 or -DARCH=znver5, choosing the value that matches the Zen generation of your processors and that your compiler supports. See the user guide for further details.
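As an illustration of choosing the -DARCH value, the following Python sketch inspects Linux /proc/cpuinfo text and assembles a CMake configure command. It is not part of AOCL-DA. The helper names suggest_arch and cmake_configure_command are hypothetical, as are the "." source directory and "build" build directory. The sketch assumes that AMD CPU family 25 (19h) reporting the avx512f flag corresponds to Zen4 and that family 26 (1Ah) corresponds to Zen5; confirm the result against your hardware and compiler documentation before relying on it.

```python
from typing import List, Optional


def suggest_arch(cpuinfo_text: str) -> Optional[str]:
    """Suggest a value for -DARCH, or None to keep the default build.

    Assumption (not from the tuning guide): AMD family 25 with AVX-512
    is treated as Zen4 and family 26 as Zen5.
    """
    vendor = None
    family = None
    flags = set()
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue
        key, value = key.strip(), value.strip()
        # Only the first logical CPU entry is inspected.
        if key == "vendor_id" and vendor is None:
            vendor = value
        elif key == "cpu family" and family is None and value.isdigit():
            family = int(value)
        elif key == "flags" and not flags:
            flags = set(value.split())
    if vendor != "AuthenticAMD" or "avx512f" not in flags:
        return None
    return {25: "znver4", 26: "znver5"}.get(family)


def cmake_configure_command(arch: Optional[str],
                            source_dir: str = ".",
                            build_dir: str = "build") -> List[str]:
    """Build the argument list for the CMake configure step."""
    cmd = ["cmake", "-S", source_dir, "-B", build_dir]
    if arch is not None:
        cmd.append(f"-DARCH={arch}")
    return cmd


if __name__ == "__main__":
    try:
        with open("/proc/cpuinfo") as f:
            text = f.read()
    except OSError:
        text = ""  # Not Linux, or file unreadable: fall back to the default.
    print(" ".join(cmake_configure_command(suggest_arch(text))))
```

Run as a script, this prints the configure command it would suggest for the current machine. When no AVX512-capable Zen processor is detected, the -DARCH option is simply omitted.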

AOCL-DA depends on BLAS and LAPACK for linear algebra computations. When building from source, we recommend following the relevant tuning guides for AOCL-BLAS and AOCL-LAPACK. For optimal performance, we also recommend building with the AOCC 5.0 compiler.

AOCL-DA provides multithreaded support using OpenMP. Set the number of threads with the OMP_NUM_THREADS environment variable. Note that parallel performance depends heavily on both the algorithm used and your data, so the best thread count may differ from one workload to another.
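Because the best thread count depends on the algorithm and the data, it can help to time the same workload at several OMP_NUM_THREADS settings. The Python sketch below shows one way to do that by launching a command in a child process with the variable set. The helper name run_with_threads is hypothetical, and the child command here is only a stand-in that echoes the variable; in practice you would substitute your own program linked against AOCL-DA.

```python
import os
import subprocess
import sys
import time


def run_with_threads(command, num_threads):
    """Run `command` with OMP_NUM_THREADS set; return (seconds, stdout)."""
    # Copy the current environment and override only OMP_NUM_THREADS,
    # so the setting applies to the child process alone.
    env = dict(os.environ, OMP_NUM_THREADS=str(num_threads))
    start = time.perf_counter()
    result = subprocess.run(command, env=env, capture_output=True,
                            text=True, check=True)
    return time.perf_counter() - start, result.stdout


if __name__ == "__main__":
    # Stand-in workload (an assumption for illustration): a child Python
    # process that prints the OMP_NUM_THREADS value it was given.
    command = [sys.executable, "-c",
               "import os; print(os.environ['OMP_NUM_THREADS'])"]
    for n in (1, 2, 4):
        elapsed, out = run_with_threads(command, n)
        print(f"OMP_NUM_THREADS={n}: child saw {out.strip()}, "
              f"{elapsed:.3f} s")
```

Setting the variable on the child's environment, rather than inside an already running process, avoids the case where the OpenMP runtime has read its settings before the variable was changed.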