11. AOCL-Data Analytics#
The binaries supplied for AOCL-Data Analytics (AOCL-DA) at https://www.amd.com/en/developer/aocl.html
use AVX2 vector instructions. For Zen4 (Genoa/Bergamo/Siena) and Zen5 (Turin) machines, we recommend
building AOCL-DA from source with AVX512 instructions enabled. This can
be done using the -DARCH=znver4
or -DARCH=znver5
CMake options, depending on the Zen generation of the processors and the options supported by the compiler. See the user guide for further details.
AOCL-DA depends on BLAS and LAPACK for linear algebra computations. When building from source we recommend following the relevant tuning guides for AOCL-BLAS and AOCL-LAPACK. For optimal performance we also recommend building using the AOCC 5.0 compiler.
AOCL-DA provides multithreaded support using OpenMP. The number of
threads should be set using the OMP_NUM_THREADS
environment variable.
Note that the parallel performance will be highly dependent on both the
algorithm used and your data.