5. AOCL-DLP
AOCL-DLP is an optimized implementation of deep learning primitives for AMD Zen-based processors, providing high-performance implementations of fundamental operations including GEMM (General Matrix Multiplication), batch GEMM, element-wise operations, and matrix transformations with support for multiple data types (FP32, BF16, INT8) and post-operations.
API Categories
5.7. Quick API Lookup
5.7.1. Core GEMM Operations
Function Pattern |
Description |
|---|---|
|
Float32 precision GEMM |
|
BFloat16 inputs, float32 output |
|
Unsigned/signed 8-bit quantized GEMM |
|
Signed 8-bit quantized GEMM |
5.7.2. Batch Operations
Function Pattern |
Description |
|---|---|
|
Batch processing for multiple matrices |
5.7.3. Matrix Utilities
Function Pattern |
Description |
|---|---|
|
Get buffer size for matrix reordering |
|
Reorder matrix for optimal performance |
|
Convert reordered matrix back to normal format |
5.7.4. Element-wise Operations
Function Pattern |
Description |
|---|---|
|
Apply element-wise operations to matrices |
5.7.5. Utility Functions
Function Pattern |
Description |
|---|---|
|
GELU activation functions |
|
Softmax functions |
5.7.6. Library Management
Function |
Description |
|---|---|
|
Configure thread count |
|
Configure parallelization strategy |
|
Query hardware capabilities |
5.8. API Selection Guide
5.8.1. Choose the Right GEMM Variant
By Precision Requirements:
High Precision:
f32f32f32of32for maximum accuracyBalanced:
bf16bf16f32of32for good accuracy with reduced memoryQuantized:
u8s8s32os32ors8s8s32os8for inference
By Performance Needs:
Single Operation: Standard GEMM functions
Multiple Operations: Batch GEMM functions
Repeated Operations: Use matrix reordering
5.8.2. Data Type Naming Convention
Function names follow the pattern: [input_A][input_B][accumulation]o[output]
f32= float32bf16= bfloat16u8= uint8s8= int8s32= int32
Example: bf16bf16f32of32 = bfloat16 inputs, float32 accumulation and output
5.9. See Also
API Overview - API design principles and usage patterns
GEMM Operations - GEMM operations documentation
Post-Operations - Post-operations framework
6. Indices and Tables
Index
Module Index
Search Page