The processing system (PS) provides general-purpose, high-performance compute power with familiar operating environments. It includes the multi-core application processing unit (APU) subsystem and the multi-core real-time processing unit (RPU) subsystem. Linux and bare-metal software stacks can execute in the APU and RPU in a homogeneous or a heterogeneous environment.
The PMC subsystem provides device boot and management functions with its RCU BootROM code unit and the platform loader and management (PLM) firmware running in the PMCs PPU processor.
The device also includes a NoC interconnect, multiple DDR memory controllers, the PL, and integrated peripherals.
The following list describes the largest hardware view components that are available in Versal Prime Series Gen 2 and Versal AI Edge Series Gen 2 devices.
- APU
- The application processing unit (APU) consists of either
dual-core or quad-core clusters (one to four clusters per device) featuring
Cortex-A78AE processor cores, L1, L2,
L3, SLC caches, and related functionality. The Cortex-A78AE cores and caches are part of Arm MPCore IP.
Versal Prime Series Gen 2 and Versal AI Edge Series Gen 2 devices have Cortex-A78AE cores implement Armv8 64-bit architecture, and includes 1MB L3 cache per cluster and single 4MB SLC.
For more information, refer to Application Processing Unit section in Versal AI Edge Series Gen 2 and Prime Series Gen 2 Technical Reference Manual (AM026).
- NoC Interconnect
-
The Network on Chip (NoC) interconnect spans the entire device to enable processors and DMA units to reach destinations. The global address maps are based on the NoC interconnect.
The configurable NoC is an AXI4-based network to route high-bandwidth, real-time, and low-latency connections. The NoC extends in both horizontal and vertical directions to the edges of the device. The multichannel structures provide several options for routing and isolating traffic. The NoC is a full blocking crossbar between memory controllers, programmable logic, processing system, integrated hardware, integrated peripherals, and the platform management controller.
NoC connections include:
- DDR memory controller ports
- PL to PL connections
- Memory mapped access to integrated hardware and peripherals
- Connecting between and PL
NoC functionality is described in the Versal Adaptive SoC Programmable Network on Chip and Integrated Memory Controller LogiCORE IP Product Guide (PG313).
- RPU
- The real-time processing unit (RPU) consists of dual-core
clusters (two to five clusters per device) featuring Cortex-R52 processor cores,
based on the Armv8-R architecture. The dual
cores can operate as two Dual-Core Lock-Step (DCLS) processing units with each
core being the redundant copy of another one and inclusion of the necessary
redundant comparators.
For more information, refer to Real-time Processing Unit section in Versal AI Edge Series Gen 2 and Prime Series Gen 2 Technical Reference Manual (AM026).
- DDR Memory Controller
- The device includes one or more enhanced DDR5/LPDDR5 memory
controllers (DDRMC5E and DDRMC5X) that are accessible through the NoC
interconnect.
The DDR memory controller includes the option for AES-GCM or AES-XTS encryption. A built-in hardware masking feature is available when using AES-GCM encryption to provide resistance to dynamic power analysis (DPA) or side channel analysis (SCA).
For more information, refer to DDR Memory Controller section in Versal AI Edge Series Gen 2 and Prime Series Gen 2 Technical Reference Manual (AM026).
- Platform Management Controller
- The PMC includes a ROM code unit (RCU) processor, the platform processing unit (PPU) that runs the platform loader and manager (PLM) firmware, the boot interfaces, and the voltage/temperature system monitor (SYSMON).
- ROM Code Unit
- The RCU processor executes the BootROM code to provide hardware boot. The BootROM code is the first to run after a device-level reset; this can include a power-on reset or software reset. The BootROM code initializes the device, enables the boot interface, and processes the boot header. To finish the hardware boot process, the RCU loads the PLM firmware into the PMC microprocessor and relinquishes control of the system to the PLM. After the PLM takes system control, the RCU switches to a services mode, which includes system monitoring and service execution including in-place PLM firmware update.
- Platform Loader and Manager Firmware
- The PLM firmware performs several tasks that include device configuration and partial-reconfiguration of the programmable logic. The PLM firmware loads the Applications Security Unit (ASU) firmware to provide security functionality to the processing systems.
- Boot Interfaces
- The boot interfaces include those for flash memory controllers for autonomous boot modes and the select map and JTAG boot interface for managed boot modes.
- Voltage and Temperature System Monitor
- The voltage and temperatures in the SoC are measured by the system monitors (SYSMON). Various voltage rails are monitored to detect out of range measurements for safety and anomalies for security. It also monitors under and over temperature conditions.
- Security Features
- The PMC includes an AES accelerator with GCM modes, SHA2,
SHA3, public key cryptographic algorithms RSA with elliptic curve cryptography
(ECC), true random number generator (TRNG), a physically unclonable function
(PUF) to create a signature, and a public key infrastructure accelerator.
For more information, refer to Versal AI Edge Series Gen 2 and Prime Series Gen 2 Technical Reference Manual (AM026).
- Programmable Logic
- The PL is a scalable structure that provides the ability to
create many possible functions. The integrated hardware options have
interconnect interfaces and connections to the PL. The PL I/O includes both
LVCMOS buffers and gigabit transceivers that cover a wide range of applications
and frequencies. For more information, see the Programmable Logic section.
The programmable logic supports AXI SmartConnect core functionality that can be instantiated using a library of AMD LogiCORE™ IPs. The AXI SmartConnect core can be independent within the PL or extended and attached to the processing system through several AXI interfaces with and without coherency with the APU system cache.
- Cache Mesh Network
- To minimize memory latency and throughput, processor systems use caches. To keep caches coherent, the coherent interconnect is required. The CMN is based on the Arm CMN-600 AE with its snoop filter (SF) table feature. It provides tight memory coherency between the FPD System Cache and a PL system cache using the CHI interface protocol to support multiple heterogeneous processing environments. It is part of the FPD interconnect.
- Tightly coupled memory (TCM) in the RPU
- This memory is 512 KB and is mainly used by the RPU but can be accessed by the APU.
- On-chip memory (OCM)
- Versal Prime Series Gen 2 and Versal AI Edge Series Gen 2 have OCM size of 2 MB where each bank can be accessed through a dedicated 128-bit AXI interfaces via LPD interconnect.
- Application security unit (ASU)
- The ASU subsystem includes a RISC-V processor with AES, ECC,
SHA2, SHA3, TRNG, and RSA crypto accelerators. It also includes support for a
volatile user key vault and an interface to the PL for extending crypto
functionality. AMD provides ASU FW to support these features. The main purpose
of the ASU is to accelerate crypto operations and to act as the key management
unit for runtime applications running in the APU, RPU, or processors
instantiated within the PL. The ASU firmware is loaded during the boot process
by the PLM firmware. This is done in a secure manner with authentication and/or
decryption. The ASU is located in the LPD power domain.
For more information, see Versal AI Edge Series Gen 2 and Prime Series Gen 2 Technical Reference Manual (AM026).
- Multimedia interface
- Simultaneous transport of USB 3.2 Gen 2x1 (up to 10 Gb/s) and
DP 1.4 with HDCP 2.3 (up to 4 Kp60 or 8 Kp30) traffic over a single USB Type-C
connector USB 3.2 Gen 2x1 dual role device Display Port transmit controller DP
AUX-I2C PHY.
10 Gb ethernet MAC (10GbE) to support 1 G, 2.5 G, 5 G, and 10 G streams.
Two PCIecontrollers for Gen 5x4 support as root complex (RC) or endpoint (EP) Four shader controllers in two slices running up to 1 GHz with support for one or two partitions
- Processing System High-speed Connectivity
- USB 3.2 Gen 2x1, PCIe®, 10GbE, Display Port and Display Controller
- Graphics Processing Unit (GPU)
- The Arm® Mali™-G78AE GPU is part of the Valhall architecture
family. G78AE architecture supports a modular configuration that adapts to both
safety-critical and performance-driven applications by organizing its shader
cores into core clusters, slices, and partitions.
For more information, refer to Graphics Processing Unit section in Versal AI Edge Series Gen 2 and Prime Series Gen 2 Technical Reference Manual (AM026).
- Integrated Hardware Options
-
- AI Engine-ML v2
- Video codec unit (VCU)
- H.264 and H.265 encoding, decoding
- JPEG decode
- Two unit, multiple streams each
- Image signal processor (ISP)
-
- Contains 3 Tiles, each contains two ISPs
- Dual output
- Single and multi-stream
- Resolutions up to 4Kp60
- Power Domains
- The device includes several power domains. The LPD is required for running the PMC to boot, manage the device, and access the PMC I/O peripherals and flash memory controllers. The LPD also provides power to the RPU. The FPD is required for the APU, the system caches, and the memory coherency subsystem. The SoC power domain (SPD) is for the NoC interconnect and DDR memory controllers. The PL power domain is for logic instantiated in the PL and also the integrated logic and peripherals.