UALink-Ultra Accelerator Link
Training
 

Training

NEW COURSE AVAILABLE FOR BOOKING NOW!

Let MindShare Bring "Ultra Accelerator Link (UALink) Architecture" to Life for You

The UALink specification outlines a set of protocols and interfaces designed to form systems using multiple System Nodes aimed at AI applications. Each System Node normally includes one or more Host CPUs and Accelerators connected by various interconnects such as CXL, PCIe, AMD Infinity Fabric, etc. These nodes typically allow coherent access across Host and Accelerator memory within the node. Each node operates under a single OS image. UALink supports low-latency communication between Accelerators across System Nodes, allowing direct read, write, and atomic transactions. This inter-node communication is restricted to Accelerator initiated traffic. UALink protocol does not permit Host CPU access to memory across different System Nodes.

The UALink specification defines the UALink interface between different physical devices connected by a Link. The specification not only includes the traditional description of a serial bus stack (Transaction, Data Link and Physical Layer), but also includes a description of a UALink Protocol Level Interface (UPLI) also referred to as the Protocol Layer. The UPLI facilitates the standardization of an interface between a component and the UALink stack whether the component be an Accelerator core or a Switch core.

The UALink physical interface is a high bandwidth 200GT/s per Lane, maximum 4 Lanes, point-to-point serial interface Link between Accelerators and Switches, promoting a switching ecosystem for Accelerators. The UALink allows for up to 1024 Accelerators or endpoints in a network to communicate using a 10-bit Identifier.

MindShare’s comprehensive UALink 1.0 Architecture course provides a solid foundation of platform architectures and use cases of the UALink protocol. The course details the role of each of the layers in the stack. These include the Protocol Layer, Transaction Layer, Data Link Layer and Ethernet PHY Layer. We explain UALink Address Translation Model, Remote Memory Access (RMA) and UALink IO Coherency Model, UPLI operating rules, RAS, manageability and UALink Switch architecture, including configuration.

MindShare Courses On UALink Architecture:

Course Name
Classroom

Virtual Classroom

eLearning
UALink 1.0 Architecture 
3 days

3 days
Notify Me When Available
UALink Fundamentals 
1 day

1 day
Notify Me When Available

All of MindShare's classroom and virtual classroom courses can be customized to fit the needs of your group.


UALink Architecture Course Info

You Will Learn:

  • UALink spec terminology
  • Function of each layer that makes up the UALink port stack
  • UPLI interface definitions and operating rules along with interface signals
  • Tx and Rx compression caches
  • TL Flit and Half Flit formats as well as DL Flit format
  • DL message format
  • TL Flow Control and DL Replay support
  • Physical Layer encode/decode and FEC plus transmit and receive functions
  • UALink RAS support
  • Reset, signaling and connection
  • Manageability requirements
  • UALink Switch architecture and routing schemes

Course Length: 3 Days

Who Should Attend?

This course is hardware-oriented but is suitable for both hardware design and software engineers. The course is ideal for RTL-, chip-, system- or system board-level design, verification and validation engineers who need a broad and in-depth understanding of UALink system architecture

Course Outline:

 

3-Day Class

1-Day Class

Topics

UALink 1.0

UALink Fundamentals

UALink Features and Architecture Overview

 

 

Accelerator System Node and Multi-Node architecture

X

X

UPLI and Stack Layers overview

X

X

Address Translation Model

X

X

RAS overview

X

X

UALink Switch overview

X

X

UPLI and Protocol Layer

 

 

UPLI Interface and Interface Signals

X

 

Requests and Response paths 

X

 

Time Division Multiplexing

X

 

UPLI Flow Control

X

 

UPLI Transactions

X

 

Ordering Rules for Read and Write Requests and Responses

X

 

Transaction Layer

 

 

TL features

X

 

TL Flit and Half Flit formats, sequencing and packing

X

 

TL Write Flit Sequence Encoding

X

 

Tx and Rx Compression Caches

X

 

TL Flow Control and Backoff Modes

X

 

Data Link Layer

 

 

DL features

X

 

DL Flit formats and packing 

X

 

DL Messages

 X 

 

Transmitter Pacing

X

 

Link Level Replay

X

 

DL States and Errors

X

 

Physical Layer

 

 

PL features 

X

 

Reconciliation Sublayers

X

 

PCS/PMA modifications

X

 

PCS and FEC

X

 

Low Latency FEC Interleave

X

 

UPLI Reset, Signaling and Connection

 

 

UPLI Reset and Signaling

 

UPLI Control

X

 

RAS and Error Handling

 

 

RAS requirements and End-to-End Data Protection

X

 

RAS error types

X

 

RAS error handling

X

 

Manageability

For Accelerators and System Nodes

X

 

For Switches and Switch Platforms

X

 

For Pod Controllers and Virtual Pods

X

 

Manageability Workflows

X

 

UALink Switches

 

 

Bifurcation support

X

 

Lossless Request and Response delivery

X

 

Non-blocking architecture

X

 

Forward progress guarantee

X

 

Ordering Rules and Virtual Channels

X

 

Routing Table structure

X

 

Configuration

X

 

Switch debug recommendations and Latency/Performance goals

X

 

 

Recommended Prerequisite:

Computer architecture fundamentals. Suggested knowledge of high-speed serial bus architectures such as PCIe

Training Materials:

  1. Downloadable PDF version of the presentation slides
  2. UALink eLearning course when available