SoftBank Predicts AMD GPUs Will Enhance AI Capability Through ‘Divide and Conquer’ Computing Strategy

SoftBank has launched a project to improve how AMD’s Instinct chips perform on AI workloads. The initiative relies on a “GPU partitioning” technique that has drawn considerable interest in the tech community.

SoftBank Implements Custom Orchestrator for AMD’s Instinct GPUs

AMD’s AI infrastructure has drawn little attention from hyperscalers recently, largely because of NVIDIA’s dominance and the recent unveiling of its Blackwell series, but companies like SoftBank remain keen to use AMD’s technology. In a recent blog post, SoftBank’s tech division introduced an Orchestrator that works with AMD’s Instinct AI chips and dynamically allocates computational resources according to workload demands and resource availability.

In collaboration with AMD, SoftBank has developed an enhanced Orchestrator feature that leverages the GPU partitioning capabilities of AMD Instinct™ GPUs, which allow a single GPU to be used as multiple logical devices. This feature allows for the flexible and optimal allocation of GPU resources based on the requirements of the AI application, such as model size and concurrency.

– SoftBank

Technically, SoftBank’s Orchestrator focuses on distributing workloads efficiently within AMD’s Instinct GPUs. Using GPU instances configured on individual Accelerator Complex Dies (XCDs), it can run a GPU in several modes, from a single-instance mode (SPX) to a configuration that supports up to eight instances (CPX). This gives it fine-grained control across different workloads. The Orchestrator also takes advantage of the GPUs’ large memory capacity, segmenting the high-bandwidth memory (HBM) into distinct regions for each GPU instance.
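To make the trade-off between the two modes concrete, here is a minimal Python sketch of how an allocator might choose between SPX and CPX from model size and concurrency. It is not SoftBank’s implementation: the names `Plan` and `choose_partition`, the even split of HBM across instances, the rule itself, and the 128 GB capacity used in the example are all assumptions made for illustration. Only the mode names and their instance counts come from the description above.

```python
from dataclasses import dataclass

# Instance counts per mode as described in the article: SPX exposes the GPU
# as one device, CPX exposes up to eight instances.
MODE_INSTANCES = {"SPX": 1, "CPX": 8}


@dataclass(frozen=True)
class Plan:
    """Hypothetical result type: which mode to use and what each instance gets."""
    mode: str
    instances: int
    hbm_per_instance_gb: float


def choose_partition(model_gb: float, concurrency: int, total_hbm_gb: float) -> Plan:
    """Pick a partition mode for one GPU (illustrative heuristic only).

    Assumption: HBM is divided evenly between instances. CPX is chosen only
    when several requests run at once and the model fits in one CPX slice;
    otherwise the whole GPU is used as a single SPX device.
    """
    if model_gb <= 0 or total_hbm_gb <= 0 or concurrency < 1:
        raise ValueError("sizes must be positive and concurrency at least 1")
    if model_gb > total_hbm_gb:
        raise ValueError("model does not fit in a single GPU's HBM")

    cpx_instances = MODE_INSTANCES["CPX"]
    cpx_slice_gb = total_hbm_gb / cpx_instances
    if concurrency > 1 and model_gb <= cpx_slice_gb:
        return Plan("CPX", cpx_instances, cpx_slice_gb)
    return Plan("SPX", MODE_INSTANCES["SPX"], total_hbm_gb)


if __name__ == "__main__":
    # 128 GB is an illustrative capacity, not a product specification.
    print(choose_partition(model_gb=8, concurrency=4, total_hbm_gb=128))
    print(choose_partition(model_gb=70, concurrency=4, total_hbm_gb=128))
```

Under these assumptions, a small model served to several users at once lands in CPX with a 16 GB slice per instance, while a model too large for one slice falls back to SPX and keeps the full memory. A real orchestrator would also need to weigh the intermediate modes, current resource availability, and the cost of switching modes, none of which is modeled here.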

Figure: 'SoftBank Orchestrator: Optimizing AMD GPU Resources', comparing 'Before: Monolithic Allocation' and 'After:' (Image credit: SoftBank)

With this Orchestrator, SoftBank aims for finer control over computational resources, with strict isolation at the hardware level so that workloads sharing a GPU do not cause unpredictable latency for one another. Although no specific performance figures have been disclosed, SoftBank claims that its approach delivers “optimal resource allocation,” particularly for small and medium-sized language model (SLM and MLM) workloads. The company plans to adapt such orchestrators for other AI accelerators, but for the moment the focus remains on AMD technology.
