AMD

Instinct MI325X

2.6K FP8 TFLOPS · 288GB HBM3e · 750W

Score: 68
LLM Training · ROCm Workloads

Specifications

Architecture: CDNA 3+
Memory: 288GB HBM3e
Memory Bandwidth: 6,000 GB/s
FP16 TFLOPS: 1,307
FP8 TFLOPS: 2,614
BF16 TFLOPS: 1,307
INT8 TOPS: 2,614
TDP: 750W
Interconnect: Infinity Fabric (896 GB/s)
Ecosystem: ROCm
Generation: Previous
Est. Price: $22,000

Recommended Configuration

8× MI325X in OAM platform
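
As a rough illustration of what the 8-GPU configuration adds up to, the sketch below multiplies the per-GPU figures from the Specifications list by eight. It assumes simple linear scaling, which ignores interconnect limits, host overhead, and cooling; the function names `node_totals` and `weights_gb` are made up for this example, and the 2-bytes-per-parameter figure covers raw BF16/FP16 weights only, not optimizer state or activations.

```python
# Per-GPU figures copied from the Specifications list on this page.
MEMORY_GB = 288
BANDWIDTH_GB_S = 6_000
FP8_TFLOPS = 2_614
TDP_W = 750
GPUS_PER_NODE = 8  # the recommended OAM platform


def node_totals(gpus: int = GPUS_PER_NODE) -> dict:
    """Naive linear sums across GPUs (assumption: no scaling losses)."""
    return {
        "memory_gb": gpus * MEMORY_GB,
        "bandwidth_gb_s": gpus * BANDWIDTH_GB_S,
        "fp8_tflops": gpus * FP8_TFLOPS,
        "gpu_power_w": gpus * TDP_W,  # GPUs only, excludes CPUs, NICs, fans
    }


def weights_gb(params_billion: float, bytes_per_param: int = 2) -> float:
    """Raw weight size in GB; 2 bytes per parameter for BF16/FP16."""
    return params_billion * bytes_per_param


totals = node_totals()
print(totals)
print(f"70B weights in BF16: {weights_gb(70)} GB "
      f"of {totals['memory_gb']} GB pooled HBM")
```

Under these assumptions, one node pools 2,304 GB of HBM3e, so the BF16 weights of a 70B model occupy a small fraction of it; training needs considerably more memory than the weights alone.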

Training Intelligence

ROCm
PyTorch
TensorFlow
JAX
DeepSpeed

Training Time Estimates

LLaMA 70B (70B): ~11 days on 64 GPUs
GPT-3 175B (175B): ~12 days on 1024 GPUs
Stable Diffusion XL (3.5B): ~18 hrs on 8 GPUs

Cloud cost: $6.50/hr
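
The training time estimates and the hourly rate can be combined into a rough budget. The sketch below is only an illustration: it assumes the $6.50/hr figure is charged per GPU (the page does not say whether it is per GPU or per node), treats the listed durations as exact, and uses a hypothetical helper named `run_cost`.

```python
# Assumption: the listed cloud rate applies per GPU, per hour.
HOURLY_RATE_USD = 6.50

# (GPU count, wall-clock hours) taken from the training time estimates.
ESTIMATES = {
    "LLaMA 70B": (64, 11 * 24),
    "GPT-3 175B": (1024, 12 * 24),
    "Stable Diffusion XL": (8, 18),
}


def run_cost(gpus: int, hours: float, rate: float = HOURLY_RATE_USD):
    """Return (GPU-hours, estimated cost in USD) for one training run."""
    gpu_hours = gpus * hours
    return gpu_hours, gpu_hours * rate


for name, (gpus, hours) in ESTIMATES.items():
    gpu_hours, cost = run_cost(gpus, hours)
    print(f"{name}: {gpu_hours:,} GPU-hours, about ${cost:,.0f}")
```

If the rate is instead per 8-GPU node, the totals drop by a factor of eight, so the billing unit is worth confirming before budgeting.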
