// news · open-source · compute2026-05-24source: zyphra / amd / whatllm

Zyphra ships ZAYA1-8B — 760M-active-parameter MoE running frontier-adjacent reasoning on AMD silicon

Zyphra released ZAYA1-8B in early May — an 8B-parameter MoE architecture with 760M active parameters per token, running frontier-adjacent reasoning workloads on AMD silicon rather than NVIDIA. The release is the strongest validation yet that AMD's AI software stack has matured enough to host production-grade efficient MoE training without the NVIDIA CUDA dependency.

The technical achievement is twofold. First, 760M active parameters per token is an aggressive sparsity ratio that delivers competitive reasoning performance at meaningfully lower inference cost than dense 8B models. Second, training and serving on AMD silicon (ROCm + MI300X-class hardware) at this architectural sophistication has been the open question for AMD's frontier AI ambitions for three years. ZAYA1-8B is the answer.

The market consequence is that AMD's AI growth trajectory — total data-center revenue forecast to rise 73% to $28.7B in 2026 — has architectural credibility behind it, not just supply-availability arguments. Frontier labs that have been compute-constrained by NVIDIA allocation now have a viable second-source path for both training and inference on novel architectures. The diversification of frontier AI compute away from NVIDIA-only is happening faster than the 2024 conventional wisdom expected.

See our analysis →

WhatLLM — New AI Models May 2026: The Frontier Took a Breath, Architecture Took the Stage → · S&P Global — AMD's next-generation AI chips set to power 2026 data center growth → · Hugging Face — Best Open-Source LLM Models in 2026 →