Frontier models — ai-blogs.org

THINKINGMACHINES.AI·2026-05-11

Mira Murati's Thinking Machines unveils "interaction models" — 0.4-second full-duplex AI

Former OpenAI CTO's startup announces TML-Interaction-Small: a model designed to handle voice, video, and text simultaneously, respond in 0.40 seconds, and interrupt mid-sentence rather than waiting for its turn.

models · UX→

OPENAI.COM·2026-05-05

OpenAI ships GPT-5.5 Instant: smarter, clearer, more personalized

A point-release iteration on GPT-5 focused on response quality, reduced hallucinations, and finer-grained personalization controls. Available in the API and ChatGPT.

models→

RESEARCH.IBM.COM·2026-04-29

IBM ships Granite 4.1: dense 8B that matches 32B MoE, plus vision/speech/safety variants — all Apache 2.0

Granite 4.1 covers 3B / 8B / 30B language models, Granite Vision 4.1 (top score on 7 chart/table/KVP extraction benchmarks), two ASR speech models, embeddings, and a Granite Guardian 4.1 safety classifier — every variant under Apache 2.0. The 8B dense model reportedly matches or beats 32B MoE systems.

open-source · models→

MISTRAL.AI·2026-04-29

Mistral Medium 3.5 lands — capstone on a six-week release blitz

Mistral Medium 3.5 (Apr 29) is a frontier multimodal model targeted at agentic and coding workloads. It caps a stretch in which Mistral shipped Small 4 (unifying Magistral/Pixtral/Devstral), Voxtral TTS, Leanstral for formal proofs, and the Forge enterprise platform, all between March 16 and the end of April.

open-source · models→

BLOGS.NVIDIA.COM·2026-04-28

NVIDIA ships Nemotron 3 Nano Omni — 30B hybrid Mamba-Transformer MoE (3B active), multimodal for agents

Nemotron 3 Nano Omni (April 28) unifies vision, audio, and text in one open multimodal model. The architecture is the interesting bit: a hybrid Mamba-Transformer MoE with 30B total parameters, of which only 3B are activated per forward pass.

open-source · models · agents→

QWEN.AI·2026-04-20

Alibaba ships Qwen 3.6 Plus and Qwen 3.6 Max Preview — agentic push in two-week tempo

Qwen 3.6 Plus dropped April 2; Qwen 3.6 Max Preview followed April 20. Alibaba's framing: "accelerating agentic AI deployment for enterprises and Alibaba's AI applications." Built on the Qwen 3.5 native-multimodal foundation from February, which supports 201 languages.

open-source · models · china→

BLOG.GOOGLE / DEEPMIND.GOOGLE·2026-04-02

Google Gemma 4 ships under Apache 2.0 — four sizes, MoE, multimodal, 256K context

Gemma 4 (April 2) arrives in four variants (E2B, E4B, 26B MoE, and 31B Dense), with native image and video input on every size and native audio on the smaller models. It offers a 256K context window, supports 140+ languages, and is oriented toward agentic workflows. The 31B Dense model reportedly reached #3 on Arena's text leaderboard.

open-source · models→
