Open-source

Weights, fine-tunes, runtimes, and the projects keeping the field auditable.

DEEPCOGITO.COM·

Deep Cogito v2: open-source models that internalize their own reasoning

San Francisco startup founded by ex-Googlers ships four open-source hybrid reasoning models — 70B, 109B, 405B, 671B — using a technique called Iterated Distillation and Amplification (IDA) to distill search-time reasoning back into model weights.

open-source · research
TECHCOMMUNITY.MICROSOFT.COM·

Microsoft Phi-4 family expands: -mini, -multimodal, -reasoning, -reasoning-vision

Microsoft's small-language-model bet now includes Phi-4-mini, Phi-4-multimodal (text, audio, and vision in one model), Phi-4-reasoning, Phi-4-reasoning-plus, Phi-4-mini-reasoning, and Phi-4-reasoning-vision. Phi-4-reasoning reportedly beats DeepSeek-R1-Distill-Llama-70B on most benchmarks despite being far smaller.

open-source · small-models
DEEPSEEK / HUGGINGFACE·2026-05-22

DeepSeek V4-Flash holds 1M context under MIT — 284B/13B-active MoE proves the Flash-tier-open-frontier convergence

DeepSeek's V4-Flash variant (284B total / 13B active parameters, 1M context, MIT license) holds production-tier capability at hyperscaler-routable scale. Combined with V4-Pro (1.6T total / 49B active, 80.6 SWE-Bench Verified, 90.1 GPQA Diamond), DeepSeek now ships the most operationally credible open-weight Pro/Flash split. The 1M context retention in Flash is the structural detail that erases the case for routing to Pro on long-document workloads.

open-source · frontier-models
DEEPSEEK / HUGGINGFACE·2026-05-22

DeepSeek V4 Pro vs Flash — the procurement decision tree clarifies at MIT-licensed weights

DeepSeek's V4 release (April 24) shipped two SKUs: V4-Pro (1.6T total / 49B active parameters, 80.6 SWE-Bench Verified, 90.1 GPQA Diamond) and V4-Flash (284B total / 13B active, 1M context). Both run under the MIT license, both ship at 1M context, and both clear the bar for production deployment on coding and reasoning workloads. The Pro/Flash bifurcation now mirrors the closed-flagship pricing curve at a fraction of the cost.

open-source · frontier-models
MISTRAL / CODERSERA·2026-05-22

Mistral Medium 3.5 lands as the EU-friendly coding pick — 77.6% SWE-Bench at sovereign-jurisdiction licensing

Mistral Medium 3.5 (April 29 release) lands at 77.6% on SWE-Bench Verified with EU-friendly licensing terms — the strongest sovereign-jurisdiction coding-model offering in the May 2026 lineup. Combined with Mistral Large 3 (675B / 41B active MoE) and the Voxtral TTS, Forge, and Leanstral releases earlier in the year, Mistral's 2026 H1 cadence is closer to Qwen's monthly tempo than to its prior quarterly pattern.

open-source · tools
ALIBABA / QWEN·2026-05-22

Qwen 3.6-35B-A3B and Qwen 3.6-27B ship as open weights — Alibaba presses the cadence advantage with monthly drops

Alibaba's Qwen 3.6-35B-A3B and Qwen 3.6-27B (both Apr 2026) continue the team's roughly monthly release cadence across 2026 H1. Combined with Qwen 3.5 (Feb 2026, a 397B MoE with unified vision-language and 201 languages) and Qwen 3.6 Plus / Max Preview (Apr 2 and Apr 20), Alibaba now has the most aggressive open-weights release schedule among Tier 1 labs.

open-source · models
OPENROUTER / STATE OF AI·2026-05-21

Chinese open-weight models now account for more than 60% of OpenRouter usage — a 60× jump in 18 months

Air Street's State of AI May 2026 report shows Chinese open-weight models — DeepSeek, Qwen, Kimi, GLM — went from roughly 1% of OpenRouter usage in mid-2024 to more than 60% in May 2026. The shift tracks a 5–20× price-per-token gap to closed flagships and a near-elimination of the capability gap on most evaluation suites.

open-source · industry
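The headline arithmetic can be checked with a short sketch. The share figures (roughly 1% and roughly 60%) and the 18-month window are taken from the item above; the assumption of smooth compound monthly growth is purely illustrative and is not something the report claims.

```python
# Usage shares quoted in the item above, in percent of OpenRouter usage.
START_SHARE_PCT = 1.0   # roughly 1% at the start of the window
END_SHARE_PCT = 60.0    # more than 60% at the end of the window
MONTHS = 18             # window length stated in the headline

# Fold increase over the whole window.
fold = END_SHARE_PCT / START_SHARE_PCT

# Illustrative assumption: if growth had compounded evenly each month,
# this is the monthly multiplier that would produce the same fold increase.
monthly_multiplier = fold ** (1 / MONTHS)

print(f"fold increase: {fold:.0f}x")
print(f"implied monthly multiplier: {monthly_multiplier:.3f}")
```

Because the quoted end share is a lower bound ("more than 60%"), the fold increase computed here is a lower bound as well.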
DEEP COGITO·2026-05-21

Deep Cogito v2 ships 70B/109B/405B/671B open-weight family with Iterated Distillation & Amplification self-improvement loop

Deep Cogito's v2 release ships four open-weight sizes (70B, 109B, 405B, 671B) wired into an Iterated Distillation & Amplification (IDA) self-improvement loop. The release positions IDA as a deployable architecture rather than a research curiosity — the first open-weight family where the "model improves itself between checkpoints" methodology is shipped as the default training recipe.

open-source · frontier-models · research
DEEPSEEK·2026-05-21

DeepSeek V4 Flash quietly extends 1M context to standard tier: MIT-licensed weights match closed-flagship reasoning on Pass@1

DeepSeek extended the 1M context window to its V4 Flash tier this week, pushing the cheaper standard SKU into a capability bracket previously occupied only by V4 Pro and closed flagships. Combined with the unchanged 80.6% SWE-Bench Verified ceiling and the MIT license, the practical effect is to compress the price-quality gradient on long-context production workloads.

open-source · frontier-models
MISTRAL·2026-05-21

Mistral Medium 3.5 ships as the EU-friendly coding pick — 77.6% SWE-Bench Verified at open-weight Apache pricing

Mistral Medium 3.5, released April 29 and now widely available across cloud providers, hit 77.6% SWE-Bench Verified — putting it within striking distance of Qwen 3.5 and DeepSeek V4 on coding while shipping under Apache 2.0 from a Paris-based lab. For EU enterprises navigating data-residency-plus-IP-clarity procurement constraints, the model is the most defensible production-tier coding choice currently available.

open-source · tools
SOURCE·2026-05-21

The China-share tipping point — when did the OpenRouter graph cross 50%?

Sometime in early 2026, Chinese open-weight models crossed 50% of OpenRouter usage. The exact moment matters less than the realization that production share has already migrated. The policy conversation is still arguing over a front line that has already moved.

analysis · open-source
DEEPSEEK / LLM-STATS·2026-05-20

DeepSeek V4 ships under MIT license — 1.6T Pro and 284B Flash, both at 1M context

DeepSeek released V4 (Pro at 1.6T total / 49B active, Flash at 284B total / 13B active) on April 24 under MIT licensing. Both variants ship with 1M token context. V4 Flash pricing of $0.14/M input is the floor for the open-weight frontier and is forcing competing labs to reprice or differentiate on capability.

open-source · model · china
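To put the quoted floor price in concrete terms, here is a minimal sketch of input-token cost at $0.14 per million tokens. Only the input price comes from the item above; the function name is hypothetical, and output-token pricing is not given in the source, so it is left out.

```python
from decimal import Decimal

# V4 Flash input price quoted in the item above (USD per million input tokens).
FLASH_INPUT_USD_PER_M = Decimal("0.14")

def input_cost_usd(tokens, usd_per_million=FLASH_INPUT_USD_PER_M):
    """Cost in USD of sending `tokens` input tokens at the given price.

    Hypothetical helper for illustration; it ignores output tokens,
    caching discounts, and any provider markup.
    """
    return usd_per_million * Decimal(tokens) / Decimal(1_000_000)

# Filling the full 1M-token context window once.
full_context = input_cost_usd(1_000_000)
# A 200K-token document.
long_document = input_cost_usd(200_000)

print(f"1M-token prompt:   ${full_context}")
print(f"200K-token prompt: ${long_document}")
```

`Decimal` is used instead of floats so that small per-request costs add up without rounding drift when summed over many calls.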
META / MISTRAL / CODERSERA·2026-05-20

Meta Llama 4 and Mistral Medium 3.5 anchor the European-American open-weight tier

Meta shipped Llama 4 in April 2026 with Scout (17B active / 109B total MoE, runnable on 10GB VRAM) and Maverick (17B active / 400B total). Mistral Medium 3.5 launched April 29 — a 128B dense model hitting 77.6% on SWE-bench Verified, the best single-vendor coding stack outside the Anthropic and OpenAI labs.

open-source · model
MISTRAL AI·2026-05-20

Mistral Large 3 ships as 675B / 41B sparse MoE under Apache 2.0

Mistral Large 3 lands as a 675B-total / 41B-active sparse Mixture-of-Experts model under Apache 2.0 licensing. The architecture choice mirrors DeepSeek V4 and Llama 4 Maverick — the open-weight tier has converged on sparse MoE as the default frontier architecture.

open-source · architecture
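The sparse-MoE convergence is easier to see as a ratio of active to total parameters. The sketch below uses only the parameter counts quoted in the items on this page; the dictionary and function names are illustrative, and the ratio says nothing about memory footprint, since all experts still have to be stored.

```python
# (total, active) parameter counts in billions, as quoted on this page.
MODELS = {
    "DeepSeek V4-Pro": (1600, 49),
    "DeepSeek V4-Flash": (284, 13),
    "Mistral Large 3": (675, 41),
    "Llama 4 Maverick": (400, 17),
}

def active_fraction(total_b, active_b):
    """Share of a model's parameters that are active, per the quoted figures."""
    return active_b / total_b

fractions = {
    name: active_fraction(total, active)
    for name, (total, active) in MODELS.items()
}

for name, frac in fractions.items():
    print(f"{name}: {frac:.1%} of parameters active")
```

On these figures every model in the group keeps well under a tenth of its parameters active.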
SOURCE·2026-05-20

The open-weights rebound: capability parity at one-tenth the price

DeepSeek V4 under MIT, GLM-5.1 at $0.18/M, Kimi K2.6 at 256K context, Llama 4 Maverick. The open-weight frontier is now within a few SWE-bench points of closed flagships at one-tenth the input cost. The structural implications run deeper than pricing.

analysis · open-source · industry
PRESS / LABS·2026-05-19

Four Chinese open-weights labs shipped frontier-class models in a 12-day window

Z.ai (GLM-5.1), MiniMax (M2.7), Moonshot (Kimi K2.6), and DeepSeek (V4) all landed in a 12-day window in early-to-mid May 2026 — all clearing 75%+ on SWE-bench Verified, all priced below $0.30/M input tokens, all permissively licensed for commercial use.

open-source · model · china
AXIOS / SILICONANGLE·2026-05-19

Meta confirms open-source Avocado and Mango variants alongside closed flagships

Meta has confirmed it will release open-weights versions of its next two frontier models, codenamed Avocado and Mango, while keeping the largest variants proprietary — a hybrid strategy that splits the difference between Llama's open-source heritage and the closed-model economics of rival labs.

frontier-models · open-source · meta
MISTRAL.AI·2026-04-29

Mistral Medium 3.5 lands — capstone on a six-week release blitz

Mistral Medium 3.5 (Apr 29) is a frontier multimodal model targeted at agentic and coding workloads. It's the headline at the end of a stretch where Mistral shipped Small 4 (unifying Magistral/Pixtral/Devstral), Voxtral TTS, Leanstral for formal proofs, and the Forge enterprise platform — all between March 16 and end of April.

open-source · models
BLOG.GOOGLE / DEEPMIND.GOOGLE·2026-04-02

Google Gemma 4 ships under Apache 2.0 — four sizes, MoE, multimodal, 256K context

Gemma 4 (April 2) arrives in E2B / E4B / 26B MoE / 31B Dense variants with native image+video everywhere and native audio on the smaller models. 256K context, 140+ languages, agentic-workflow-oriented. The 31B Dense reportedly hit #3 on Arena's text leaderboard.

open-source · models
ALIBABA QWEN / MARKTECHPOST·2026-03-30

Alibaba Qwen 3.5 Omni — native multimodal text/audio/video with sub-300ms TTFT

Qwen 3.5 Omni (released March 30) is a native multimodal model handling text, audio, video, and real-time interaction. Real-time audio time-to-first-token comes in below 300ms with 95%+ ASR accuracy — the relevant numbers for actual voice-assistant deployment.

multimodal · open-source · models