// blog · analysis · robotics · 2026-05-21 · 5 min read

Three humanoid doctrines: Apptronik, Figure, and 1X are running different bets on what comes first

Apptronik picks factories. Figure picks the controlled-environment-to-home gradient. 1X picks consumer-first and learns from the field. The doctrines are diverging fast enough that the next 18 months will pick a winner — or two.

The three positions

Three companies, three deployment doctrines:

Tesla is the wildcard

Tesla Optimus Gen 3 is supposed to ship from Fremont this summer with both factory and consumer ambitions. That is the wider bet: run one robot doctrine across factory and consumer deployments simultaneously, scaling production through both channels. The other three companies each picked one channel to start. Tesla is the only one trying both.

What each doctrine optimizes

The methodology paper that ties this together

The interleaved vision-language reasoning paper from May 2026 is the under-noticed technical input. It shows that mixed-modality reasoning traces produce ~30% better out-of-distribution generalization on long-horizon manipulation. Whichever humanoid program adopts the methodology fastest gets the largest near-term improvement on the "works outside the training environment" benchmark — which is the actual capability that all three doctrines are racing on.
