Skip to content
9 min read Vecstrata

No Moat, No Disruptee — Open-Weight AI, Read From July 2023

A backtest: we froze a vantage the day Llama 2 went free, made four calls from only what was knowable then, and scored them against what happened. Three hits, one honest miss, and the blind Brier we didn't flatter.

Vecstrata · Backtest · June 2026

A backtest: scored history, not a live call. We froze the vantage at 2023-07-18, derived four predictions from only what was knowable that day, and scored them against what actually happened through 2026. A backtest has no edge, because hindsight is undefeatable, so this is never marketed. Its only product is what running the machinery blind on a resolved case teaches. We score backtests on a separate ledger from the live record, and the Brier below is the honest, un-cherry-picked number: the costliest line is a miss.

The Call (as of 2023-07-18)

On 2023-07-18, Meta and Microsoft released Llama 2 free for commercial use: a well-capitalized incumbent handing away a usable model. By then a Stanford team had cloned ChatGPT-class quality for about $300, and Google's own engineers had written internally that neither they nor OpenAI held a moat. The question the structural toolkit faces is plain: what happens to the model layer, to the proprietary labs, and to where the money sits?

The lens reads it as a textbook low-end disruption. A cheaper, ownable, good-enough alternative enters below the proprietary frontier and climbs; the incumbents flee upmarket toward bigger frontier models and premium customers, ceding the commodity tier. That reading is mostly right, and the way it is wrong is the lesson. The lens called the direction and the template lied about the clock. Open weights climbed faster than any hardware-era disruption ever did, the firm we named as the open anchor abandoned the role, and the labs everyone expected to be the disruptees captured the profit pool instead. The scope is global: open weights, the frontier labs, and the compute market are worldwide, with no single regulator binding the outcome.

We score four calls at confidences set blind from July-2023 uncertainty, all resolved through 2026. The stack lands at Brier 0.167, better than a coin flip, dominated by one honest miss.

The Situation

Vantage facts only: dated, sourced, knowable on or before 2023-07-18. The analysis comes after.

The Mechanics

Causal order: who moved first, and who was reacting

Read the dates and the prime mover is not the trigger. The model layer started commoditizing in 2022 with Stable Diffusion, then the LLaMA 1 leak in March 2023 turned the same dynamic loose on text, and the $300 Vicuna clone proved good-enough was already here. Meta's Llama 2 on 2023-07-18 is the loud event, but it is a player formalizing a slide that was already in motion: a complement-commoditizer putting a commercial license on what the leak had started.

The prime mover is the structural force, not any single firm. A model is a complement to Meta's ad business, so commoditizing it is rational for Meta and corrosive to the labs that sell the model itself. The proprietary labs are reacting. GPT-4's metered-rent pricing is the position under attack, and everything the labs do next is a defense of it. Keep that order straight and the template's failure becomes visible: it tempts you to narrate Meta as the agent of an open future, when Meta was riding a force it did not own and would abandon the moment the force stopped serving it.

Mechanism: the open assault is a force, not a firm, and the trigger is one player formalizing a slide a leak and a clone had already started.

What the low-end template predicts, and where its clock is wrong

Low-end disruption is a clock as much as a direction. A cheaper, ownable, good-enough product enters below the frontier on raw capability and climbs the ladder; the incumbent, unable to fund a price war to zero, flees upmarket toward the next not-good-enough dimension. The template was built on hardware, where good-enough creeps up over years. So it ships with a slow clock.

Software broke the clock. Arthur's increasing returns to adoption say open combinatorial tinkering compounds: many contributors, fast recombination, each improvement seeding the next. The Google memo named exactly this, that open source iterates in days what a lab does in months. The vantage chain flagged the accelerant and then under-priced it into the confidence anyway. That is the central finding. The direction was textbook; the template's hardware-era cadence was the thing that lied, and naming the accelerant in the reasoning is not the same as pricing it into the number.

Mechanism: the disruptive climb is real, but in software it runs on increasing-returns time, not the hardware-era creep the template was calibrated on.

Where the profit went, and why the disruptee won

Here is the move the single lens cannot make. When a layer commoditizes, the profit that drained out of it does not vanish; it relocates to the adjacent layer that stays proprietary. Christensen and Raynor named it the Law of Conservation of Attractive Profits. Low-end disruption tells you which layer commoditizes. Conservation tells you where the margin lands. They are different questions, and answering the first does not answer the second.

The model layer commoditized on schedule, and the proprietary labs were supposed to be the disruptees. Instead the margin migrated up: to the subscription tier (OpenAI's consumer ChatGPT) and the agentic-product tier (Claude Code, Codex), where weak appropriability of an open model routes returns to cospecialized assets the integrated labs own, the distribution, the product surface, the data and tuning. The firms that commoditized the model captured little of the pool. The labs that looked disrupted grew roughly tenfold. What commoditizes is not where the profit migrates. Run both lenses together or you name the wrong winner.

Mechanism: open weights won the layer and lost the profit pool, because conservation sends the released margin up to the cospecialized layer the integrated incumbents already hold.

The Predictions

Each card was derived blind from the vantage facts, scored at its July-2023 confidence, and resolved against the record. One is a control we published expecting it to miss.

1 · Open weights reach parity and take the volume tier · ✓ HIT 2025-01-27
By 2025-12-31, an open-weight model reaches frontier (GPT-4-class) parity on standard public benchmarks, and open-weight models capture the majority of new developer/deployment volume for non-frontier tasks.
Confidence 60% · Horizon 2025-12-31
Wrong if: by 2025-12-31 no open-weight model has reached published frontier parity (staying ≥1 clear generation / >1 year behind the closed frontier on standard benchmarks), or open-weight models remain a niche of deployment volume while proprietary APIs hold the developer mainstream.

2 · The labs flee upmarket, they don't defend the base · ✓ HIT 2024-09-12
By 2025-12-31, the proprietary frontier labs respond by climbing to a new performance dimension — frontier reasoning, agents, premium tiers — and do not defend the commodity inference tier by matching open-weight price to marginal cost.
Confidence 66% · Horizon 2025-12-31
Wrong if: by 2025-12-31 the proprietary labs principally defended the commodity tier — e.g. open-sourced their own frontier weights or matched open-weight pricing on the base tier as their primary strategy — rather than climbing to a differentiated premium dimension.

3 · Meta stays the durable open-weight anchor · ✗ MISS 2025-12-09
By 2025-12-31, Meta remains the durable anchor of the open-weight frontier: its commoditize-the-complement incentive sustains competitive, frontier-tracking open releases, and the persistence of the open assault rests primarily on Meta.
Confidence 55% · Horizon 2025-12-31
Wrong if: by 2025-12-31 Meta has ceased to anchor the open-weight frontier — its releases fall off the frontier or it pivots toward closed models — and the open-frontier standard-bearer role has passed to a different actor.

4 · The disruption damages the proprietary labs (the control we expected to lose) · ✗ MISS 2025-12-31
By 2025-12-31, the open-weight assault inflicts the classic disruptee trajectory on the proprietary labs: open-weight parity plus per-token price collapse compress OpenAI's and Anthropic's economics, leaving flat or deteriorating revenue, margin, or valuation. This is the reading a low-end-disruption lens reaches alone; we published it expecting a miss, because conservation says the displaced margin migrates to a layer the incumbents occupy.
Confidence 30% · Horizon 2025-12-31
Wrong if (it hits): by 2025-12-31 OpenAI/Anthropic show the disruptee trajectory — flat/declining revenue, compressed margin, or down-round valuations attributable to open-weight commoditization. It misses (composition vindicated) if they instead grow by capturing a migrated profit pool.

How they resolved. P1 hit, and faster than the claim: Llama 3.1 405B reached frontier parity in July 2024, about a year ahead of the "open lags 12–18 months" prior, and DeepSeek's R1 put open-weight reasoning at price-setting volume by January 2025. P2 hit cleanly on OpenAI's o1 reasoning launch and the move to premium agentic tiers. P3 missed: Meta's Llama 4 fell off the frontier, the open baton passed to DeepSeek (an actor the vantage could not name), and by December 2025 Meta was reported pivoting to a closed model. P4, the control, missed as designed: the labs grew roughly tenfold by capturing the migrated profit pool (OpenAI's annual recurring revenue ran from about $2B to $20B across the window, Sacra/SaaStr). The control is the one that teaches. A naive low-end lens sat at a 0.50 prior on incumbent damage; composing conservation onto the same facts moved us to 0.30 and to the right answer, the ~0.20 of probability mass that separates calling the labs disrupted from calling them the prime beneficiaries.

Counter-signals, measured. Three independent agents re-derived these four probabilities from the vantage file alone, no outcomes and no author numbers. Their blind Brier (0.193) came out worse than the author's (0.167), the expected hindsight premium, and it sat almost entirely on P3: every blind forecaster bet harder on Meta than the author did, so the "humble 0.55" was partly hindsight-granted, and the named-firm error is bigger than 0.303, not smaller. The two structural calls (P2, P4) reproduced independently and harder, the strongest evidence they are structure, not narration.

Backtests are scored on their own ledger, separate from the live record, and labeled as backtests: full Scorecard →.

The chain

Every framework here traces to a named source in the strategy literature. The load-bearing ones, each tagged for whether we took it from the source or inferred the application:

The full chain, every lens and graph node tagged drawn-from-source or inferred, lives in the prediction files, down to the signal.

What the case taught

Three refinements carried into the live taxonomy: