
Open Models Are The New Linux: DeepSWE And The Infrastructure War Against Closed AI
DeepSWE shows closed labs still lead frontier coding agents, but open-weight models are starting to price the infrastructure layer. That is exactly how Linux won.

Fast reads on model releases, compute strategy, policy pressure, and the companies fighting over the AI stack.

Sakana AI and NVIDIA's TwELL shows why sparse LLMs were not blocked by theory. They were blocked by GPU execution economics.

RLMs treat prompts as environments, not inputs. The MIT paper behind Recursive Language Models, the REPL execution loop, and why the AI industry is adopting it.

From a $1B nonprofit pledge to a $300B for-profit empire: the decade-long origin story of OpenAI, from founding through ChatGPT to $12B in annualized revenue.

Qwen3.5-397B-A17B scores 88.4 on GPQA Diamond at $0.60/M tokens, yet was absent from Anthropic's distillation report. That absence explains more than the names that were included.

Anthropic named DeepSeek, Moonshot, and MiniMax for industrial-scale Claude distillation. The evidence is real, but calling it an 'attack' ignores how AI was built.

Anthropic was mocked as the AI company that couldn't ship. Claude 2 was the punchline. Four years later, they own enterprise AI at a $380B valuation.

Google more than doubled its ARC-AGI-2 score, from 31% to 77%, in one update. Gemini 3.1 Pro leads 13 of 16 benchmarks at $2/M tokens, undercutting Claude Opus 4.6 by 60%.

Claude Sonnet 4.6 is the new claude.ai default - preferred over Opus 4.5 59% of the time, with 1M context and $3/$15 per million tokens.

OpenClaw hit 190K+ GitHub stars before OpenAI acquired it - despite 512 security vulnerabilities including an 8.8 CVSS remote code execution flaw.

ByteDance's Seed2.0 reveals a complete AI ecosystem - frontier LLMs, multimodal vision, agentic coding, and cinema-grade video at a fraction of Western pricing.

MiniMax M2.5 - 230B params, 10B active - scores within 0.6 points of Claude Opus 4.6 on SWE-bench at 20x lower cost, backed by a Hong Kong IPO.

Claude Opus 4.6 commands 40% of enterprise AI spend, found 500+ zero-day vulnerabilities, and Claude Code hit $1.1B ARR as Anthropic raised $30B.