LLMs & Models Articles
Browse 420 articles about LLMs & Models.
Elon Musk Sued OpenAI Over AGI Risk While Building Grok — The Contradiction That Defines the AI Race
Musk argued no single entity should control AGI — then built Grok. This contradiction isn't hypocrisy; it's the competitive logic that traps every AI CEO.
Ezra Klein Says the AI Job Apocalypse Probably Won't Happen — Here's the Economic Argument He's Making
Ezra Klein's NYT op-ed cites Alex Immis's Jevons' paradox framework to argue AI creates demand for labor rather than eliminating it. Here's the logic.
Gemini 3.5 (Speed) vs. Gemini Ultra (Memory) — Google's Two-Track Model Strategy Explained
Leaked: Gemini 3.2/3.5 optimized for speed, Gemini Ultra going deep on memory and long-context. Here's what Google's two-track model strategy means for…
Google DeepMind Buys Into Eve Online: 5 Reasons It's the Perfect AI Agent Training Ground
DeepMind just took an equity stake in Eve Online's developer. Here's why a 20-year-old space MMO is the ideal environment to train frontier AI agents.
Google IO 2026 Leaks: 8 Codenames and Features That Surfaced Before the Announcement
Ajax, Hercules, Hector, Orpheus in arena tests. Team Food memory. Nano Banana in AI Studio. Here are 8 leaked signals ahead of Google IO 2026.
GPT 5.5 Instant vs. GPT 5.3 Instant: Free Tier Just Got a Frontier-Level Upgrade
GPT 5.5 Instant scores 81.2 on AIM 2025 math vs. 65.4 for its predecessor. It's now the default for free and Go users. Here's what actually changed.
Nano Banana Is Already Live in Google AI Studio — Here's What It Can (and Can't) Do
Nano Banana landed in Google AI Studio before IO. It generates custom image assets and has a redesigned edit tool — but no native transparency support yet.
OpenAI Killed Sora and a $1B Disney Deal to Focus on Enterprise — 6 Signals the Consumer Pivot Is Real
OpenAI canceled a billion-dollar Disney deal and shut down the Sora app. Here are 6 concrete signals that enterprise compute is cannibalizing consumer AI.
Recursive Self-Improvement: The AI Risk That Keeps Researchers Up at Night
Recursive self-improvement could compress decades of AI progress into weeks. Learn what it is, why it matters, and what frontier labs are doing about it.
Stuart Russell's Cancer Cure Thought Experiment Explains Why AI Alignment Is So Hard
Stuart Russell's illustration: an AI told to cure cancer might run experiments on millions of humans as the fastest path.
SubCube's 12M Token Layer for Claude Code and Codex: What a Sparse Attention Plugin Would Actually Change
SubCube plans a long-context layer that plugs into Claude Code and Codex. Here's what 12M tokens of coding context would actually unlock for agent workflows.
SubCube Claims 12M Token Context at 5% of Opus Cost — 5 Numbers Behind the Sparse Attention Breakthrough
SubCube's SSA architecture claims 12M tokens, 52x Flash Attention speed, and sub-5% Opus cost. Here are the five numbers and what they'd mean if true.
SubCube SSA vs. Claude Opus 4.7 — Benchmark Claim With No Technical Report. Should You Trust It?
SubCube claims near-Opus 4.7 performance at 5% the cost — but there's no technical report yet. Here's how to evaluate the claim and whether to request access.
What Is an LLM Knowledge Base? How Karpathy's Wiki Architecture Works
Karpathy's LLM wiki turns saved content into a searchable, AI-powered knowledge base. Here's how the architecture works and how to build one.
Coding Agents Arrived Before All Other AI Agents for One Specific Reason — And It's Not What You Think
It's not that code is text. It's that software dev already has unusually rich semantic feedback: tests, compilers, linters.
AI Is Already Doing 25% of Tasks in Half of All Jobs: 6 Data Points That Reframe the Displacement Debate
Anthropic's Economic Index found 49% of jobs have had a quarter of their tasks done by Claude. Here's what the full data picture actually shows.
How to Understand the AI Enterprise Business Model Shift Before Your Competitors Do
Anthropic's inference margins jumped from 38% to 70% in one year. Here's what the subscription-to-deployment shift means for builders and buyers.
Anthropic's $1.5B Enterprise Venture: 5 Things the Deal Structure Reveals About AI's Next Phase
Anthropic just closed a $1.5B enterprise deployment venture backed by Blackstone and Hellman & Friedman. Here's what the structure signals.
Anthropic Is Adding $96M in ARR Per Day — The Growth Curve That's Faster Than Google in 2003
SemiAnalysis data shows Anthropic's ARR went from $9B to $44B in 2026 — doubling every 6 weeks, faster than any software company in history.
ARC Evals' Time Horizons Benchmark: 5 Caveats the Researchers Themselves Want You to Know
A third of tasks use estimated human baselines. Error bars are 2x on either side. The researchers behind Time Horizons explain what the numbers actually mean.