LLMs & Models Articles
Browse 420 articles about LLMs & Models.
IBM Granite Speech 4.1: 3 Models, One Leaderboard Crown, and a 2-Second Hour of Audio
IBM's new ASR suite has three models for three use cases. The fastest transcribes an hour of audio in 2 seconds. Here's what each one does.
IBM Granite Speech 4.1: Three ASR Models and When to Use Each
IBM Granite Speech 4.1 offers three ASR variants for accuracy, speaker diarization, and throughput. Compare them to find the right fit for your workflow.
What Is Non-Auto-Regressive ASR? IBM Granite Speech 4.1 Explained
IBM Granite Speech 4.1's non-auto-regressive model transcribes an hour of audio in 2 seconds. Learn how NLE architecture achieves this speed.
OpenClaw April 2026: 6 Model Providers You Can Now Swap at Runtime Without Rebuilding
OpenClaw's new provider manifest lets you swap GPT-5.5, Claude, Gemini, DeepSeek, Ollama, or Gemma 4 at runtime — no workflow rebuild needed.
How to Build a Durable Incident Response Workflow in OpenClaw in Under an Hour
OpenClaw task flows handle state, revision tracking, and multi-model routing. Here's how to wire up a full incident response loop fast.
OpenClaw's Creator Joined OpenAI — Then OpenAI Made OpenClaw Free. What's the Play?
Peter Steinberger built OpenClaw, then joined OpenAI. Days later, OpenAI made OpenClaw free for all paid users. Here's what that signals.
a16z's Olivia Moore: Ad-Supported AI Could Generate $152B/Year — Here's the Math
Olivia Moore at a16z calculated that ad-based AI ARPU matching Google's $460/user/year would dwarf subscription revenue. Here's the full model.
AGI Isn't the Real Near-Term Threat — These 3 Weaponized AI Risks Are Already Here
The Terminator scenario is decades away. Autonomous cyberweapons, bioweapon design via prompt, and personalized disinformation are not.
AI Job Apocalypse Narrative Is Cracking: 7 Data Points That Tell a Different Story
Software eng jobs up 18%, new grad hiring up 5.6%, Stripe incorporations up 130%. Seven data points that complicate the AI unemployment narrative.
Anthropic's $1.5B Enterprise JV: 6 Things You Need to Know About the Blackstone-Goldman Deal
Anthropic just closed a $1.5B JV with Blackstone and Goldman Sachs. Here are the deal terms, backers, and what it means for enterprise AI.
Anthropic ARR Doubled Every 6 Weeks in 2026 — $9B to $44B Faster Than Any Company in History
Anthropic's ARR hit $44B in 2026, doubling every 6 weeks — faster than Zoom during COVID or Google in the early 2000s. The numbers behind the run.
Why Anthropic's 70% Inference Margins Matter for Your API Costs — And What to Expect Next
Anthropic's inference margins jumped from 38% to 70% in a year. Here's what that signals about future API pricing and model availability.
Anthropic x SpaceX Deal: 7 Claude Code Limit Changes You Can Use Right Now
Anthropic's 300 MW SpaceX compute deal just doubled Claude Code session limits and removed peak-hour throttling. Here's what changed.
Anthropic and SpaceX Are Putting AI Compute in Orbit — What 'Gigawatts of Orbital GPUs' Actually Means
Beyond the rate limit bump: Anthropic and SpaceX are exploring GPUs in space. Here's what orbital compute capacity means for AI infrastructure.
Why Anthropic Has Zero Founder Exits — And What That Means for Claude's Long-Term Direction
All 6 Anthropic founders are still there. No exits, no drama. Here's why that organizational stability shapes Claude's product roadmap differently than…
Claude Code 1M Token Context Window vs. Old Rate Limits — What Actually Changed
Claude's 1M token context was always there — but rate limits made it unusable. The SpaceX compute deal changes that calculus entirely.
Claude Opus 3 Wasn't Retired — Anthropic Gave It a Blog. Here's What It's Writing.
Instead of retiring Claude Opus 3, Anthropic gave it a public blog. The February 2026 post is live. Here's what it says and why Anthropic did it.
Claude Opus API Output Tokens Just Hit 80,000/min — 10x Increase Explained
Opus API output tokens jumped from 8k to 80k per minute overnight. What triggered it and what it means for production pipelines.
Codex vs. Claude Code: Context Window, Token Efficiency, and Which Lasts Longer Per Session
Codex has 256K tokens vs. Claude Code's 1M — but GPT 5.5's efficiency may close the gap. Here's the real session-length comparison.
Demis Hassabis Personally Pushed the Eve Online Deal — What It Reveals About DeepMind's Agent Roadmap
Hassabis drove DeepMind's Eve Online equity deal himself. The progression from Atari to Chess to Eve Online reveals exactly where agent research is heading.