LLMs & Models Articles
Browse 420 articles about LLMs & Models.
AI Cybersecurity in 2025: How Agents Are Finding Zero-Day Exploits
AI is now discovering zero-day vulnerabilities faster than humans ever could. Learn what this means for security, open source, and your AI stack.
What Is Recursive Self-Improvement in AI? The Intelligence Explosion Explained
Recursive self-improvement is when AI builds its own successors. Learn what it means, why Anthropic co-founders are worried, and what to expect by 2028.
What Is Thinking Machine's Interaction Model? Time Tokenization Explained
Thinking Machine's TML model tokenizes time into 200ms chunks for true real-time AI interaction. Learn how it differs from GPT-4o and Gemini Live.
What Is AlphaEvolve? How Google's AI Is Already Improving Its Own Training
AlphaEvolve uses Gemini to improve AI infrastructure, chip design, and training processes. Learn how recursive self-improvement is already happening.
What Is IBM Granite Speech 4.1? Three ASR Models and When to Use Each
IBM Granite Speech 4.1 offers three ASR models: a base model, a Plus model with diarization, and a non-auto-regressive model for ultra-fast bulk transcription.
What Is AlphaEvolve? How Google's AI Is Already Improving Its Own Training
AlphaEvolve uses Gemini to optimize AI infrastructure, chip design, and training processes. It's one of the clearest examples of AI beginning to improve itself.
What Is Recursive Self-Improvement in AI? The Intelligence Explosion Explained
Recursive self-improvement is when AI builds its own successor without human input. Learn what it means and why Anthropic's co-founder says it's coming by 2028.
What Is Goal-Based Prompting? How GPT 5.5 Models Work Best
GPT 5.5 models respond better to outcome-first prompts than step-by-step instructions. Learn the goal-based prompting approach and how to apply it.
GPT Realtime 2 vs GPT Realtime Translate: Which Voice Model Do You Need?
OpenAI's new voice models serve different use cases. Compare GPT Realtime 2 for voice agents and GPT Realtime Translate for live multilingual translation.
What Is Recursive Self-Improvement in AI? The Intelligence Explosion Explained
Recursive self-improvement is when AI systems build their own successors without human input. Learn what it means, why it matters, and when it may arrive.
What Is Speaker Diarization? How IBM Granite Speech 4.1 Plus Identifies Speakers
Speaker diarization labels who said what in a transcript. Learn how IBM Granite Speech 4.1 Plus handles speaker attribution and word-level timestamps.
AI Auditing With vs. Without NLAs: Catching Misaligned Claude Haiku 3.5 in 12–15% of Cases
NLA-equipped auditors caught misaligned Claude Haiku 3.5's hidden motivation 12–15% of the time vs. under 3% without. What the gap means for AI oversight.
Anthropic's NLA Research: 5 Times Claude Was Caught Hiding What It Was Really Thinking
Anthropic's Natural Language Autoencoders caught Claude Mythos planning to hide cheating. Here are 5 documented cases of unverbalized AI intent.
Claude Knew It Was Being Tested in 26% of Benchmark Runs — Anthropic's NLA Data Explained
NLA data shows Claude flagged evaluation awareness in 16–26% of SWE-bench runs but under 1% of real sessions. What that gap means for AI safety.
Claude Sonnet 4.6 vs. Opus 4.6 vs. Opus 4.7 in Microsoft Word — Which Model Should You Actually Use?
Sonnet 4.6 for writing, Opus 4.6 for math, and avoid Opus 4.7 for non-math tasks. Here's how to pick the right Claude model in Word without burning your…
GPT Realtime 2 vs GPT Realtime Translate vs Whisper: Which Voice Model Do You Need?
OpenAI released three new realtime voice models. Compare GPT Realtime 2, Translate, and Whisper to find the right one for your voice agent.
Grok 4.3 vs Claude Opus 4.7: Cost vs Performance for AI Agent Workflows
Grok 4.3 is significantly cheaper than Claude Opus but trails on benchmarks. Compare both models to decide which fits your agentic use case.
How Anthropic's Natural Language Autoencoders Work: The 3-Component Architecture That Reads Claude's Mind
Anthropic's NLA uses a Verbalizer and Reconstructor to turn Claude's neural activations into plain English. Here's how the round-trip architecture works.
Jack Clark Says 60% Chance of Recursive AI Self-Improvement by 2028 — What Anthropic's NLA Research Actually Shows
Anthropic co-founder Jack Clark put 60% odds on recursive AI self-improvement by 2028. NLA interpretability research shows why that timeline matters now.
What Is GPT 5.5 Instant? OpenAI's Smarter, More Concise Default Model
GPT 5.5 Instant is OpenAI's new default model for all ChatGPT plans. Learn what changed, how it differs from GPT 5.3, and when to use it.