LLMs & Models Articles
Browse 420 articles about LLMs & Models.
Gemma 4 vs Qwen 3.5: Which Open-Weight Model Should You Use for Local AI Workflows?
Compare Gemma 4 and Qwen 3.5 on performance, size, context window, and local deployment to find the best open-weight model for your agentic workflows.
What Is Google Gemma 4? The Apache 2.0 Open-Weight Model With Native Audio and Vision
Gemma 4 is Google's first truly open-source model family under Apache 2.0. It runs on phones, supports audio and vision, and rivals closed-source models.
What Is Qwen 3.5 Omni? Alibaba's Multimodal Model That Builds Apps From Your Camera
Qwen 3.5 Omni handles text, image, audio, and video and can build a website from a camera description. Here's what it does and how to use it.
What Is Qwen 3.6 Plus? Alibaba's 1M Token Agentic Coding Model Explained
Qwen 3.6 Plus is Alibaba's frontier-level model built for real-world agents, agentic coding, and multimodal vision with a 1M token context window by default.
What Is Gemma 4's Apache 2.0 License? Why It Matters More Than the Model Itself
Gemma 4 ships under Apache 2.0—not a custom restricted license. Here's what that means for commercial use, fine-tuning, and building on top of Google's models.
How to Run Claude Code for Free Using Ollama and OpenRouter
Learn two ways to use Claude Code without paying for Anthropic tokens: run open-source models locally with Ollama or route through OpenRouter's free tier.
How to Run Gemma 4 Locally with Ollama: Step-by-Step Setup Guide
Learn how to download and run Google's Gemma 4 locally using Ollama, check VRAM requirements, and connect it to Claude Code for free.
MAI Transcribe 1 vs OpenAI Whisper vs Gemini Flash: Which Speech Model Wins?
Compare Microsoft MAI Transcribe 1, OpenAI Whisper, and Gemini 3.1 Flash on accuracy, noise handling, and multilingual support.
How to Use OpenRouter Free Models With Claude Code to Cut AI Costs by 99%
Configure Claude Code to route through OpenRouter's free model tier instead of Anthropic's paid API. A step-by-step guide with the exact settings.json setup.
Open-Source vs Closed-Source AI Models: Which Should You Use for Agentic Workflows?
Compare open-weight models like Gemma 4 and Qwen 3.6 against closed models like Claude Opus and GPT-5.4 for agentic coding and automation tasks.
Why You Should Use an Agentic Harness With Qwen 3.6 Plus (Not Just Chat Mode)
Qwen 3.6 Plus performs dramatically better inside an agentic harness than in chat mode. Here's why and how to set it up with OpenCode.
Qwen 3.6 Plus vs Claude Opus 4.6: Which Model Is Better for Agentic Coding?
Compare Qwen 3.6 Plus and Claude Opus 4.6 on agentic coding benchmarks, context window, multimodal support, and real-world task performance.
What Is Microsoft MAI Transcribe 1? The Speech Model That Beats Whisper and Gemini
MAI Transcribe 1 is Microsoft's new speech recognition model that outperforms Whisper, Gemini Flash, and Scribe V2 across 25 languages.
What Is the Qwen 3.5 Omni Model? Alibaba's Multimodal AI That Builds Apps From Your Camera
Qwen 3.5 Omni understands text, image, audio, and video—and can build a functional website from a camera description. Here's what it can do.
What Is Qwen 3.6 Plus? Alibaba's Agentic Coding Model With 1M Token Context
Qwen 3.6 Plus is Alibaba's frontier agentic coding model with a 1M token context window, multimodal reasoning, and computer use capabilities.
Why Your AI Agent Needs a Harness: Qwen 3.6 Plus vs Chat Mode Performance
Running Qwen 3.6 Plus in a chat session vs an agentic harness produces dramatically different results. Here's what the difference looks like in practice.
Gemma 4 31B vs Qwen 3.5: Which Open-Weight Model Should You Use for Agentic Workflows?
Compare Gemma 4 31B and Qwen 3.5 on benchmarks, agentic capabilities, and local deployment to find the best open model for your AI workflows.
Gemma 4 for Edge Deployment: How the E2B and E4B Models Run on Phones and Raspberry Pi
Gemma 4's edge models support native audio, vision, and function calling in under 4B effective parameters. Here's what that means for on-device AI apps.
Qwen 3.6 Plus Review: Alibaba's Frontier-Level Agentic Coding Model
Qwen 3.6 Plus is Alibaba's latest proprietary model with 1M context and strong agentic coding. Learn how it performs and when to use it in a harness.
What Is Gemma 4? Google's Open-Weight Model Family With Apache 2.0 License
Gemma 4 is Google's newest open-weight model family with Apache 2.0 licensing, native multimodality, and function calling built in from the ground up.