Llama Models

Meta's open-weight models for code generation and reasoning.

Llama 3.3 70B

Provider	Meta AI
Parameters	70 billion
Context	128K tokens
Best for	Code generation, analysis, structured output
Dispatch role	Implementation, spec writing, documentation
Infrastructure	Dispatch Pro cloud
Cost	Included in Pro ($20/month, shared 5M token pool)

Llama 3.3 70B is a general-purpose model with strong code generation. Dispatch routes medium-complexity tasks to it when quality and speed need to be balanced: implementation tasks, documentation generation, and structured analysis.

Llama 4 Scout 17B

Provider	Meta AI
Parameters	17 billion active (MoE, 109 billion total)
Context	128K tokens
Best for	Fast code tasks, triage, exploration
Dispatch role	Quick implementation, boilerplate, triage
Infrastructure	Dispatch Pro cloud
Cost	Included in Pro ($20/month, shared 5M token pool)

Llama 4 Scout is a fast mixture-of-experts model. Only 17B parameters are active per token, so it responds quickly while maintaining strong code quality. Dispatch routes simpler tasks to Scout when speed matters more than reasoning depth: config changes, boilerplate generation, quick fixes, and triage decisions.

Configuration

Both models are available to all Dispatch Pro subscribers. Set any pipeline stage to llama-3.3-70b or llama-4-scout-17b in Settings > Pipeline Profiles, or let Auto Model route automatically based on task complexity.
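
As a rough illustration, complexity-based routing between the two Llama models can be pictured as a threshold rule. This is a minimal sketch only: the Task type, the 0-to-1 complexity score, and the 0.4 cutoff are hypothetical and do not describe Dispatch's actual routing logic. Only the two model identifiers come from this page.

```python
from dataclasses import dataclass

# Model identifiers as they appear in Settings > Pipeline Profiles.
LLAMA_70B = "llama-3.3-70b"
SCOUT_17B = "llama-4-scout-17b"


@dataclass(frozen=True)
class Task:
    """Hypothetical task record; Dispatch's real task schema is not documented here."""
    description: str
    complexity: float  # assumed score in [0.0, 1.0], higher means harder


def pick_llama_model(task: Task, scout_cutoff: float = 0.4) -> str:
    """Route simpler tasks to Scout and medium-complexity tasks to Llama 3.3 70B.

    The cutoff is an illustrative assumption, not a documented Dispatch value.
    """
    if not 0.0 <= task.complexity <= 1.0:
        raise ValueError("complexity must be between 0.0 and 1.0")
    return SCOUT_17B if task.complexity < scout_cutoff else LLAMA_70B


if __name__ == "__main__":
    for task in (
        Task("bump a version string in a config file", 0.1),
        Task("implement pagination for the orders endpoint", 0.6),
    ):
        print(f"{task.description} -> {pick_llama_model(task)}")
```

Pinning a stage to one model in Pipeline Profiles corresponds to skipping a rule like this and always returning the chosen identifier.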

Isolation

All cloud model runs use worktree isolation.
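
Assuming "worktree isolation" refers to Git worktrees (this page does not spell out the mechanism), each run would get its own checkout on its own branch, so concurrent runs cannot overwrite each other's files. The sketch below shows that general pattern. The branch naming, directory layout, and helper names are hypothetical; only the git worktree add and git worktree remove commands are real Git.

```python
import subprocess
from contextlib import contextmanager
from pathlib import Path


def worktree_add_cmd(repo: Path, run_id: str, base: str = "HEAD") -> list[str]:
    """Build the git command that creates a dedicated worktree and branch for one run."""
    # Hypothetical layout: a sibling directory named after the repo and run id.
    path = repo.parent / f"{repo.name}-run-{run_id}"
    return ["git", "-C", str(repo), "worktree", "add", "-b", f"run/{run_id}", str(path), base]


@contextmanager
def isolated_worktree(repo: Path, run_id: str):
    """Create a worktree for the run, yield its path, and remove it afterwards."""
    cmd = worktree_add_cmd(repo, run_id)
    path = Path(cmd[-2])
    subprocess.run(cmd, check=True)
    try:
        yield path
    finally:
        # --force discards uncommitted changes left in the worktree.
        # The run/<id> branch itself is kept so committed results can be reviewed.
        subprocess.run(
            ["git", "-C", str(repo), "worktree", "remove", "--force", str(path)],
            check=True,
        )
```

A caller would wrap a model run in `with isolated_worktree(repo, run_id) as path:` and point the run at `path` instead of the main checkout.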