AI Model Selector

Choose model with performance trade-offs

Claude Sonnet 4

Estimated Latency

2-4s

Single call

Estimated Cost

$3.00

Per 1M tokens

Best For

Balanced quality + speed

Production RAG, customer-facing (YOUR CURRENT)

✅ Good match for your requirements

Real-World Performance

Your RAG Strategy

Vector search: 500ms Reranking: 1s AI (Claude Sonnet 4): 2-4s ═══════════ Total: 3.5-5.5s

How to Use

1. Select Pattern

Choose your workflow type (RAG, Sequential, Parallel, Batch)

2. Set Priority

Speed, Quality, Cost, or Balanced

3. Pick Model

See real-time performance estimates and trade-offs

4. Compare

See improvement vs. Claude Sonnet 4 (your current)

Integration with Your Workflows

RAG Strategy (Hormozi Coach)

Current: Claude Sonnet 4 (2-4s) + Vector (500ms) + Rerank (1s) = 3.5-5.5s total

With Haiku: Claude Haiku (0.3-0.8s) + Vector (500ms) + Rerank (1s) = 1.8-2.3s total

Savings: 70% faster, 95% cheaper, -20% quality

AI Automation System (6-Stage Pipeline)

Current: 7x GPT-4o (1-2s each) = 7-14s total

With Mini: 7x GPT-4o Mini (0.5-1s each) = 3.5-7s total

Savings: 50% faster, 80% cheaper, -15% quality