Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.heygarth.ai/llms.txt

Use this file to discover all available pages before exploring further.

Anthropic automatically upgrades your API account through usage tiers as your cumulative credit purchases increase. Each tier unlocks higher rate limits — more requests per minute, more input tokens per minute, and more output tokens per minute.
Rate limits are enforced at the organisation level, measured separately per model. You can monitor your current tier and live usage in Anthropic Console → Limits.

Spend tiers

Advancing to the next tier requires a cumulative credit purchase that meets the threshold below. You advance immediately upon reaching it.
TierCredit purchase requiredMax single purchaseMonthly spend limit
Tier 1$5$100$100
Tier 2$40$500$500
Tier 3$200$1,000$1,000
Tier 4$400$200,000$200,000
Monthly InvoicingContact salesNo limit
Credit purchase is the cumulative amount deposited into your Anthropic account (excluding tax), not your total spend. You advance as soon as you cross the threshold — there is no waiting period.

Rate limits by tier

Rate limits are measured in three dimensions:
  • RPM — requests per minute
  • ITPM — uncached input tokens per minute (cached tokens do not count for most models)
  • OTPM — output tokens per minute

Tier 1

Entry-level limits after a $5 credit purchase. Suitable for evaluation and small pilots.
ModelRPMITPMOTPM
Claude Sonnet 4.x5030,0008,000
Claude Haiku 4.55050,00010,000

Tier 2

Unlocked after $40 in cumulative credit purchases. Suitable for small teams with moderate PR volume.
ModelRPMITPMOTPM
Claude Sonnet 4.x1,000450,00090,000
Claude Haiku 4.51,000450,00090,000

Tier 3

Unlocked after $200 in cumulative credit purchases. Suitable for growing engineering teams.
ModelRPMITPMOTPM
Claude Sonnet 4.x2,000800,000160,000
Claude Haiku 4.52,0001,000,000200,000

Tier 4

Unlocked after $400 in cumulative credit purchases. Suitable for large organisations with high PR velocity.
ModelRPMITPMOTPM
Claude Sonnet 4.x4,0002,000,000400,000
Claude Haiku 4.54,0004,000,000800,000
Sonnet 4.x limits apply to combined traffic across claude-sonnet-4-6, claude-sonnet-4-5, and claude-sonnet-4.

How ITPM works (cache-aware)

A key advantage of the Claude API is that cached input tokens do not count towards your ITPM rate limit on current models. Only uncached input tokens and tokens being written to cache consume your ITPM quota.
Token typeCounts towards ITPM?
Uncached input tokensYes
Cache creation tokensYes
Cache read tokensNo (current models)
Output tokensCounted under OTPM separately
This means effective throughput is significantly higher than the raw ITPM number suggests. If 80% of your input tokens are served from cache, you can process 5× more total input tokens per minute than your ITPM limit implies.
Garth uses prompt caching for system instructions and repository context. Teams with many similar PRs (e.g. same monorepo) benefit automatically from high cache hit rates, increasing effective throughput without needing a higher tier.

Choosing the right tier for your team

Tier 1 — Evaluation or very small teams

Fewer than 10 developers, infrequent PRs, or you are trialling BYOK before committing. Requires only a $5 credit deposit.

Tier 2 — Small to medium teams

10–50 developers with regular PR activity. The 1,000 RPM / 450,000 ITPM limit on Sonnet handles dozens of concurrent reviews comfortably. Requires $40 total in credit purchases.

Tier 3 — Large engineering orgs

50–200 developers or multiple busy repositories. Tier 3 doubles the RPM and nearly doubles ITPM versus Tier 2. Requires $200 in credit purchases.

Tier 4 — High-velocity platforms

200+ developers, monorepos, or CI pipelines generating a high volume of short-lived PRs. Requires $400 in credit purchases. For limits beyond Tier 4, contact Anthropic sales for monthly invoicing with no spend cap.

Token consumption per review

Each Garth review consumes input tokens (your diff and context) and output tokens (the review comments). Estimates below are for Claude Sonnet 4.x.
Review sizeTypical input tokensTypical output tokens
Small PR (1–5 files, < 200 lines)3,000 – 8,000500 – 1,500
Medium PR (5–20 files, 200–800 lines)8,000 – 25,0001,500 – 4,000
Large PR (20+ files, 800+ lines)25,000 – 80,0004,000 – 10,000
Keeping PRs focused (under 400 lines changed) reduces token usage per review and generally improves comment accuracy.

Rate limit errors

If your account hits a rate limit, Garth retries automatically with exponential backoff. Sustained limits (e.g. many large PRs merging simultaneously) may delay review posting. What you will see:
  • A delayed review comment once the retry succeeds
  • A dashboard notification if retries are exhausted and the review is dropped
How to resolve:
  • Advance to the next tier by purchasing additional credits in the Anthropic Console
  • Enable or increase prompt caching to reduce ITPM consumption
  • Contact support if you need help sizing the right tier

Getting an Anthropic API key

1

Create an Anthropic account

Go to console.anthropic.com and sign up or log in.
2

Purchase credits

Navigate to Billing and purchase at least $5 in credits. This activates Tier 1 immediately. Purchase $40 total to reach Tier 2, $200 for Tier 3, or $400 for Tier 4.
3

Generate an API key

Go to Settings → API Keys in the Anthropic Console and click Create Key. Give it a descriptive name such as garth-byok.
4

Copy and store the key

Copy the key — it is shown only once. Paste it into Garth’s Settings → Integrations → LLM.
Never commit your Anthropic API key to a repository. Garth’s secure vault is the correct place to store it. Anthropic’s secret scan — and Garth’s own code scan — will flag any key found in source code.

Next steps

BYOK overview

Add your Claude API key to Garth’s secure vault and set it as your active provider.

Anthropic Console — Limits

View your current tier and live rate limit usage in the Anthropic Console.