Tools
The AI tools we'd actually use.
40 tools across 8 categories. Each with a one-line take on what it's good for. Curated, opinionated, no affiliate links.
LLM APIs (6)
Anthropic Claude API
[paid] Frontier models (Opus / Sonnet / Haiku) with prompt caching, tool use, and extended thinking. Best for serious production work.
OpenAI API
[paid] GPT-5 family plus o-series reasoning models. Native structured output, broad tool/function-calling ecosystem.
Google Gemini API
[paid] 2.5 family. Cheap Flash tier, long context on Pro, native multimodal. Vertex AI for enterprise.
AWS Bedrock
[paid] Hosted Claude / Llama / Titan / Mistral / Cohere on AWS. Best when the rest of your stack is already on AWS.
Together AI
[paid] Hosted open-source models (Llama, Mistral, DeepSeek). Cheaper than building your own GPU stack.
Groq
[paid] Specialized inference hardware. Famously fast on open-source models. Great for low-latency demos.
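Switching among these providers is easier than it looks, because they all accept roughly the same chat-style payload: a model name, a token budget, and a list of role-tagged messages. A minimal sketch of that shared shape (the `build_chat_request` helper and the model name are hypothetical, not any vendor's actual SDK; field names vary per provider):

```python
# Sketch of the chat-request shape these APIs share. Hypothetical helper,
# not a real SDK call; each vendor differs in exact field names.

def build_chat_request(model: str, system: str, user: str,
                       max_tokens: int = 1024) -> dict:
    """Assemble a provider-agnostic chat request payload."""
    return {
        "model": model,
        "max_tokens": max_tokens,
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": user},
        ],
    }

req = build_chat_request("claude-sonnet", "You are terse.", "Define RAG.")
print(req["messages"][1]["content"])  # → Define RAG.
```

Keeping this assembly step in one place is what makes it cheap to A/B providers later.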
Open models (4)
Llama 4
[open-source] Meta's open-weight mixture-of-experts models (Scout / Maverick) with a permissive license. Long context, native tool use.
Mistral / Mixtral
[open-source] European frontier-tier open models. Strong multilingual coverage, especially European languages.
Qwen 3
[open-source] Alibaba's open models. Wide size range, strong multilingual coverage, great math performance.
DeepSeek
[open-source] Open reasoning models with frontier-tier quality at small sizes. The distilled series fits on consumer GPUs.
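Whether an open model fits your hardware is mostly arithmetic: parameter count times bytes per parameter, before any KV-cache overhead. A rough rule-of-thumb sketch (the helper is illustrative, not from any library):

```python
def approx_weight_gb(params_billions: float, bits_per_param: int) -> float:
    """Rough VRAM needed just for the weights: params × bytes per param.
    Ignores KV cache and activation memory, so treat it as a floor."""
    bytes_per_param = bits_per_param / 8
    return params_billions * 1e9 * bytes_per_param / 1e9  # GB

# A 70B model: ~140 GB in fp16, ~35 GB at 4-bit quantization —
# why quantized and distilled variants fit on consumer GPUs.
print(approx_weight_gb(70, 16))  # 140.0
print(approx_weight_gb(70, 4))   # 35.0
```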
Inference / serving (7)
vLLM
[open-source] The default high-throughput LLM serving engine. PagedAttention, continuous batching, multi-LoRA. Start here.
Hugging Face TGI
[open-source] HF's production inference server. Slightly lower throughput than vLLM, but integrates well with HF tooling.
NVIDIA Triton + TensorRT-LLM
[open-source] Lowest p99 latency on NVIDIA hardware. More complex to operate; reach for it after maxing out vLLM.
SGLang
[open-source] Optimized for structured outputs and agent traces. Worth a look for tool-use-heavy workloads.
Modal
[paid] Serverless GPU. Define a function, get an endpoint. Best for variable workloads and fast cold starts.
Replicate
[paid] One-click deployment of open-source models. Great for prototypes and low-volume production.
Baseten
[paid] Production model serving with autoscaling. A good middle ground between roll-your-own and Replicate.
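Continuous batching, the technique vLLM popularized, means the server backfills a freed batch slot the moment any sequence finishes instead of waiting for the whole batch to drain. A toy scheduler sketch of that idea (pure simulation, not vLLM's API):

```python
from collections import deque

def continuous_batching(jobs: list[int], batch_size: int) -> int:
    """Simulate decode steps when finished sequences are replaced
    immediately. `jobs` = tokens left to generate per request.
    Returns total decode steps until every request completes."""
    waiting = deque(jobs)
    running: list[int] = []
    steps = 0
    while waiting or running:
        # Backfill any free batch slots from the waiting queue.
        while waiting and len(running) < batch_size:
            running.append(waiting.popleft())
        steps += 1  # one decode step advances every running sequence
        running = [t - 1 for t in running if t > 1]  # drop finished ones
    return steps

# With batch_size=2 the 8-token job pins one slot while the short jobs
# cycle through the other, so everything finishes in the minimum 8 steps.
print(continuous_batching([8, 1, 1, 1, 4], batch_size=2))  # → 8
```

Static batching on the same workload would wait for the 8-token job before admitting anything new, wasting the second slot for most of that time.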
Agent frameworks (4)
LangGraph
[open-source] Stateful agent orchestration. The serious successor to LangChain for production agents.
CrewAI
[open-source] Multi-agent orchestration. Good for genuinely multi-agent cases (research swarms, etc.).
Anthropic Agent SDK
[free-tier] Native tool-use loop. Often the simplest path: no framework needed.
OpenAI Agents SDK
[free-tier] OpenAI's opinionated agent stack. Hand-offs, guardrails, and tracing built in.
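The "native tool-use loop" these SDKs wrap is small enough to write directly: call the model, run whatever tool it requests, append the result, repeat until it answers in text. A framework-free sketch with a stubbed model (every name here is illustrative, not any SDK's API):

```python
def fake_model(messages: list[dict]) -> dict:
    """Stub standing in for a real LLM call: requests the weather
    tool once, then answers using the tool result."""
    if not any(m["role"] == "tool" for m in messages):
        return {"type": "tool_call", "name": "get_weather",
                "args": {"city": "Oslo"}}
    result = next(m for m in messages if m["role"] == "tool")["content"]
    return {"type": "text", "content": f"It is {result} in Oslo."}

TOOLS = {"get_weather": lambda city: "4°C and raining"}

def agent_loop(user_msg: str, model=fake_model, max_turns: int = 5) -> str:
    messages = [{"role": "user", "content": user_msg}]
    for _ in range(max_turns):
        reply = model(messages)
        if reply["type"] == "text":  # model is done talking to tools
            return reply["content"]
        tool_result = TOOLS[reply["name"]](**reply["args"])  # run the tool
        messages.append({"role": "tool", "content": tool_result})
    raise RuntimeError("agent did not finish within max_turns")

print(agent_loop("Weather in Oslo?"))  # → It is 4°C and raining in Oslo.
```

Swap `fake_model` for a real API call and you have the core of what the lighter SDKs do; the frameworks earn their keep once you add state, hand-offs, and retries.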
Vector databases (6)
pgvector
[open-source] Postgres extension. Already on Postgres? You probably don't need a dedicated vector DB until 10M+ chunks.
Pinecone
[paid] Managed vector DB. Easiest to operate at scale. Pricing matters above ~10M vectors.
Weaviate
[open-source] Open-source vector DB with strong hybrid search. Good if you need self-hosting with hybrid out of the box.
Qdrant
[open-source] Fast, Rust-based vector DB. Great hybrid search and filtering. Self-hosted or cloud.
LanceDB
[open-source] Embedded vector DB. Fits in a single file. Good for local-first / desktop apps.
Chroma
[open-source] Developer-friendly embedded vector DB. Pythonic API, great for prototypes.
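Every entry above ultimately answers the same query: nearest neighbors by similarity. A brute-force sketch of that core operation in plain Python (the real DBs add ANN indexes, metadata filtering, and persistence on top; the toy vectors are made up):

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query: list[float], docs: dict[str, list[float]], k: int = 2):
    """Exact nearest-neighbor search: O(n·d) per query, which is why
    dedicated DBs use approximate indexes past a few million vectors."""
    ranked = sorted(docs.items(), key=lambda kv: cosine(query, kv[1]),
                    reverse=True)
    return [doc_id for doc_id, _ in ranked[:k]]

docs = {
    "cats":   [0.9, 0.1, 0.0],
    "dogs":   [0.8, 0.2, 0.1],
    "stocks": [0.0, 0.1, 0.9],
}
print(top_k([1.0, 0.0, 0.0], docs))  # → ['cats', 'dogs']
```

This is also the honest benchmark for "do I need a vector DB yet": if brute force over your corpus is fast enough, pgvector or an embedded option will be too.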
Eval / testing (4)
Promptfoo
[open-source] Open-source prompt eval. Side-by-side model comparison, regression tests in CI.
RAGAS
[open-source] RAG-specific eval. Faithfulness, answer relevance, context recall, with LLM-as-judge.
Braintrust
[paid] Eval + observability platform. Good for teams that want a managed eval workflow.
TruLens
[open-source] Tracking and eval for LLM apps. Strong on RAG-specific metrics.
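The pattern all four tools share is simple: a fixed case set, a grader, and a pass rate you can gate CI on. A minimal harness sketch (the stub app and substring grader are placeholders; the real tools add LLM-as-judge grading, diffing, and dashboards):

```python
from typing import Callable

def run_eval(cases: list[dict], app: Callable[[str], str],
             grade: Callable[[str, str], bool]) -> float:
    """Run each case through the app, grade the output, return pass rate."""
    passed = sum(grade(app(c["input"]), c["expected"]) for c in cases)
    return passed / len(cases)

# Stub app + naive substring grader, just to make the harness runnable.
app = lambda question: "Paris is the capital of France."
grade = lambda output, expected: expected.lower() in output.lower()

cases = [
    {"input": "Capital of France?", "expected": "Paris"},
    {"input": "Capital of Spain?",  "expected": "Madrid"},
]
rate = run_eval(cases, app, grade)
print(f"pass rate: {rate:.0%}")  # → pass rate: 50%
```

Fail the build when the rate drops below your threshold and you have prompt regression testing; everything the platforms add is refinement of this loop.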
Observability (4)
LangSmith
[paid] Tracing, evals, and prompt management. Pairs naturally with LangChain/LangGraph.
Helicone
[free-tier] Drop-in observability proxy. Logs, costs, and latency for a one-line base-URL change.
PostHog (LLM Observability)
[free-tier] Already use PostHog? It now has LLM-specific traces and cost tracking. One stack.
OpenTelemetry GenAI
[open-source] Vendor-neutral GenAI tracing standard. Worth instrumenting against if you anticipate switching tools.
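Not ready to adopt a platform? The minimum viable version of all of these is a wrapper that records latency and token counts per call. A sketch with a stubbed model (the whitespace token count is a crude proxy; real tools read usage from the API response):

```python
import time

LOG: list[dict] = []  # in-memory stand-in for a real trace sink

def observed(model_fn):
    """Decorator that records latency and rough token counts per call."""
    def wrapper(prompt: str) -> str:
        start = time.perf_counter()
        output = model_fn(prompt)
        LOG.append({
            "latency_s": time.perf_counter() - start,
            "prompt_tokens": len(prompt.split()),      # crude proxy
            "completion_tokens": len(output.split()),  # crude proxy
        })
        return output
    return wrapper

@observed
def fake_llm(prompt: str) -> str:
    return "stubbed model response"

fake_llm("why is the sky blue")
print(LOG[0]["prompt_tokens"], LOG[0]["completion_tokens"])  # → 5 3
```

Emitting these records as OpenTelemetry GenAI spans instead of a local list is what makes the data portable across the vendors above.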
Training / fine-tuning (5)
Hugging Face Transformers
[open-source] The de facto Python library for loading and fine-tuning models. The foundation of most pipelines.
PEFT
[open-source] LoRA, QLoRA, IA³: parameter-efficient fine-tuning. Use with Transformers.
TRL
[open-source] SFT, DPO, PPO, KTO: alignment training that actually works. From HF.
Unsloth
[open-source] Roughly 2x faster fine-tuning on consumer GPUs. Drop-in optimization for HF Transformers.
Axolotl
[open-source] YAML-driven fine-tuning. Easier than writing scripts, more flexible than no-code.
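The "parameter-efficient" in PEFT is quantifiable: LoRA replaces the full d×k update of a weight matrix with two rank-r factors, so trainable parameters drop from d·k to r·(d+k). The arithmetic, with an illustrative layer size:

```python
def lora_trainable_params(d: int, k: int, r: int) -> tuple[int, int]:
    """Full fine-tuning trains d*k params per weight matrix; LoRA trains
    only the low-rank factors B (d×r) and A (r×k), i.e. r*(d+k)."""
    return d * k, r * (d + k)

# An illustrative 4096×4096 attention projection at rank 8:
full, lora = lora_trainable_params(4096, 4096, 8)
print(full, lora, f"{lora / full:.2%}")  # → 16777216 65536 0.39%
```

That ~0.4% per matrix is why LoRA adapters train on a single consumer GPU and ship as megabyte-scale files.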