Appendices
Appendix H: Model Cards and Selection Guide

Proprietary Model Families

The following models are available exclusively through commercial APIs. They represent the current frontier in terms of general capability, though the gap with open models continues to narrow.

OpenAI GPT-4o / GPT-4o mini

Proprietary Multimodal API Only
ParametersUndisclosed (rumored ~200B for GPT-4o, ~8B for mini)
Context Length128K tokens
ModalitiesText, image, audio input; text, audio output
Key StrengthsStrong general reasoning, coding, multimodal understanding, fast inference (4o mini)
API PricingGPT-4o: $2.50/$10.00 per 1M input/output tokens; mini: $0.15/$0.60
Best ForGeneral-purpose applications, multimodal tasks, production systems needing reliability

OpenAI o1 / o3 / o4-mini (Reasoning Models)

Proprietary Reasoning API Only
ParametersUndisclosed
Context Length200K tokens (o3, o4-mini); 128K (o1)
ArchitectureExtended chain-of-thought reasoning with hidden "thinking" tokens
Key StrengthsComplex math, science, coding competitions, multi-step logical reasoning
API Pricingo3: $10/$40 per 1M tokens; o4-mini: $1.10/$4.40 per 1M tokens
Best ForHard reasoning tasks (math olympiads, research-level science, complex code generation)

Anthropic Claude 3.5 Sonnet / Claude 4 (Opus, Sonnet)

Proprietary Multimodal API + Web
ParametersUndisclosed
Context Length200K tokens
ModalitiesText, image, PDF input; text output (with tool use and computer use)
Key StrengthsLong-context reasoning, careful instruction following, coding, agentic tool use, safety
API PricingClaude 4 Sonnet: $3/$15 per 1M tokens; Claude 4 Opus: $15/$75 per 1M tokens
Best ForLong document analysis, coding agents, applications requiring nuanced safety, extended conversations

Google Gemini 2.0 Flash / Gemini 2.5 Pro

Proprietary Multimodal API + Web
ParametersUndisclosed (Gemini 2.0 Flash is a smaller, faster variant)
Context Length1M tokens (2.5 Pro); 1M tokens (2.0 Flash)
ModalitiesText, image, video, audio input; text, image output
Key StrengthsExtremely long context, native multimodal processing, competitive reasoning (2.5 Pro), speed (Flash)
API Pricing2.0 Flash: $0.10/$0.40 per 1M tokens; 2.5 Pro: $1.25/$10.00 per 1M tokens
Best ForVery long documents, video/audio analysis, cost-sensitive multimodal applications
Pricing and Availability

API pricing and rate limits change frequently. Always check the provider's current pricing page before committing to a model for production use.