AI Marketplace
Your one-stop destination for AI models, vector databases, and storage solutions from leading providers with enterprise-grade performance and governance.
| Model | Provider | Context | Category |
|---|---|---|---|
Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | Anthropic AWS Bedrock +1 | 200K | Premium |
Claude Sonnet 4 anthropic/claude-sonnet-4 | Anthropic AWS Bedrock +1 | 200K | Premium |
Claude 3.7 Sonnet anthropic/claude-3.7-sonnet | Anthropic AWS Bedrock +1 | 200K | Premium |
Claude Haiku 4.5 anthropic/claude-haiku-4.5 | Anthropic AWS Bedrock +1 | 200K | Standard |
Claude 3.5 Haiku anthropic/claude-3.5-haiku | Anthropic AWS Bedrock +1 | 200K | Standard |
Claude 3 Haiku anthropic/claude-3-haiku | Anthropic AWS Bedrock +1 | 200K | Budget |
GPT-4.1 Mini openai/gpt-4.1-mini | Azure OpenAI | 1M | Standard |
GPT-5 openai/gpt-5 | Azure OpenAI | 400K | Premium |
GPT-5 Nano openai/gpt-5-nano | Azure OpenAI | 400K | Budget |
GPT-5 Mini openai/gpt-5-mini | Azure OpenAI | 400K | Standard |
GPT-4o openai/gpt-4o | Azure OpenAI | 128K | Premium |
GPT-4o Mini openai/gpt-4o-mini | Azure OpenAI | 128K | Budget |
Text Embedding 3 Small openai/text-embedding-3-small | Azure OpenAI | — | Embedding |
Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | Google Vertex AI | 1M | Budget |
Gemini 2.5 Pro google/gemini-2.5-pro | Google Vertex AI | 1M | Premium |
Gemini 2.0 Flash google/gemini-2.0-flash | Google Vertex AI | 1M | Budget |
Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite | Google Vertex AI | 1M | Budget |
Gemini 2.5 Flash google/gemini-2.5-flash | Google Vertex AI | 1M | Standard |
Grok-4 Fast Reasoning xai/grok-4-fast-reasoning | xAI | 2M | Standard |
Gemma 2 9B gemma2-9b-it | Groq | 8K | Standard |
Llama 3.1 8B Instant llama-3.1-8b-instant | Groq | 128K | Standard |
Llama 3.3 70B Versatile llama-3.3-70b-versatile | Groq | 128K | Premium |
Llama Guard 4 12B meta-llama/llama-guard-4-12b | Groq | 128K | Standard |
Whisper Large V3 whisper-large-v3 | Groq | — | Audio |
Whisper Large V3 Turbo whisper-large-v3-turbo | Groq | — | Audio |
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | Groq | 128K | Premium |
Llama 4 Maverick 17B meta-llama/llama-4-maverick-17b-128e-instruct | Groq | 128K | Standard |
Llama 4 Scout 17B meta-llama/llama-4-scout-17b-16e-instruct | Groq | 128K | Standard |
Qwen 3 32B qwen/qwen3-32b | Groq | 128K | Standard |
Command R+ cohere/command-r-plus | Cohere | 128K | Premium |
Command R cohere/command-r | Cohere | 128K | Standard |
Command R7B cohere/command-r7b-12-2024 | Cohere | 128K | Standard |
Command Light cohere/command-light | Cohere | 4K | Budget |
Embed English v3.0 cohere/embed-english-v3.0 | Cohere | — | Embedding |
Embed Multilingual v3.0 cohere/embed-multilingual-v3.0 | Cohere | — | Embedding |