๐ Free LLM API Providers โ Consolidated Directory
Was this page helpful?
Loading OmniRoute...
The ultimate aggregated reference for all permanently free LLM API providers. Consolidated from 6 community repositories. Use with OmniRoute to route through 25+ free providers simultaneously.
Last consolidated: May 2026 ยท Sources: awesome-free-llm-apis, awesome-free-llm-apis2, free-llm-api-resources, Free-LLM-Collection, FREE-LLM-API-Provider, gpt4free
| Groq | |||||||
| Cerebras | |||||||
| Mistral AI | |||||||
| Google Gemini | |||||||
| NVIDIA NIM | |||||||
| Ollama Cloud | |||||||
| OpenRouter | |||||||
| GitHub Models | |||||||
| Cloudflare AI | |||||||
| Hugging Face | |||||||
| Cohere | |||||||
| Pollinations | |||||||
| Z.AI (Zhipu) | |||||||
| SiliconFlow | |||||||
| Kilo Code | |||||||
| LLM7.io | |||||||
| Kluster AI | |||||||
| ModelScope | |||||||
| IBM watsonx |
Get API Key ยท Base URL:
Get API Key ยท Base URL:
Get API Key ยท Base URL:
Get API Key ยท Base URL:
Pricing
Get API Key ยท Base URL:
Get API Key ยท Base URL:
Explore Models ยท Base URL:
129 models, 40 RPM. Phone verification required.
Notable models: DeepSeek-R1, DeepSeek-V3.2, Nemotron Ultra 253B, Llama 3.1 405B, Qwen3 Coder 480B, Mistral Large 3, Kimi K2, GLM-5.1, MiniMax M2.7, Gemma 4 31B, + 100 more.
Get API Key ยท Base URL:
). 20 RPM.
Notable free models: DeepSeek R1, DeepSeek V3, Qwen3 Coder 480B, Llama 4 Scout/Maverick, GPT-OSS 120B, Nemotron 3 Super 120B, MiniMax M2.5, Gemma 4 31B, Devstral, + 23 more.
Marketplace ยท Base URL:
Notable models: GPT-4.1, GPT-4o, GPT-5, GPT-5-mini, o3-mini, o4-mini, DeepSeek-R1, Llama 4 Scout/Maverick, Codestral, Mistral Medium 3, Phi-4, Grok-3.
Get Token ยท 10,000 Neurons/day free. 50+ models.
Notable models: Llama 3.3 70B, Llama 4 Scout, Qwen3 30B-A3B, QwQ 32B, DeepSeek R1 Distill, Gemma 4 26B, GLM 4.7 Flash, Nemotron 3 120B, Kimi K2.5/K2.6, Mistral Small 3.1, GPT-OSS 120B/20B, + 40 more.
Get Token ยท Base URL:
Get Key ยท Base URL:
Notable models: GPT-OSS 120B, DeepSeek V3.2/V4, Kimi K2/K2.5/K2.6, GLM-5/5.1, Qwen3 Coder 480B, Gemini 3 Flash, MiniMax M2.7, Cogito 2.1 671B, Nemotron 3 Super 120B.
Get Key ยท Base URL:
text + image + video + audio all free.
Text models: openai, openai-large, openai-reasoning, gemini, mistral, llama. Image models: flux, gpt-image, seedream, kontext. Video: wan-fast. Audio: tts-1, 30+ ElevenLabs voices.
Get Key ยท Base URL:
Get Key ยท Base URL:
.
Get Token ยท Base URL:
Get Key ยท DeepSeek-R1, Llama 4 Maverick, Qwen3-235B + more.
Docs ยท Free models (Big Pickle Stealth, MiniMax M2.5 Free, Arcee Large).
Docs ยท $5/month free credits. Routes to various providers.
Get Token ยท Base URL:
Models: DeepSeek V4 Pro/Flash, DeepSeek V3.2, GLM-5/5.1, MiniMax M2.5, Qwen3-235B, Qwen3 Coder 480B, Ling-2.6-1T.
ยท GPT-5.4-mini, DeepSeek-V4, and more.
ยท 10 RPM. Keys valid 6 months.
Models: intern-latest, intern-s1-pro, internvl3.5-241b-a28b.
ยท 30 concurrent requests.
Models: GLM-4-Flash, GLM-4V-Flash, GLM-4.1V-Thinking-Flash, GLM-4.6V-Flash, GLM-4.7-Flash.
| Baseten | |||
| NLP Cloud | |||
| AI21 | |||
| Upstage | |||
| Modal | |||
| SambaNova | |||
| Scaleway | |||
| Alibaba Cloud | |||
| Fireworks | |||
| Nebius | |||
| Inference.net | |||
| Hyperbolic | |||
| Novita |
all providers listed above as connections. Here's how to maximize free usage:
"priority" or "round-robin" strategy to distribute load across free tiers.
| Groq | ||
| Cerebras | ||
| Mistral | ||
| Google Gemini | ||
| NVIDIA NIM | ||
| OpenRouter |
# These providers work out of the box with OmniRoute: GROQ_API_KEY=your-key CEREBRAS_API_KEY=your-key MISTRAL_API_KEY=your-key GOOGLE_AI_API_KEY=your-key NVIDIA_API_KEY=your-key OPENROUTER_API_KEY=your-key GITHUB_TOKEN=your-token CLOUDFLARE_API_TOKEN=your-token COHERE_API_KEY=your-key SILICONFLOW_API_KEY=your-key
| Requests/Day | |
| Tokens/Month | |
| Models Available | |
| Cost |
| RPM | |
| RPD | |
| RPH | |
| RPS | |
| TPM | |
| TPD | |
| Neurons |
| awesome-free-llm-apis | |
| awesome-free-llm-apis2 | |
| free-llm-api-resources | |
| Free-LLM-Collection | |
| FREE-LLM-API-Provider | |
| gpt4free |
Disclaimer: Rate limits change frequently. Always verify with the provider's official documentation before relying on specific limits. Trial credits and time-limited promotions are separated from permanent free tiers.