-
-
- OpenAI - OpenAI offer 5$ credits when creating an API
- account. Around 10 000 small question to GPT-4 Omni or 100 000 to
- GPT-3.5 Turbo.
-
-
-
- -
- GPT 4 Omni
-
- -
- GPT 4 Turbo
-
- -
- GPT 4
-
- -
- GPT 3.5 Turbo
-
-
-
-
+
OpenAI - OpenAI offer 5$ credits when creating an API
+ account. Around 10 000 small question to GPT-4 Omni or 100 000 to
+ GPT-3.5 Turbo.
-
-
- Anthropic - Anthropic offer 5$ credits when creating
- an API account. Around 2 000 small question to Claude 3 Opus or 120
- 000 to Claude Haiku.
-
-
-
- -
- Claude 3 Opus
-
- -
- Claude 3.5 Sonnet
-
- -
- Claude 3 Haiku
-
-
-
-
-
-
- Mistral - Mistral do not offer free credits.
-
-
-
- -
- Mixtral 8x22b
-
- -
- Mixtral 8x7b
-
- -
- Mistral 7b
-
- -
- Mistral Large
-
- -
- Mistral Small
-
- -
- Codestral
-
-
-
-
+
Anthropic - Anthropic offer 5$ credits when creating an
+ API account. Around 2 000 small question to Claude 3 Opus or 120 000 to
+ Claude Haiku.
-
-
- Groq - Groq offer a free tier with limit of tokens
- and request per minutes. The rate is plenty for a chatbot. 30 messages
- and between 6 000 and 30 000 tokens per minute. Per tokens coming
- soon.
-
-
-
- -
- Llama 3 70b
-
- -
- Llama 3 8b
-
- -
- Mixtral 8x7b
-
- -
- Gemma2 9b
-
- -
- Gemma 7b
-
-
-
-
-
-
-
- Google - Like Groq, Google offer a free tier with
- limit of tokens and request per minutes. The rate is plenty for a
- chatbot. 15 messages and 1 000 000 tokens per minute. Per tokens also
- available.
-
-
-
- -
- Gemini 1.5 pro
-
- -
- Gemini 1.5 flash
-
- -
- Gemini 1.0 pro
-
-
-
-
+
Mistral - Mistral do not offer free credits.
-
-
- Perplexity - Perplexity do not offer a free tier or
- credits. Perplexity offer what they call 'online' models that can
- search online. So you can ask for the current weather for example.
- Those models have additional cost of 5$ per 1 000 requests.
-
-
-
- -
- Sonar Large
-
- -
- Sonar Large Online
-
- -
- Sonar Small
-
- -
- Sonar Small Online
-
- -
- Llama 70b
-
- -
- Llama 7b
-
- -
- Mixtral 8x7b
-
-
-
-
-
-
- Fireworks - Fireworks AI offer 1$ of free credits
- when creating an account. Firework AI have a lot of open source
- models. I may add fine tuned models in the future.
-
-
-
- -
- FireLLaVA-13B
-
- -
- Mixtral MoE 8x7B Instruct
-
- -
- Mixtral MoE 8x22B Instruct
-
- -
- Llama 3 70B Instruct
-
- -
- Bleat
-
- -
- Chinese Llama 2 LoRA 7B
-
- -
- DBRX Instruct
-
- -
- Gemma 7B Instruct
-
- -
- Hermes 2 Pro Mistral 7b
-
- -
- Japanese StableLM Instruct Beta 70B
-
- -
- Japanese Stable LM Instruct Gamma 7B
-
- -
- Llama 2 13B French
-
- -
- Llama2 13B Guanaco QLoRA GGML
-
- -
- Llama 7B Summarize
-
- -
- Llama 2 13B
-
- -
- Llama 2 13B Chat
-
- -
- Llama 2 70B Chat
-
- -
- Llama 2 7B
-
- -
- Llama 2 7B Chat
-
- -
- Llama 3 70B Instruct (HF version)
-
- -
- Llama 3 8B (HF version)
-
- -
- Llama 3 8B Instruct
-
- -
- Llama 3 8B Instruct (HF version)
-
- -
- LLaVA V1.6 Yi 34B
-
- -
- Mistral 7B
-
- -
- Mistral 7B Instruct
-
- -
- Mistral 7B Instruct v0.2
-
- -
- Mistral 7B Instruct v0p3
-
- -
- Mixtral MoE 8x22B
-
- -
- Mixtral MoE 8x22B Instruct (HF version)
-
- -
- Mixtral MoE 8x7B
-
- -
- Mixtral MoE 8x7B Instruct (HF version)
-
- -
- MythoMax L2 13b
-
- -
- Nous Hermes 2 - Mixtral 8x7B - DPO (fp8)
-
- -
- Phi 3 Mini 128K Instruct
-
- -
- Phi 3 Vision 128K Instruct
-
- -
- Qwen1.5 72B Chat
-
- -
- StableLM 2 Zephyr 1.6B
-
- -
- StableLM Zephyr 3B
-
- -
- StarCoder 15.5B
-
- -
- StarCoder 7B
-
- -
- Traditional Chinese Llama2
-
- -
- Capybara 34B
-
- -
- Yi Large
-
-
-
-
+
Groq - Groq offer a free tier with limit of tokens and
+ request per minutes. The rate is plenty for a chatbot. 30 messages and
+ between 6 000 and 30 000 tokens per minute. Per tokens coming soon.
-
Hugging face - You can also use custom endpoints. I only
- tested hugging face but in theory, as long as the key is valid and it use
- the openai api, it should work. This part need some testing and
- improvement.
+
+
Google - Like Groq, Google offer a free tier with limit
+ of tokens and request per minutes. The rate is plenty for a chatbot. 15
+ messages and 1 000 000 tokens per minute. Per tokens also available.
+
+
+
Perplexity - Perplexity do not offer a free tier or
+ credits. Perplexity offer what they call 'online' models that can search
+ online. So you can ask for the current weather for example. Those models
+ have additional cost of 5$ per 1 000 requests.
+
+
+
Fireworks - Fireworks AI offer 1$ of free credits when
+ creating an account. Firework AI have a lot of open source models. I may
+ add fine tuned models in the future.
+
+
+
Custom endpoint - You can also use custom endpoints as long as the key is valid and it use
+ the openai api.
+
+
+
Nvidia NIM - Available soon.
Goose AI - Chat API will be available soon.
+