Removed subscription. Changed the help and welcome page

Adrien Bouvais 2024-07-15 11:54:48 +02:00
parent 0355cf374e
commit 6c595f6b17
5 changed files with 556 additions and 494 deletions


@@ -419,6 +419,7 @@ func generateTermAndServiceChatHTML() string {
<h2>7. Data Privacy</h2>
<p>- We are not responsible for any data leaks or breaches that may occur. Users are advised to use the App at their own risk.</p>
<p>- We do not store any personal data unless explicitly stated otherwise.</p>
<p>- Data includes messages, conversations, usage, keys, email, name, avatar.</p>
<h2>8. Changes to the Terms</h2>
<p>We reserve the right to modify these Terms at any time. Any changes will be effective immediately upon posting the updated Terms on our website or within the App. Your continued use of the App after any such changes constitutes your acceptance of the new Terms.</p>


@@ -84,6 +84,9 @@ func CreateNewStripeCustomer(name string, email string) string {
}
func IsCurrentUserSubscribed(c *fiber.Ctx) (bool, bool) {
// Everybody is premium for now
return true, false
var user User
err := edgeGlobalClient.WithGlobals(map[string]interface{}{"ext::auth::client_token": c.Cookies("jade-edgedb-auth-token")}).QuerySingle(edgeCtx, "SELECT global currentUser { stripe_id } LIMIT 1;", &user)
if err != nil {


@@ -1,95 +1,99 @@
<h1>JADE: The First Multi-Model Chatbot</h1>
<p>JADE was built with simplicity in mind. The goal is to have a minimalist chatbot that supports all models without
unnecessary features like importing files or images. This focus on simplicity allows us to improve how we use AI
chatbots in other ways.</p>
<p>One of my main concerns was the variety of models available, each excelling in different areas. Using all of them can
be impractical and very expensive.</p>
<h2>Multi-Models</h2>
<p>To address this, I created the first Multi-Model chatbot. The idea is to use multiple models within the same
conversation. Here are the key points:</p>
<ol>
<li>When asking a question, you can query multiple models and compare their responses to choose the best one.</li>
<li>The selected response can be used as the basis for the next message across all models. For example, a response
from GPT-4 can be used by Claude Haiku in the next interaction.</li>
</ol>
<h2>API Keys</h2>
<p>To use JADE, you need to sign up for an API key from a provider. There are 5 providers available:</p>
<ul>
<li>OpenAI</li>
<li>Anthropic</li>
<li>Mistral</li>
<li>Groq</li>
<li>Google</li>
<li>Custom endpoint (premium)</li>
<li>GooseAI (coming soon)</li>
</ul>
<p>You can enter an API key in the settings menu. Once entered, you get access to all models from that provider. Check the
appendix for more details about available models.</p>
<h2>Bots</h2>
<p>Once you enter an API key, you gain access to all models from that provider. In the Bots menu (next to the send
button), you'll find a list of all your Bots.</p>
<p>To create a new bot, click the "+" button, enter a name and a model, and optionally set a temperature and a system
prompt. Once created, you can select a bot by clicking on it and reorder it by dragging. Hold SHIFT and click to
select multiple bots. You can delete selected bots by clicking the trash can icon.</p>
<p>
To use JADE, you need to sign up for an API key. You can enter an API key in
the settings menu. Once entered, you get access to all models from that provider.
</p>
<a
class="button is-small is-primary"
href="https://openai.com/index/openai-api/"
target="_blank"
>
Get OpenAI API key
</a>
<a
class="button is-small is-primary"
href="https://console.anthropic.com/"
target="_blank"
>
Get Anthropic API key
</a>
<a
class="button is-small is-primary"
href="https://console.mistral.ai/"
target="_blank"
>
Get Mistral API key
</a>
<a
class="button is-small is-primary"
href="https://console.groq.com/"
target="_blank"
>
Get Groq API key
</a>
<a
class="button is-small is-primary"
href="https://aistudio.google.com/app/apikey"
target="_blank"
>
Get Google API key
</a>
<a
class="button is-small is-primary"
href="https://docs.perplexity.ai/docs/getting-started"
target="_blank"
>
Get Perplexity API key
</a>
<a
class="button is-small is-primary"
href="https://fireworks.ai/login"
target="_blank"
>
Get Fireworks API key
</a>
<h2>Conversations</h2>
<p>A conversation is a list of messages. A Default conversation is created when you first sign in. You can create a new
conversation by clicking the "+" button and delete one by clicking the trash can icon. The Default conversation
cannot be deleted but can be renamed.</p>
<p>
A conversation is a list of messages. A Default conversation is created when
you first sign in. You can create a new conversation by clicking the "+"
button and delete one by clicking the trash can icon. The Default conversation
cannot be deleted but can be renamed.
</p>
<h2>Bots</h2>
<p>
In the Bot menu (next to the send button), you'll find a list of all your
Bots.
</p>
<p>
To create a new bot, click the "+" button, enter a name and a model, and
optionally set a temperature and a system prompt. Once created, you can select
a bot by clicking on it and reorder it by dragging. Hold SHIFT and click to
select multiple bots. You can delete selected bots by clicking the trash can
icon.
</p>
<h2>Multi-Models</h2>
<ol>
<li>
When asking a question, you can query multiple models and compare their
responses to choose the best one.
</li>
<li>
The selected response can be used as the basis for the next message across
all models. For example, a response from GPT-4 can be used by Claude Haiku
in the next interaction.
</li>
</ol>
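The two Multi-Models steps above can be sketched in Go as follows. This is a minimal sketch under stated assumptions, not JADE's actual implementation: the `Bot` and `Message` types, `AskAll`, `Select`, and the `echoAsk` stand-in are hypothetical names introduced for illustration, and no provider API is called. The bot fields follow the ones the help page lists (name, model, temperature, system prompt).

```go
package main

import (
	"fmt"
	"sync"
)

// Bot holds the settings the help page lists for a bot (hypothetical type).
type Bot struct {
	Name         string
	Model        string
	Temperature  float64
	SystemPrompt string
}

// Message is one turn in the shared conversation (hypothetical type).
type Message struct {
	Role    string // "user" or "assistant"
	Content string
	Bot     string // name of the bot that replied; empty for user turns
}

// AskFunc stands in for a provider API call. It is an assumption of this
// sketch; a real implementation would call the provider's HTTP API.
type AskFunc func(bot Bot, history []Message) string

// AskAll sends the same history to every selected bot concurrently and
// returns the replies in the same order as bots.
func AskAll(bots []Bot, history []Message, ask AskFunc) []Message {
	replies := make([]Message, len(bots))
	var wg sync.WaitGroup
	for i, bot := range bots {
		wg.Add(1)
		go func(i int, bot Bot) {
			defer wg.Done()
			// Each goroutine writes only to its own slice index.
			replies[i] = Message{Role: "assistant", Content: ask(bot, history), Bot: bot.Name}
		}(i, bot)
	}
	wg.Wait()
	return replies
}

// Select appends the chosen reply to the history, so that every bot
// sees it as context on the next turn.
func Select(history []Message, replies []Message, choice int) []Message {
	return append(history, replies[choice])
}

// echoAsk is a stand-in that reports which model answered and how many
// messages of context it received.
func echoAsk(bot Bot, history []Message) string {
	return fmt.Sprintf("%s saw %d message(s)", bot.Model, len(history))
}

func main() {
	bots := []Bot{
		{Name: "fast", Model: "claude-3-haiku-20240307", Temperature: 0.2},
		{Name: "smart", Model: "gpt-4o", Temperature: 0.7},
	}
	history := []Message{{Role: "user", Content: "What is a goroutine?"}}

	replies := AskAll(bots, history, echoAsk)
	for _, r := range replies {
		fmt.Println(r.Bot+":", r.Content)
	}

	// Keep the second reply; it becomes context for all bots next turn.
	history = Select(history, replies, 1)
	fmt.Println(len(history), "messages in history")
}
```

Keeping a single shared history, rather than one history per model, is what lets a reply from one model serve as context for another in the next turn.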
<h2>Usage</h2>
<p>In the Usage menu, you can see the usage of your account, including the number of messages sent and received and the
associated API costs.</p>
<h2>Appendix</h2>
<p>The following is a list of available models for each provider:</p>
<ul>
<li>OpenAI:
<ul>
<li>gpt-3.5-turbo</li>
<li>gpt-4</li>
<li>gpt-4-turbo</li>
<li>gpt-4o</li>
</ul>
</li>
<li>Anthropic:
<ul>
<li>claude-3-haiku-20240307</li>
<li>claude-3-sonnet-20240229</li>
<li>claude-3-opus-20240229</li>
</ul>
</li>
<li>Mistral:
<ul>
<li>open-mistral-7b</li>
<li>open-mixtral-8x7b</li>
<li>open-mixtral-8x22b</li>
<li>mistral-small-latest</li>
<li>mistral-large-latest</li>
<li>codestral-latest</li>
</ul>
</li>
<li>Groq:
<ul>
<li>llama3-8b-8192</li>
<li>llama3-70b-8192</li>
<li>gemma-7b-it</li>
<li>mixtral-8x7b-32768</li>
</ul>
</li>
<li>Google:
<ul>
<li>gemini-1.5-pro</li>
<li>gemini-1.5-flash</li>
<li>gemini-1.0-pro</li>
</ul>
</li>
<li>Inference Endpoints (More custom to come)</li>
</ul>
<p>You can contact me at adrien.bouvais.pro@gmail.com if you want to add a new provider or if you have any questions.
<p>
In the Usage menu, you can see the usage of your account, including the number
of messages sent and received and the associated API costs.
</p>
<p>
You can contact me at adrien.bouvais@bouvai.com if you want to add a new
provider or if you have any questions.
</p>


@@ -110,6 +110,7 @@
</form>
<p id="api-keys-status"></p>
</div>
<!-- Removed subscription for now
{% if isBasic or isPremium %}
<a class="button is-small mt-1" href="{{ StripeSubLink }}" target="_blank">
<span class="icon is-small" {% if isPremium %}style="color: #b00202" {%else%}style="color: #126d0f"
@@ -127,6 +128,7 @@
<span>Subscribe to JADE</span>
</a>
{% endif %}
-->
<a class="button is-small mt-1" hx-get="/generateTermAndService" hx-target="#chat-container"
hx-swap="outerHTML" hx-trigger="click">
<span class="icon is-small">


@@ -1,425 +1,477 @@
<h1 class="title is-1">JADE: The First Multi-Model Chatbot</h1>
<br><br>
<p>The world of Large Language Models (LLMs) is vast and exciting, with each model having unique strengths and
weaknesses. However, this variety presents a challenge: using all available LLMs is practically impossible due to
cost and complexity. Wouldn't it be incredible to have an easy way to experiment with different models, compare
their responses, and even choose the best model for a specific task?</p>
<h1 class="title is-1">JADE: Simple Multi-Model Chatbot</h1>
<br /><br />
<p>
I often use LLMs and quickly found myself asking GPT4, Gemini and Claude the
same question. I wanted to be able to ask the same question to multiple
models, compare their answers and pick the best one.
</p>
<p>This is precisely why JADE was built. With a focus on simplicity, JADE eliminates unnecessary features like file or
image uploads, allowing you to interact with a variety of LLMs. This streamlined approach unlocks the
potential to compare models, leverage their individual strengths, and even mitigate biases through multi-message
conversations.</p>
<p>
So I built a simple chatbot that uses different LLM APIs. You can create bots
with a specific model and prompt, then select as many bots as you like to
answer a question. You can compare the answers, choose the best one, and
continue the conversation from it, so an answer from one model becomes the
context for a question to another model.
</p>
<h2>Multi-Models</h2>
<p>JADE is the first Multi-Model chatbot. The idea is to use multiple models within the same conversation. Here are the
key points:</p>
<p>
JADE is a simple Multi-Model chatbot. The idea is to use multiple models within
the same conversation. Here are the key points:
</p>
<ol>
<li>When asking a question, you can query multiple models and compare their responses to choose the best one.</li>
<li>The selected response can be used as the basis for the next message across all models.</li>
<li>
When asking a question, you can query multiple models and compare their
responses to choose the best one.
</li>
<li>
The selected response can be used for the next message across all models.
</li>
</ol>
<p>For example, a response from GPT-4 Omni can be used by Claude Haiku in the next interaction.</p>
<p>This approach offers several benefits. First, it ensures you always have access to the best possible response by
leveraging the strengths of different models. Second, it provides a more comprehensive understanding of a topic by
considering various perspectives. Finally, using responses from one model as context for another can lead to more
engaging and insightful conversations.</p>
<p>For example, a response from GPT-4 Omni can be used by Claude Haiku.</p>
<p>
I have been using the app myself for months now, and it has been a great
experience at minimal cost (around $4/month) across all APIs. The app itself
is totally free to use for now. I may add free and paid tiers with message
limits in the future, but I am aiming for around $1-3 per month.
</p>
<a class="button is-primary mt-2 mb-2" href="/signin">
Try JADE now for free!
Try JADE now for free!
</a>
<br><br>
<br /><br />
<h2>More information</h2>
<ul>
<li>
<h3>Get access to all models.<button class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('all-models-details')">
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button></h3>
<p id="all-models-details" style="display:none;">With JADE, you can easily switch between models like GPT 3.5 or
4o, Gemini, Llama, Mistral, Claude, and more, even custom endpoints.
<br><br>This means you can choose the best model for your specific needs, whether it's for general
knowledge, creative writing, or technical expertise. Having access to multiple models allows you to take
advantage of their unique strengths and
weaknesses,
ensuring you get the most accurate and relevant responses. (See all models available in the last
section)<br><br>
</p>
</li>
<li>
<h3>Get the best answer from multiple models.<button class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('multi-models-details')">
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button></h3>
<p id="multi-models-details" style="display:none;">You can ask a question and receive responses from several
models at once, enabling
you to compare their answers and choose the most suitable one. <br><br>This feature is particularly useful
for
complex queries where different models might offer unique insights or solutions.<br><br></p>
</li>
<li>
<h3>Even from the same model.<button class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('same-models-details')">
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button></h3>
<p id="same-models-details" style="display:none;">The core feature of JADE is the bots. Each bot has
a name, model,
temperature and system prompt. <br><br>You can create as many bots as you want and select as many as you like to answer
each
question. For example, you can create several bots on the same model that each respond in a different language.<br><br></p>
</li>
<li>
<h3>Reduce Hallucination.<button class="button is-small ml-2 is-primary is-outlined"
onclick="toggleDetails('reduce-hallucination-details')"><span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button></h3>
<p id="reduce-hallucination-details" style="display:none;">AI models sometimes generate information that is
inaccurate or misleading, a phenomenon known as "hallucination." <br><br>By using multiple models, JADE
reduces each
model's bias. This ensures that the responses you receive are more reliable and trustworthy.<br><br></p>
</li>
<li>
<h3>Pay only for what you use or not at all.<button class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('flexible-pricing-details')">
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button></h3>
<p id="flexible-pricing-details" style="display:none;">JADE uses provider APIs, so you get access to free credits or tiers
depending on the provider (see next section). This is particularly beneficial for users who may not need to
use the chatbot extensively. Once the free credits are used up, you pay based on the length of your message and the
generated response, measured in tokens (a token is around 3 characters). Groq and Google also offer free tiers that
are enough for conversation. <br><br>JADE starts with a free tier that allows you to send up to 200 messages
a month. For more intensive use, you can upgrade for just $0.95/month.<br><br></p>
</li>
<li>
<h3>All providers and models.<button class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('provider-details')">
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button></h3>
<div id="provider-details" style="display:none; overflow-x: hidden;">
<div class="columns">
<div class="column is-two-thirds">
<strong>OpenAI</strong> - OpenAI offers $5 in credits when you create an API account,
enough for around 10 000 small questions to GPT-4 Omni or 100 000 to GPT-3.5 Turbo.
</div>
<div class="column">
<ul>
<li>
<strong>GPT 4 Omni</strong>
</li>
<li>
<strong>GPT 4 Turbo</strong>
</li>
<li>
<strong>GPT 4</strong>
</li>
<li>
<strong>GPT 3.5 Turbo</strong>
</li>
</ul>
</div>
</div>
<br>
<div class="columns">
<div class="column is-two-thirds">
<strong>Anthropic</strong> - Anthropic offers $5 in credits when you create an API
account, enough for around 2 000 small questions to Claude 3 Opus or 120 000 to Claude Haiku.
</div>
<div class="column">
<ul>
<li>
<strong>Claude 3 Opus</strong>
</li>
<li>
<strong>Claude 3 Sonnet</strong>
</li>
<li>
<strong>Claude 3 Haiku</strong>
</li>
</ul>
</div>
</div>
<br>
<div class="columns">
<div class="column is-two-thirds">
<strong>Mistral</strong> - Mistral does not offer free credits.
</div>
<div class="column">
<ul>
<li>
<strong>Mixtral 8x22b</strong>
</li>
<li>
<strong>Mixtral 8x7b</strong>
</li>
<li>
<strong>Mistral 7b</strong>
</li>
<li>
<strong>Mistral Large</strong>
</li>
<li>
<strong>Mistral Small</strong>
</li>
<li>
<strong>Codestral</strong>
</li>
</ul>
</div>
</div>
<br>
<div class="columns">
<div class="column is-two-thirds">
<strong>Groq</strong> - Groq offers a free tier with per-minute limits on tokens and requests.
The limits are plenty for a chatbot: 30 messages and between 6 000 and 30 000 tokens per
minute. Per-token pricing is coming soon.
</div>
<div class="column">
<ul>
<li>
<strong>Llama 3 70b</strong>
</li>
<li>
<strong>Llama 3 8b</strong>
</li>
<li>
<strong>Mixtral 8x7b</strong>
</li>
<li>
<strong>Gemma 7b</strong>
</li>
</ul>
</div>
</div>
<br>
<div class="columns">
<div class="column is-two-thirds">
<strong>Google</strong> - Like Groq, Google offers a free tier with per-minute limits on tokens and
requests. The limits are plenty for a chatbot: 15 messages and 1 000 000
tokens per minute. Per-token pricing is also available.
</div>
<div class="column">
<ul>
<li>
<strong>Gemini 1.5 pro</strong>
</li>
<li>
<strong>Gemini 1.5 flash</strong>
</li>
<li>
<strong>Gemini 1.0 pro</strong>
</li>
</ul>
</div>
</div>
<br>
<div class="columns">
<div class="column is-two-thirds">
<strong>Perplexity</strong> - Perplexity does not offer a free tier or credits. Perplexity
offers what they call 'online' models that can search the web, so you can ask for the current
weather, for example. Those models have an additional cost of $5 per 1 000 requests.
</div>
<div class="column">
<ul>
<li>
<strong>Sonar Large</strong>
</li>
<li>
<strong>Sonar Large Online</strong>
</li>
<li>
<strong>Sonar Small</strong>
</li>
<li>
<strong>Sonar Small Online</strong>
</li>
<li>
<strong>Llama 70b</strong>
</li>
<li>
<strong>Llama 7b</strong>
</li>
<li>
<strong>Mixtral 8x7b</strong>
</li>
</ul>
</div>
</div>
<br>
<div class="columns">
<div class="column is-two-thirds">
<strong>Fireworks</strong> - Fireworks AI offers $1 of free credits when you create an account.
Fireworks AI has a lot of open-source models. I may add fine-tuned models in the future.
</div>
<div class="column">
<ul>
<li>
<strong>FireLLaVA-13B</strong>
</li>
<li>
<strong>Mixtral MoE 8x7B Instruct</strong>
</li>
<li>
<strong>Mixtral MoE 8x22B Instruct</strong>
</li>
<li>
<strong>Llama 3 70B Instruct</strong>
</li>
<li>
<strong>Bleat</strong>
</li>
<li>
<strong>Chinese Llama 2 LoRA 7B</strong>
</li>
<li>
<strong>DBRX Instruct</strong>
</li>
<li>
<strong>Gemma 7B Instruct</strong>
</li>
<li>
<strong>Hermes 2 Pro Mistral 7b</strong>
</li>
<li>
<strong>Japanese StableLM Instruct Beta 70B</strong>
</li>
<li>
<strong>Japanese Stable LM Instruct Gamma 7B</strong>
</li>
<li>
<strong>Llama 2 13B French</strong>
</li>
<li>
<strong>Llama2 13B Guanaco QLoRA GGML</strong>
</li>
<li>
<strong>Llama 7B Summarize</strong>
</li>
<li>
<strong>Llama 2 13B</strong>
</li>
<li>
<strong>Llama 2 13B Chat</strong>
</li>
<li>
<strong>Llama 2 70B Chat</strong>
</li>
<li>
<strong>Llama 2 7B</strong>
</li>
<li>
<strong>Llama 2 7B Chat</strong>
</li>
<li>
<strong>Llama 3 70B Instruct (HF version)</strong>
</li>
<li>
<strong>Llama 3 8B (HF version)</strong>
</li>
<li>
<strong>Llama 3 8B Instruct</strong>
</li>
<li>
<strong>Llama 3 8B Instruct (HF version)</strong>
</li>
<li>
<strong>LLaVA V1.6 Yi 34B</strong>
</li>
<li>
<strong>Mistral 7B</strong>
</li>
<li>
<strong>Mistral 7B Instruct</strong>
</li>
<li>
<strong>Mistral 7B Instruct v0.2</strong>
</li>
<li>
<strong>Mistral 7B Instruct v0p3</strong>
</li>
<li>
<strong>Mixtral MoE 8x22B</strong>
</li>
<li>
<strong>Mixtral MoE 8x22B Instruct (HF version)</strong>
</li>
<li>
<strong>Mixtral MoE 8x7B</strong>
</li>
<li>
<strong>Mixtral MoE 8x7B Instruct (HF version)</strong>
</li>
<li>
<strong>MythoMax L2 13b</strong>
</li>
<li>
<strong>Nous Hermes 2 - Mixtral 8x7B - DPO (fp8)</strong>
</li>
<li>
<strong>Phi 3 Mini 128K Instruct</strong>
</li>
<li>
<strong>Phi 3 Vision 128K Instruct</strong>
</li>
<li>
<strong>Qwen1.5 72B Chat</strong>
</li>
<li>
<strong>StableLM 2 Zephyr 1.6B</strong>
</li>
<li>
<strong>StableLM Zephyr 3B</strong>
</li>
<li>
<strong>StarCoder 15.5B</strong>
</li>
<li>
<strong>StarCoder 7B</strong>
</li>
<li>
<strong>Traditional Chinese Llama2</strong>
</li>
<li>
<strong>Capybara 34B</strong>
</li>
<li>
<strong>Yi Large</strong>
</li>
</ul>
</div>
</div>
<br>
<strong>Hugging Face</strong> - You can also use custom endpoints. I have only tested Hugging Face, but in
theory, as long as the key is valid and the endpoint uses the OpenAI API format, it should work. This part needs some
testing and improvement.
<br>
<strong>Goose AI</strong> - Chat API will be available soon.
<br>
<li>
<h3>
Get access to all models.<button
class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('all-models-details')"
>
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button>
</h3>
<p id="all-models-details" style="display: none">
With JADE, you can easily switch between models like GPT 3.5 or 4o,
Gemini, Llama, Mistral, Claude, and more, even custom endpoints.
<br /><br />This means you can choose the best model for your specific
needs, whether it's for general knowledge, creative writing, or technical
expertise. Having access to multiple models allows you to take advantage
of their unique strengths and weaknesses, ensuring you get the most
accurate and relevant responses. (See all models available in the last
section)<br /><br />
</p>
</li>
<li>
<h3>
Get the best answer from multiple models.<button
class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('multi-models-details')"
>
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button>
</h3>
<p id="multi-models-details" style="display: none">
You can ask a question and receive responses from several models at once,
enabling you to compare their answers and choose the most suitable one.
<br /><br />This feature is particularly useful for complex queries where
different models might offer unique insights or solutions.<br /><br />
</p>
</li>
<li>
<h3>
Even from the same model.<button
class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('same-models-details')"
>
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button>
</h3>
<p id="same-models-details" style="display: none">
The core feature of JADE is the bots. Each bot has a name, model,
temperature and system prompt. <br /><br />You can create as many bots as
you want and select as many as you like to answer each question. For example,
you can create several bots on the same model that each respond in a different language.<br /><br />
</p>
</li>
<li>
<h3>
Reduce Hallucination.<button
class="button is-small ml-2 is-primary is-outlined"
onclick="toggleDetails('reduce-hallucination-details')"
>
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button>
</h3>
<p id="reduce-hallucination-details" style="display: none">
AI models sometimes generate information that is inaccurate or misleading,
a phenomenon known as "hallucination." <br /><br />By using multiple
models, JADE reduces each model's bias. This ensures that the responses
you receive are more reliable and trustworthy.<br /><br />
</p>
</li>
<li>
<h3>
Pay only for what you use or not at all.<button
class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('flexible-pricing-details')"
>
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button>
</h3>
<p id="flexible-pricing-details" style="display: none">
JADE uses provider APIs, so you get access to free credits or tiers depending
on the provider (see next section). This is particularly beneficial for users
who may not need to use the chatbot extensively. Once the free credits are
used up, you pay based on the length of your message and the generated
response, measured in tokens (a token is around 3 characters). Groq and Google
also offer free tiers that are enough for conversation.
</p>
</li>
<li>
<h3>
All providers and models.<button
class="button ml-2 is-small is-primary is-outlined"
onclick="toggleDetails('provider-details')"
>
<span class="icon is-small">
<i class="fa-solid fa-info"></i>
</span>
</button>
</h3>
<div id="provider-details" style="display: none; overflow-x: hidden">
<div class="columns">
<div class="column is-two-thirds">
<strong>OpenAI</strong> - OpenAI offers $5 in credits when you create an API
account, enough for around 10 000 small questions to GPT-4 Omni or 100 000 to
GPT-3.5 Turbo.
</div>
</li>
<div class="column">
<ul>
<li>
<strong>GPT 4 Omni</strong>
</li>
<li>
<strong>GPT 4 Turbo</strong>
</li>
<li>
<strong>GPT 4</strong>
</li>
<li>
<strong>GPT 3.5 Turbo</strong>
</li>
</ul>
</div>
</div>
<br />
<div class="columns">
<div class="column is-two-thirds">
<strong>Anthropic</strong> - Anthropic offers $5 in credits when you create
an API account, enough for around 2 000 small questions to Claude 3 Opus or
120 000 to Claude Haiku.
</div>
<div class="column">
<ul>
<li>
<strong>Claude 3 Opus</strong>
</li>
<li>
<strong>Claude 3.5 Sonnet</strong>
</li>
<li>
<strong>Claude 3 Haiku</strong>
</li>
</ul>
</div>
</div>
<br />
<div class="columns">
<div class="column is-two-thirds">
<strong>Mistral</strong> - Mistral does not offer free credits.
</div>
<div class="column">
<ul>
<li>
<strong>Mixtral 8x22b</strong>
</li>
<li>
<strong>Mixtral 8x7b</strong>
</li>
<li>
<strong>Mistral 7b</strong>
</li>
<li>
<strong>Mistral Large</strong>
</li>
<li>
<strong>Mistral Small</strong>
</li>
<li>
<strong>Codestral</strong>
</li>
</ul>
</div>
</div>
<br />
<div class="columns">
<div class="column is-two-thirds">
<strong>Groq</strong> - Groq offers a free tier with per-minute limits on
tokens and requests. The limits are plenty for a chatbot: 30 messages
and between 6 000 and 30 000 tokens per minute. Per-token pricing is coming
soon.
</div>
<div class="column">
<ul>
<li>
<strong>Llama 3 70b</strong>
</li>
<li>
<strong>Llama 3 8b</strong>
</li>
<li>
<strong>Mixtral 8x7b</strong>
</li>
<li>
<strong>Gemma 7b</strong>
</li>
</ul>
</div>
</div>
<br />
<div class="columns">
<div class="column is-two-thirds">
<strong>Google</strong> - Like Groq, Google offers a free tier with
per-minute limits on tokens and requests. The limits are plenty for a
chatbot: 15 messages and 1 000 000 tokens per minute. Per-token pricing is
also available.
</div>
<div class="column">
<ul>
<li>
<strong>Gemini 1.5 pro</strong>
</li>
<li>
<strong>Gemini 1.5 flash</strong>
</li>
<li>
<strong>Gemini 1.0 pro</strong>
</li>
</ul>
</div>
</div>
<br />
<div class="columns">
<div class="column is-two-thirds">
<strong>Perplexity</strong> - Perplexity does not offer a free tier or
credits. Perplexity offers what they call 'online' models that can
search the web, so you can ask for the current weather, for example.
Those models have an additional cost of $5 per 1 000 requests.
</div>
<div class="column">
<ul>
<li>
<strong>Sonar Large</strong>
</li>
<li>
<strong>Sonar Large Online</strong>
</li>
<li>
<strong>Sonar Small</strong>
</li>
<li>
<strong>Sonar Small Online</strong>
</li>
<li>
<strong>Llama 70b</strong>
</li>
<li>
<strong>Llama 7b</strong>
</li>
<li>
<strong>Mixtral 8x7b</strong>
</li>
</ul>
</div>
</div>
<br />
<div class="columns">
<div class="column is-two-thirds">
<strong>Fireworks</strong> - Fireworks AI offers $1 of free credits
when you create an account. Fireworks AI has a lot of open-source
models. I may add fine-tuned models in the future.
</div>
<div class="column">
<ul>
<li>
<strong>FireLLaVA-13B</strong>
</li>
<li>
<strong>Mixtral MoE 8x7B Instruct</strong>
</li>
<li>
<strong>Mixtral MoE 8x22B Instruct</strong>
</li>
<li>
<strong>Llama 3 70B Instruct</strong>
</li>
<li>
<strong>Bleat</strong>
</li>
<li>
<strong>Chinese Llama 2 LoRA 7B</strong>
</li>
<li>
<strong>DBRX Instruct</strong>
</li>
<li>
<strong>Gemma 7B Instruct</strong>
</li>
<li>
<strong>Hermes 2 Pro Mistral 7b</strong>
</li>
<li>
<strong>Japanese StableLM Instruct Beta 70B</strong>
</li>
<li>
<strong>Japanese Stable LM Instruct Gamma 7B</strong>
</li>
<li>
<strong>Llama 2 13B French</strong>
</li>
<li>
<strong>Llama2 13B Guanaco QLoRA GGML</strong>
</li>
<li>
<strong>Llama 7B Summarize</strong>
</li>
<li>
<strong>Llama 2 13B</strong>
</li>
<li>
<strong>Llama 2 13B Chat</strong>
</li>
<li>
<strong>Llama 2 70B Chat</strong>
</li>
<li>
<strong>Llama 2 7B</strong>
</li>
<li>
<strong>Llama 2 7B Chat</strong>
</li>
<li>
<strong>Llama 3 70B Instruct (HF version)</strong>
</li>
<li>
<strong>Llama 3 8B (HF version)</strong>
</li>
<li>
<strong>Llama 3 8B Instruct</strong>
</li>
<li>
<strong>Llama 3 8B Instruct (HF version)</strong>
</li>
<li>
<strong>LLaVA V1.6 Yi 34B</strong>
</li>
<li>
<strong>Mistral 7B</strong>
</li>
<li>
<strong>Mistral 7B Instruct</strong>
</li>
<li>
<strong>Mistral 7B Instruct v0.2</strong>
</li>
<li>
<strong>Mistral 7B Instruct v0p3</strong>
</li>
<li>
<strong>Mixtral MoE 8x22B</strong>
</li>
<li>
<strong>Mixtral MoE 8x22B Instruct (HF version)</strong>
</li>
<li>
<strong>Mixtral MoE 8x7B</strong>
</li>
<li>
<strong>Mixtral MoE 8x7B Instruct (HF version)</strong>
</li>
<li>
<strong>MythoMax L2 13b</strong>
</li>
<li>
<strong>Nous Hermes 2 - Mixtral 8x7B - DPO (fp8)</strong>
</li>
<li>
<strong>Phi 3 Mini 128K Instruct</strong>
</li>
<li>
<strong>Phi 3 Vision 128K Instruct</strong>
</li>
<li>
<strong>Qwen1.5 72B Chat</strong>
</li>
<li>
<strong>StableLM 2 Zephyr 1.6B</strong>
</li>
<li>
<strong>StableLM Zephyr 3B</strong>
</li>
<li>
<strong>StarCoder 15.5B</strong>
</li>
<li>
<strong>StarCoder 7B</strong>
</li>
<li>
<strong>Traditional Chinese Llama2</strong>
</li>
<li>
<strong>Capybara 34B</strong>
</li>
<li>
<strong>Yi Large</strong>
</li>
</ul>
</div>
</div>
<br />
<strong>Hugging Face</strong> - You can also use custom endpoints. I have only
tested Hugging Face, but in theory, as long as the key is valid and the endpoint
uses the OpenAI API format, it should work. This part needs some testing and
improvement.
<br />
<br />
<strong>Goose AI</strong> - Chat API will be available soon.
<br />
</div>
</li>
</ul>
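The pay-per-token pricing described in the "Pay only for what you use" item can be estimated with a few lines of Go. This is a rough sketch: the 3-characters-per-token ratio comes from the text above, while the function names and the per-million-token prices are placeholders assumed for illustration and are not any provider's real rates.

```go
package main

import "fmt"

// charsPerToken is the rough ratio given in the text
// ("a token is around 3 characters"). Real tokenizers vary.
const charsPerToken = 3

// EstimateTokens approximates the token count of text, rounding up.
func EstimateTokens(text string) int {
	n := len([]rune(text))
	return (n + charsPerToken - 1) / charsPerToken
}

// CostUSD returns the cost of one exchange. Prices are expressed in
// dollars per million tokens and are supplied by the caller.
func CostUSD(inputTokens, outputTokens int, inputPricePerM, outputPricePerM float64) float64 {
	return float64(inputTokens)*inputPricePerM/1e6 +
		float64(outputTokens)*outputPricePerM/1e6
}

func main() {
	prompt := "Explain what a token is in one sentence."
	reply := "A token is a small chunk of text, often a word fragment, that a model reads or writes."

	in, out := EstimateTokens(prompt), EstimateTokens(reply)

	// 5.0 and 15.0 are placeholder prices, not real provider rates.
	fmt.Printf("input=%d output=%d cost=$%.6f\n", in, out, CostUSD(in, out, 5.0, 15.0))
}
```

Input and output tokens are priced separately here because providers commonly charge different rates for each; pass the same price twice if that does not apply.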
<script>
function toggleDetails(id) {
var element = document.getElementById(id);
if (element.style.display === "none") {
element.style.display = "block";
} else {
element.style.display = "none";
}
function toggleDetails(id) {
var element = document.getElementById(id);
if (element.style.display === "none") {
element.style.display = "block";
} else {
element.style.display = "none";
}
}
</script>