Closed VeriHeppi closed 2 months ago
Google Cloud (Vertex AI):
Microsoft Azure (OpenAI API):
Amazon Web Services (AWS) (Bedrock)
Free tier doesn't seem to have LLM service
AWS Rapid Ramp Credits:
Meta Llama:
Possible Ireland option in Europe with slightly less cost.
Also has it's own LLM (Amazon Titan Text)
Amazon Bedrock Guardrails for free sensitive information filter and word filters.
Batch support located in US region (-50% price).
IBM Cloud (Watson NLP):
Summary table:
Google Cloud (Vertex AI) | Microsoft Azure (OpenAI API) | Amazon Web Services (Bedrock) | |
---|---|---|---|
Free credits | $300 | $200 | $300 with approval process |
Batch mode (-50% price) | Only Gemini | Only GPT | Yes |
LLM's | Gemini, Mistral(partner) | GTP | Llama, Mistral |
Input (<128k): | Gemini 1.5 Pro: $0.00125 per 1k chars ($0.005 per 1k tokens) Mistral Large (2407) $2 per million tokens ($0.002 per 1k tokens) |
gpt-4o-2024-08-06: $0.0025 per 1k tokens |
Llama 3.1 (70B) $0.00099 Llama 3.1 (405B) $0.00532 Mistral 7B $0.0002 Mistral 8*7B $0.00059 Mistral Large (24.02) $0.0052 per 1k tokens |
Output: | Gemini 1.5 Pro: $0.00375 per 1k chars ($0.015 per 1k tokens) Mistral Large (2407) $6 per million tokens ($0.006 per 1k tokens) |
gpt-4o-2024-08-06: $0.011 per 1k tokens |
Llama 3.1 (70B) $0.00099 Llama 3.1 (405B) $0.016 Mistral 7B $0.00026 Mistral 8*7B $0.00091 Mistral Large (24.02) $0.0156 per 1k tokens |
Fixed pricing table (above).
Notes:
Possible use-case:
Result: Microsoft Azure.
Check pricing for the most popular cloud services, also available free credits to be used for testing.