Pricing for Private AI
Competitive Pricing: Predictable Costs, no hidden Fees
Overview
Safe Swiss Cloud provides a suite of industry standard AI products as part of its “Private AI” (PAI) product family. It includes conversional AI/chat, access to large language models (LLMs) via API, Enterprise and Developer tools, integration services like MCP servers to make systems accessible to Ai, support for agentic AI including workflows and support services.
PAI Chat Pricing
This web based Chatbot is for use by end users to chat with one of the available Large Language Models.
There is an option to allow the LLM to access the Internet so that it can answer questions with current Internet data. This option can optionally be blocked.
| Service | Description | Units | Price in CHF / EUR |
|---|---|---|---|
| PAI Chat | Chatbot with advanced functionality, including single sign on (SSO), a selection of LLMs – large language models – many with 70 billion parameters or more, document and image uploading for analysis and summaries, chat histories etc.* | Per user / month (on demand) Per user / month (annual) | 35.00 30.00 |
| Web Search | An anonymous web search is provided with the client. A “meta search” server at Safe Swiss Cloud ensures that the web search requests cannot be traced back to the actual user. This search server uses non-tracking web search services like DuckDuckGo, Startpage, Wikipedia etc. Users can turn the web search on and off for each request in the PAI Chat client. This function can be completely disabled for a customer if desired. | Number of requests | No charge |
PAI API Pricing
All prices are in CHF / EUR. The prices are all based on the number (in millions) of input and output tokens used for each model per month. Allows programmers to access more than 25 LLMs via an OpenAI API compatible Application Programming Interface.
There is a minimum charge of CHF 95.- (EUR 100.- for non-Swiss customers) per month for all the input and output tokens consumed for all models. The prices per model are as follows:
| Model | Type | Price per million input tokens (CHF/EUR) | Price per million output tokens (CHF/EUR) | Details |
|---|---|---|---|---|
| apertus-70b | Chat | 0.712 | 2.553 | Optimised for multilingual dialogue use cases. |
| bge-m3 | Embedding | 0.496 | n/a | Optimised for embeddings and parse retrieval with support for Multi-Functionality, Multilinguality, and Multi-Granularity. |
| bge-reranker-v2 | Reranker | 0.009 | n/a | Optimised for Reranker to get relevance score. |
| deepseek-ocr | OCR | 0.443 | 1.770 | Optimised for scanning documents – optical character recognition |
| deepseek-v32 | Chat | 0.708 | 2.124 | Deepseek Version 3.2 |
| gemma-12b-it | Multimodal | 0.310 | 0.496 | Optimised for handling text and image input and generating text output. |
| gemma4-31b | Multimodal | 0.136 | 0.374 | Optimized for handling text and image input and generating text output. |
| glm45-air-110b | Chat | 0.487 | 1.938 | Optimised for chats |
| gpt-oss-120b | Chat | 0.133 | 0.531 | Optimised for powerful reasoning, agentic tasks, and versatile developer use cases. |
| granite-33-8b | Chat | 0.177 | 0.177 | Optimised for Reasoning and instruction-following capabilities. |
| granite-emb-278m | Embedding | 0.089 | n/a | Optimised for Embeddings. |
| granite-vision-2b | Multimodal | 0.089 | 0.089 | Optimized for compact and efficient vision-language model |
| kimi-k2 | Chat | 0.886 | 2.657 | Optimised for multi-lingual chats |
| llama4-maverick | Chat and multimodal | 0.310 | 1.239 | Optimised for text and multimodal experiences. |
| llama4-scout-17b | Chat and multimodal | 0.221 | 0.735 | Optimised for text and multimodal experiences. |
| miner-u25 | Vision – Language | 0.437 | 0.265 | Optimized for document parsing that achieves state-of-the-art accuracy with high computational efficiency. |
| mistral-v03-7b | Chat only | 0.177 | 0.177 | Optimised for multilingual dialogue use cases. |
| qwen3-8b | Reasoning | 0.031 | 0.122 | Optimised for thinking and reasoning. |
| qwq-32b | Reasoning | 1.062 | 1.062 | Optimised for thinking and reasoning. |
| qwen3-vl-235b | Multimodal | 0.805 | 2.300 | Optimised for text and multimodal experiences. |
| whisper-large-v3 | Speech to Text | 0.007 per minute | n/a | For converting speech to text. |
PAI Tools Pricing
| Product | Description | Units | Price in CHF / EUR |
|---|---|---|---|
| PAI Tools for PAI Chat | User Management (self service) IAM integration Role based model access | Fixed | 100.- |
PAI Integration Services Pricing
| Service | Unit | Price |
|---|---|---|
| PAI Integraion e.g. creating an MCP server | Per hour. Based on actual hours worked. | 250.00 |
| MCP Maintenance | Per hour. | Request an estimate. Support Packages can be used for this. |
| Managed MCP Service.Includes security updates, backups, restores, monitoring. | Fixed price per month | Depends on the MCP server. Request quotation |
| Workflow Integrations | Per hour. | Request an estimate. Support Packages can be used for this. |
| Managed workflow service, e.g. based on n8n. Includes security updates, backups, restores, monitoring. | Fixed price per month | Depends on the complexity of the workflow. Request quotation |
PAI Workflow Hosting Pricing
Safe Swiss Cloud provides sovereign and private hosting for various AI workflow solutions like n8n. This allow customers to create agents that automate tasks and workflows.
| Service | Description | Price |
|---|---|---|
| Managed Workflow Hosting n8n | Includes server infrastructure, security, backups, monitoring and application management. | 226.- |
| n8n Enterprise License | This is needed for multi-user installations. | Not included* |
PAI Support
All Safe Swiss Cloud Support packages can be used for PAI support. SLA: 7×24 availability with a maximum response time of 1 hour.
| Annual Support Packages | Included Hours (valid 12 months) | Expiration of support hours | Price/month CHF/EUR | Contract Duration (months) | Billing |
|---|---|---|---|---|---|
| Annual Support 10 | 10 | Yearly | 130 | 12 | Monthly |
| Annual Support 25 | 25 | Yearly | 330 | 12 | Monthly |
| Annual Support 50 | 50 | Yearly | 650 | 12 | Monthly |
| Annual Support 100 | 100 | Yearly | 1’250 | 12 | Monthly |
| Annual Support 200 | 200 | Yearly | 2’420 | 12 | Monthly |
| Annual Support 300 | 300 | Yearly | 3’000 | 12 | Monthly |
Annual support packages, paid monthly: the support hours are for a full year and expire after 12 months.
After 12 months the support package renews automatically for a year, unless cancelled. The notice period is a minimum of 30 days and termination can only be at the end of a calendar month.
All support packages can be used for AI, cloud computing and all Safe Swiss Cloud IT services.
Definitions and Abbreviations:
- PAI = Private AI
- LLM = Large Language Model
- API = Application Programming Interface
- SSO = Single Sign On
- Token = a unit of processing for an AI, typically part of a word