Tinfoil

Verifiably private AI inference inside secure hardware enclaves.

Categories: InferenceSecurity
Pricing: FREEMIUM
Source: Open core
Hosting: Cloud
Platforms: WebAPI
Models: Multi-model
Verified: Jun 20, 2026

Tinfoil runs open-weight LLMs inside confidential-computing GPU enclaves so that neither Tinfoil nor the cloud provider can see your prompts or data — and the setup is remotely attestable rather than a policy promise. It offers a private chat, an OpenAI-compatible inference API, and Tinfoil Containers for arbitrary Docker workloads, serving models like GPT-OSS, Llama 3.3, and Kimi K2. The software stack is open source and verifiable.

Pros & cons

Hardware-enforced, verifiable privacy
Open-source, attestable stack
OpenAI-compatible API
Free private chat to try
Runs popular open-weight models

Open-weight models only, no GPT-4/Claude
Smaller model selection than big clouds
Enclave approach adds some overhead

Tinfoil

Together AI

Fireworks AI

OpenRouter