Northflank

Deploy apps, databases, and AI/GPU workloads on any cloud.

Category: Infra
Pricing: FREEMIUM
Hosting: Hybrid
Platforms: Web, API, CLI
Models: Model-agnostic
Verified: Jun 14, 2026

Northflank is a developer platform for deploying applications, databases, jobs, and AI/GPU workloads from Git to production. It abstracts away Kubernetes and runs either on Northflank's managed cloud or in your own AWS, GCP, Azure, or bare-metal environment (BYOC), with usage billed by the second. Teams such as Writer, Sentry, and Chai Discovery run on it.

Pros & cons

Pros
  • Apps, databases, jobs & GPUs in one platform
  • BYOC: run in your own cloud or bare metal
  • Per-second billing, no idle GPU charges
  • Abstracts Kubernetes; Git-to-production
  • Free always-on Sandbox tier

Cons
  • Smaller ecosystem than hyperscalers
  • Breadth can mean a learning curve
  • Pay-as-you-go costs need monitoring at scale

Further reading

  • Vercel
    Infra, FREEMIUM

    Frontend cloud for React/Next.js, with edge functions, image optimization, and analytics.

    Next.js-native hosting with fast deploys, edge functions, image optimization, and a free Speed Insights tier. Strong default for the React/Next ecosystem.

    Worth knowing

    Founded in 2015 as ZEIT by Next.js and Socket.IO creator Guillermo Rauch, and rebranded to Vercel in April 2020.

    • hosting
    • edge
    • nextjs
    • ci
  • Modal (Modal Labs)
    Inference, FREEMIUM

    Serverless GPUs. Run training, inference, and batch jobs from Python.

    Define cloud workloads in Python, deploy with one command — GPU access on demand, fast cold starts, fair-share pricing. The default 'I need to fine-tune a model from a Jupyter cell' platform.

    Worth knowing

    Co-founded by Erik Bernhardsson, who built Spotify's recommender; raised a $355M Series C at a $4.65B valuation in 2026.

    • gpu
    • serverless
    • python
    • training
  • Cerebrium
    Infra, PAID

    Serverless GPU infrastructure for real-time AI — voice, video, and LLM workloads.

    A serverless GPU platform for deploying real-time AI workloads — voice agents, video models, and LLMs — with cold starts in seconds, instant autoscaling, and multi-region failover. Bring custom code, Dockerfiles, or frameworks like vLLM and pay per second of compute across 12+ GPU types.

    Worth knowing

    Cape Town-founded and YC-backed; raised an $8.5M seed led by Gradient in 2025 to scale its real-time serverless GPU platform.

    • gpu
    • serverless
    • real-time
    • voice-agents
  • Beam
    Infra, FREEMIUM

    On-demand serverless GPU compute for AI, from Python.

    A serverless cloud for deploying AI inference endpoints, agent sandboxes, task queues, and containerized GPU workloads with a few lines of Python. It handles fast cold starts, autoscaling, and Docker-in-Docker execution across multiple cloud backends, and supports bring-your-own-compute. The Developer tier is free with recurring monthly credit; paid tiers add team features and scale, billed pay-as-you-go by GPU usage.

    Worth knowing

    A YC-backed startup that began in 2021 as Slai before becoming Beam; its beta9 runtime is AGPL-3.0.

    • gpu
    • serverless
    • python
    • inference