Skip to content

Fine-tuningLamini

Lamini

Enterprise platform to tune and run open LLMs in your own environment.

Pricing
PAID
Hosting
Hybrid
Platforms
WebAPI
Models
Multi-model
Verified
Jun 15, 2026

Lamini is an enterprise LLM platform for fine-tuning open models and serving them, designed to run on-prem, in a VPC, or on Lamini's cloud — including on AMD GPUs. It pairs tuning (LoRA/PEFT and memory tuning to reduce hallucinations) with an inference stack and agentic pipelines, accessed via a Python client, REST API, or web UI. Built for teams that need to keep models and data in-house.

Pros & cons

  • Runs on-prem or in your own VPC
  • Supports AMD GPUs, not just NVIDIA
  • Memory tuning to cut hallucinations
  • Founded by an MLPerf and ex-NVIDIA team
  • Enterprise-focused pricing
  • Open models only
  • Smaller ecosystem than hyperscalers

Tags

Further reading

View all Fine-tuning
  • View Predibase details
    Fine-tuningPAID

    Predibase

    Predibase (Rubrik)

    Fine-tune open-source LLMs and serve them in production.

    Predibase is an enterprise platform for fine-tuning open-source models and serving them in production. It pairs a post-training stack — supervised fine-tuning plus an end-to-end reinforcement fine-tuning (RFT) flow — with an optimized inference engine, and its open-source LoRAX framework serves many fine-tuned LoRA adapters from a single GPU. Runs as managed SaaS or inside your own VPC.

    Worth knowing

    Founded in 2021 by AI engineers from Google and Uber; acquired by data-security firm Rubrik in June 2025 for a reported $100M+.

    • fine-tuning
    • lora
    • rft
    • inference
    • +1
  • View OpenPipe details
    Fine-tuningFREEMIUM

    OpenPipe

    OpenPipe

    Replace frontier-model spend with a fine-tuned small model.

    Captures your production OpenAI / Anthropic calls, builds a dataset, fine-tunes a small open-weights model on your traffic, then serves the swap behind your existing SDK. The pitch: 10x cost reduction at parity.

    Worth knowing

    Acquired by CoreWeave in September 2025, folding its reinforcement-learning agent-training stack into CoreWeave's AI cloud.

    • fine-tuning
    • cost-reduction
    • drop-in
    • open-weights
  • View Together AI details
    InferenceFREEMIUM

    Together AI

    Together

    Fine-tuning + inference for open-weights models. Broad coverage.

    Hosted inference and fine-tuning across hundreds of open-weights models (Llama, Mistral, DeepSeek, Qwen, etc.). Strong pricing for inference-at-scale; LoRA + full fine-tuning supported.

    Worth knowing

    Co-founded by Stanford's Percy Liang and FlashAttention author Tri Dao; raised $305M at a $3.3B valuation.

    • inference
    • fine-tuning
    • open-weights
    • lora
  • View Tinker details
    Fine-tuningPAID

    Tinker

    Thinking Machines Lab

    Managed fine-tuning API with low-level control over the training loop.

    Tinker is Thinking Machines Lab's training API for fine-tuning open-weight LLMs. It exposes low-level primitives — forward_backward, optim_step, sample — so researchers keep full control of data and algorithms while the service handles distributed GPU scheduling and failure recovery. LoRA-based runs cover models from small Llamas up to large mixture-of-experts like Qwen-235B and Kimi K2, and trained weights can be downloaded.

    Worth knowing

    The debut product of Thinking Machines Lab, the startup founded by ex-OpenAI CTO Mira Murati; launched October 2025.

    • fine-tuning
    • lora
    • post-training
    • research
    • +1