Loading…
Fine-tuning · Unsloth AI
Fine-tune open LLMs 2x faster with far less VRAM. Open source.
An open-source (Apache-2.0) framework for fine-tuning and running open-weight models with custom CUDA kernels — roughly 2x faster training and large VRAM savings, so 7B–13B models fit on a single consumer GPU. Free tier runs on Colab/Kaggle or locally; Pro and Enterprise tiers add multi-GPU and multi-node speedups. Exports to GGUF/Safetensors for llama.cpp, vLLM, and Ollama.
Model support
Fine-tunes open-weight models (LoRA/QLoRA + full) with reduced memory use.
Where it runs
Tags