Loading…
Fine-tuningAxolotl AI
Open-source post-training for LLMs — LoRA to RL, all from one YAML config.
An open-source (Apache-2.0) framework that streamlines post-training for open-weight models: full fine-tuning, LoRA/QLoRA, preference tuning (DPO, IPO, KTO, ORPO), reinforcement learning (GRPO), reward modeling and quantization-aware training, configured through a single YAML file with no scripting. Wraps Hugging Face Transformers, PEFT, TRL and DeepSpeed, and supports dozens of model families including multimodal vision and audio models.
Pros & cons
Tags
Further reading