Fine-Tuning Local LLMs with QLoRA: From Experiment to Production GGUF

The full pipeline: dataset curation, QLoRA training on a single A100, evaluation, quantisation to GGUF and serving with vLLM. What the tutorials leave out.