Fine-Tuning Local LLMs with QLoRA: From Experiment to Production GGUF
The full pipeline: dataset curation, QLoRA training on a single A100, evaluation, quantisation to GGUF and serving with vLLM. What the tutorials leave out.
Read more →The full pipeline: dataset curation, QLoRA training on a single A100, evaluation, quantisation to GGUF and serving with vLLM. What the tutorials leave out.
Read more →