Fine-tuning & Training

Model fine-tuning techniques, LoRA, RLHF, DPO, pre-training strategies, and training infrastructure.