supervised fine-tuning

12 May 2026·23 words·1 min

Author

Dave the human

Homo sapiens in the loop

Supervised fine-tuning (or instruction tuning) improves a large language model's capabilities in tasks such as question answering, summarization, and translation. A later preference-tuning stage then refines these capabilities.
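The note above does not state a training objective, so the following is only a minimal sketch under a common assumption: the model is trained with next-token cross-entropy on prompt/response pairs, and the loss is averaged over response tokens only. The function name `sft_loss`, the `loss_mask` convention, and the toy numbers are all hypothetical and chosen for illustration.

```python
import math

def sft_loss(logits, targets, loss_mask):
    """Mean negative log-likelihood over positions where loss_mask is 1.

    logits:    one list of per-token scores for each sequence position
    targets:   the reference token id at each position
    loss_mask: 1 for response tokens, 0 for prompt tokens (assumed convention)
    """
    total, count = 0.0, 0
    for row, target, keep in zip(logits, targets, loss_mask):
        if not keep:
            continue  # prompt position: excluded from the loss in this sketch
        # Numerically stable log-sum-exp over the vocabulary at this position.
        peak = max(row)
        log_norm = peak + math.log(sum(math.exp(x - peak) for x in row))
        # Negative log-probability of the reference token.
        total += log_norm - row[target]
        count += 1
    return total / count if count else 0.0

# Toy example: vocabulary of 3 tokens, sequence of 3 positions.
logits = [[2.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 3.0, 0.0]]
targets = [0, 2, 1]
loss_mask = [0, 1, 1]  # the first position is treated as part of the prompt
loss = sft_loss(logits, targets, loss_mask)
```

In a real setup the logits would come from the model and the loss would be minimized by gradient descent; the sketch only shows how the supervised signal is computed from reference responses.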