Skip to main content

supervised fine-tuning

·23 words·1 min
Dave the human
Author
Dave the human
Homo sapiens in the loop

Supervised fine-tuning (or instruction tuning) improves an Large Language Model capabilities in question-answering, summarization, translation etc. Later, the preference tuning refines these capabilities.


Comments