Supervised fine-tuning (or instruction tuning) improves a large language model's capabilities in tasks such as question answering, summarization, and translation. A later preference-tuning stage then refines these capabilities.
supervised fine-tuning
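To make the idea concrete, here is a minimal sketch of how a supervised fine-tuning objective is often set up. It assumes a common convention that the text above does not state: the prompt and the reference response are concatenated, and the next-token cross-entropy loss is computed only on the response tokens. The names `build_example`, `sft_loss`, and the `IGNORE` sentinel are hypothetical and chosen for illustration; real training code would use a deep learning framework rather than plain Python lists.

```python
import math

# Hypothetical sentinel marking label positions that are excluded from the loss.
IGNORE = -100


def build_example(prompt_ids, response_ids):
    """Concatenate prompt and response token ids.

    Labels copy the response tokens and mask every prompt position,
    so only the response contributes to the loss (an assumed convention).
    """
    input_ids = list(prompt_ids) + list(response_ids)
    labels = [IGNORE] * len(prompt_ids) + list(response_ids)
    return input_ids, labels


def sft_loss(logits, labels):
    """Mean next-token cross-entropy over unmasked positions.

    logits[t] holds the model's scores over the vocabulary for the
    token at position t + 1, so labels are shifted by one.
    """
    total, count = 0.0, 0
    for t in range(len(labels) - 1):
        target = labels[t + 1]
        if target == IGNORE:
            continue  # prompt token: not part of the objective
        row = logits[t]
        # Numerically stable log-sum-exp for the softmax normalizer.
        peak = max(row)
        log_z = peak + math.log(sum(math.exp(x - peak) for x in row))
        total += log_z - row[target]
        count += 1
    return total / count if count else 0.0


# Toy usage: a 2-token prompt, a 2-token response, and a vocabulary of 4.
input_ids, labels = build_example([0, 1], [2, 3])
uniform_logits = [[0.0] * 4 for _ in input_ids]
loss = sft_loss(uniform_logits, labels)
```

With uniform scores over a vocabulary of four tokens, the loss equals `math.log(4)`, the value expected from a model that has learned nothing yet. Fine-tuning lowers this number on the response tokens while the masked prompt tokens leave it unaffected.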