What is Instruction finetuning
Finetune data quality should be high for better output.
- More Diverse
- Real Data
- More data is better
Following are the steps to prepare your data
Tokeniation: Converting text to number, same/right tokenizer needs to be used based on model. Transformers autotokenizer finds the right tokenizer for the given model.