sft-demo

Data Preparation
sample.py: generate train and test data subsets from the original Alpaca dataset.

Model Training
Pretrained model: distilgpt2
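The data preparation step could be sketched roughly as follows. This is a minimal illustration, not the actual contents of sample.py: the function name `split_records`, the `test_fraction` and `seed` parameters, and the toy records are all assumptions made for the example. The record fields (`instruction`, `input`, `output`) follow the published Alpaca dataset layout.

```python
import random


def split_records(records, test_fraction=0.1, seed=42):
    """Shuffle records deterministically and split them into (train, test).

    Hypothetical helper for illustration; the real sample.py may differ.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(records)  # copy so the caller's list is left untouched
    random.Random(seed).shuffle(shuffled)  # seeded RNG keeps the split reproducible
    n_test = max(1, int(len(shuffled) * test_fraction))  # at least one test record
    return shuffled[n_test:], shuffled[:n_test]


# Toy stand-in for Alpaca-style records (assumed data, not the real dataset).
records = [
    {"instruction": f"Task {i}", "input": "", "output": f"Answer {i}"}
    for i in range(20)
]

train, test = split_records(records, test_fraction=0.25, seed=0)
print(len(train), len(test))  # prints: 15 5
```

Fixing the seed means the same train and test subsets are produced on every run, which keeps evaluation results comparable across training runs of distilgpt2.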