Single GPU Billion-scale Model Training via Parameter-Efficient Finetuning
How to take advantage of large foundation models with the help of parameter-efficient finetuning.
In the tutorial, we will use combine IA^3, BitFit, and gradient checkpointing to finetune FLAN-T5-XL.