Code to training data
You can fetch large amounts of code with Git and pair each source file with its tests by matching file names. You can then follow "Create your own datasets for fine-tuning" to build the dataset and start fine-tuning.
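As a rough illustration of the file-name matching, here is a minimal sketch in Python. It assumes the repository has already been cloned locally (for example with `git clone`), that the code is Python, and that tests follow the common `test_<name>.py` or `<name>_test.py` naming convention. The function name `pair_sources_with_tests` is hypothetical, and other languages or conventions would need different rules.

```python
from pathlib import Path


def pair_sources_with_tests(repo_root):
    """Return (source_path, test_path) pairs matched purely by file name.

    Assumption: tests are named test_<name>.py or <name>_test.py.
    """
    root = Path(repo_root)
    all_files = sorted(root.rglob("*.py"))

    # Index every non-test file by its stem, e.g. "parser" for parser.py.
    # If two source files share a stem, the first one found is kept.
    sources = {}
    for path in all_files:
        stem = path.stem
        if not (stem.startswith("test_") or stem.endswith("_test")):
            sources.setdefault(stem, path)

    pairs = []
    for path in all_files:
        stem = path.stem
        if stem.startswith("test_"):
            target = stem[len("test_"):]
        elif stem.endswith("_test"):
            target = stem[: -len("_test")]
        else:
            continue  # not a test file
        if target in sources:
            pairs.append((sources[target], path))
    return pairs
```

Source files without a matching test are skipped, so only complete pairs are returned.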
The recommended structure for each sample is:
prompt
------
expected answer