Multi-dataset training complete - streaming approach

#4
by ziadrone - opened

Trained on 2 datasets: gsm8k, aqua_rat. Final loss: 3.8474

shivash changed pull request status to merged
