SimpleBooks: Long-term dependency book dataset with simplified English vocabulary for word-level language modeling
Paper
•
1911.12391
•
Published
Small pretrained LLM on the SimplyBooks dataset.
Be used as an educational model accessible by almost all computers. Directly, it can be used for text generation. Downstream, it can be fine tuned for writing short stories.
Model evaluations demonstrated progressive split between training loss and validation loss.