shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-gpt-no-packing-epoch-2 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-gpt-no-packing-epoch-1 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt-no-packing-epoch-3 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt-no-packing Text Generation • 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt-no-packing-epoch-2 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt-no-packing-epoch-1 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-gpt-no-packing-epoch-3 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-gpt-no-packing-epoch-2 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-gpt-no-packing-epoch-1 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-gpt-no-packing Text Generation • 266k • Updated Nov 16, 2025
shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-gpt-no-packing Text Generation • 266k • Updated Nov 16, 2025