shivash/enhanced-hybrid-transformer-768d-trained
Safetensors
llama
Multi-dataset training complete - streaming approach
#4 by ziadrone - opened Sep 23, 2025
base: refs/heads/main ← from: refs/pr/4
Files changed: +300411 -0
Multi-dataset training complete - streaming approach (cc68c1a6)
ziadrone, Sep 23, 2025
Trained on 2 datasets: gsm8k, aqua_rat. Final loss: 3.8474
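The PR does not include the training script in this thread, so the following is only a minimal sketch of what a "streaming approach" over two datasets might look like: examples are pulled lazily from each source and interleaved round-robin, so neither dataset has to be held fully in memory. The helper names `interleave_streams` and `fake_stream` are hypothetical, and the placeholder records stand in for real gsm8k and aqua_rat examples; none of this is confirmed to match what the author ran.

```python
from typing import Dict, Iterable, Iterator, Tuple


def interleave_streams(
    streams: Dict[str, Iterable[dict]],
) -> Iterator[Tuple[str, dict]]:
    """Round-robin over named streams, yielding (source, example) pairs.

    Hypothetical helper: a stream is dropped once it is exhausted, and
    iteration ends when every stream has been dropped.
    """
    iterators = {name: iter(stream) for name, stream in streams.items()}
    while iterators:
        # Iterate over a copy of the keys so exhausted streams can be removed.
        for name in list(iterators):
            try:
                yield name, next(iterators[name])
            except StopIteration:
                del iterators[name]


def fake_stream(name: str, n: int) -> Iterator[dict]:
    """Placeholder generator standing in for a lazily loaded dataset."""
    for i in range(n):
        yield {"text": f"{name} example {i}"}


# Assumption: dataset names are taken from the PR description; sizes are made up.
mixed = list(
    interleave_streams(
        {"gsm8k": fake_stream("gsm8k", 3), "aqua_rat": fake_stream("aqua_rat", 2)}
    )
)
sources = [source for source, _ in mixed]
```

With real data, the placeholder generators could be swapped for any iterable that yields one example at a time; a training loop would then consume `interleave_streams(...)` batch by batch instead of materializing it with `list`.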
shivash changed pull request status to merged, Sep 23, 2025