Upload Maaza-MLM-135M-JSON-v1 - v1.0.0 production release
Browse files
README.md
CHANGED
|
@@ -65,7 +65,7 @@ Evaluated on 158 test cases across 24 schema types:
|
|
| 65 |
| **Field F1** | 0.520 |
|
| 66 |
| **Schema Compliance** | 41.1% |
|
| 67 |
| **Latency (CPU)** | 18.5 tokens/sec |
|
| 68 |
-
| **Training Time** |
|
| 69 |
|
| 70 |
### By Complexity Level
|
| 71 |
|
|
@@ -89,7 +89,7 @@ Evaluated on 158 test cases across 24 schema types:
|
|
| 89 |
## Training Data
|
| 90 |
|
| 91 |
### Dataset: EdgeJSON v3
|
| 92 |
-
- **Total Examples**: 787 (
|
| 93 |
- **Train Split**: 629 examples (80%)
|
| 94 |
- **Test Split**: 158 examples (20%)
|
| 95 |
- **Validation Rate**: 100% (all examples pass schema validation)
|
|
@@ -114,7 +114,7 @@ Output:
|
|
| 114 |
|
| 115 |
### Hardware
|
| 116 |
- **GPU**: NVIDIA RTX 4080 SUPER (16GB)
|
| 117 |
-
- **Training Time**:
|
| 118 |
- **Effective Batch Size**: 32 (4 per device × 8 gradient accumulation)
|
| 119 |
|
| 120 |
### Hyperparameters
|
|
@@ -286,7 +286,7 @@ If you use this model in your research, please cite:
|
|
| 286 |
|
| 287 |
### v1.0.0 (2025-11-20)
|
| 288 |
- Initial release
|
| 289 |
-
- Trained on EdgeJSON v3 dataset (
|
| 290 |
- 24.7% JSONExact, 0.520 Field F1
|
| 291 |
- LoRA fine-tuning (r=16, alpha=32)
|
| 292 |
- 48.7 second training time
|
|
|
|
| 65 |
| **Field F1** | 0.520 |
|
| 66 |
| **Schema Compliance** | 41.1% |
|
| 67 |
| **Latency (CPU)** | 18.5 tokens/sec |
|
| 68 |
+
| **Training Time** | <1 minute |
|
| 69 |
|
| 70 |
### By Complexity Level
|
| 71 |
|
|
|
|
| 89 |
## Training Data
|
| 90 |
|
| 91 |
### Dataset: EdgeJSON v3
|
| 92 |
+
- **Total Examples**: 787 (validated)
|
| 93 |
- **Train Split**: 629 examples (80%)
|
| 94 |
- **Test Split**: 158 examples (20%)
|
| 95 |
- **Validation Rate**: 100% (all examples pass schema validation)
|
|
|
|
| 114 |
|
| 115 |
### Hardware
|
| 116 |
- **GPU**: NVIDIA RTX 4080 SUPER (16GB)
|
| 117 |
+
- **Training Time**: <1 minute
|
| 118 |
- **Effective Batch Size**: 32 (4 per device × 8 gradient accumulation)
|
| 119 |
|
| 120 |
### Hyperparameters
|
|
|
|
| 286 |
|
| 287 |
### v1.0.0 (2025-11-20)
|
| 288 |
- Initial release
|
| 289 |
+
- Trained on EdgeJSON v3 dataset (validated)
|
| 290 |
- 24.7% JSONExact, 0.520 Field F1
|
| 291 |
- LoRA fine-tuning (r=16, alpha=32)
|
| 292 |
- 48.7 second training time
|