CycleCore-Technologies commited on
Commit
0bf576f
·
verified ·
1 Parent(s): 43a35f3

Upload Maaza-MLM-135M-JSON-v1 - v1.0.0 production release

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -65,7 +65,7 @@ Evaluated on 158 test cases across 24 schema types:
65
  | **Field F1** | 0.520 |
66
  | **Schema Compliance** | 41.1% |
67
  | **Latency (CPU)** | 18.5 tokens/sec |
68
- | **Training Time** | 48.7 seconds |
69
 
70
  ### By Complexity Level
71
 
@@ -89,7 +89,7 @@ Evaluated on 158 test cases across 24 schema types:
89
  ## Training Data
90
 
91
  ### Dataset: EdgeJSON v3
92
- - **Total Examples**: 787 (100% validated)
93
  - **Train Split**: 629 examples (80%)
94
  - **Test Split**: 158 examples (20%)
95
  - **Validation Rate**: 100% (all examples pass schema validation)
@@ -114,7 +114,7 @@ Output:
114
 
115
  ### Hardware
116
  - **GPU**: NVIDIA RTX 4080 SUPER (16GB)
117
- - **Training Time**: 48.7 seconds
118
  - **Effective Batch Size**: 32 (4 per device × 8 gradient accumulation)
119
 
120
  ### Hyperparameters
@@ -286,7 +286,7 @@ If you use this model in your research, please cite:
286
 
287
  ### v1.0.0 (2025-11-20)
288
  - Initial release
289
- - Trained on EdgeJSON v3 dataset (100% validated)
290
  - 24.7% JSONExact, 0.520 Field F1
291
  - LoRA fine-tuning (r=16, alpha=32)
292
  - 48.7 second training time
 
65
  | **Field F1** | 0.520 |
66
  | **Schema Compliance** | 41.1% |
67
  | **Latency (CPU)** | 18.5 tokens/sec |
68
+ | **Training Time** | <1 minute |
69
 
70
  ### By Complexity Level
71
 
 
89
  ## Training Data
90
 
91
  ### Dataset: EdgeJSON v3
92
+ - **Total Examples**: 787 (validated)
93
  - **Train Split**: 629 examples (80%)
94
  - **Test Split**: 158 examples (20%)
95
  - **Validation Rate**: 100% (all examples pass schema validation)
 
114
 
115
  ### Hardware
116
  - **GPU**: NVIDIA RTX 4080 SUPER (16GB)
117
+ - **Training Time**: <1 minute
118
  - **Effective Batch Size**: 32 (4 per device × 8 gradient accumulation)
119
 
120
  ### Hyperparameters
 
286
 
287
  ### v1.0.0 (2025-11-20)
288
  - Initial release
289
+ - Trained on EdgeJSON v3 dataset (validated)
290
  - 24.7% JSONExact, 0.520 Field F1
291
  - LoRA fine-tuning (r=16, alpha=32)
292
  - 48.7 second training time