gemma_2b_social_reasoning_reward_model

Files changed (3) hide show

README.md CHANGED Viewed

@@ -20,8 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6194
-- Accuracy: 0.6766
 ## Model description
@@ -55,18 +55,18 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.7925        | 0.24  | 10   | 0.6773          | 0.6182   |
-| 0.6702        | 0.48  | 20   | 0.6361          | 0.6532   |
-| 0.6146        | 0.72  | 30   | 0.6348          | 0.6673   |
-| 0.646         | 0.96  | 40   | 0.6261          | 0.6813   |
-| 0.5914        | 1.2   | 50   | 0.6224          | 0.6708   |
-| 0.5868        | 1.44  | 60   | 0.6430          | 0.6532   |
-| 0.5826        | 1.68  | 70   | 0.6401          | 0.6725   |
-| 0.5731        | 1.92  | 80   | 0.6309          | 0.6637   |
-| 0.5364        | 2.16  | 90   | 0.6383          | 0.6462   |
-| 0.5079        | 2.4   | 100  | 0.6386          | 0.6655   |
-| 0.4908        | 2.63  | 110  | 0.6389          | 0.6673   |
-| 0.5155        | 2.87  | 120  | 0.6387          | 0.6708   |
 ### Framework versions

 This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6131
+- Accuracy: 0.6713
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.7962        | 0.24  | 10   | 0.6976          | 0.6077   |
+| 0.6657        | 0.48  | 20   | 0.6464          | 0.6217   |
+| 0.6334        | 0.72  | 30   | 0.6330          | 0.6392   |
+| 0.6434        | 0.96  | 40   | 0.6218          | 0.6427   |
+| 0.5745        | 1.2   | 50   | 0.6119          | 0.6567   |
+| 0.5813        | 1.44  | 60   | 0.6282          | 0.6445   |
+| 0.5822        | 1.68  | 70   | 0.6350          | 0.6585   |
+| 0.5914        | 1.92  | 80   | 0.6294          | 0.6550   |
+| 0.5386        | 2.16  | 90   | 0.6341          | 0.6550   |
+| 0.5142        | 2.4   | 100  | 0.6330          | 0.6567   |
+| 0.4906        | 2.63  | 110  | 0.6313          | 0.6620   |
+| 0.5079        | 2.87  | 120  | 0.6309          | 0.6637   |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -21,8 +21,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:665eea71182bd80b48a75e0aa4929d699910e7075c6516833710fd05e56fc2fb
 size 14763488

 version https://git-lfs.github.com/spec/v1
+oid sha256:e11c77d9338dce646bb81953c72b8ec68fc314cb3baf01992e6144695ede31e7
 size 14763488