winnieyangwannan/entity_Llama-3.1-8B-Instruct_mlp-down_pnas_layer_16_4_all_37_0.001_2560_3 Text Generation • 8B • Updated Nov 23 • 3
winnieyangwannan/entity_Llama-3.1-8B-Instruct_mlp-down_pnas_layer_16_4_all_37_0.001_1280_3 Text Generation • 8B • Updated Nov 23 • 5
winnieyangwannan/entity_dpo_Llama-3.1-8B-Instruct_lora_0_lr_0.0001_beta_0.05_1280_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 23 • 6
winnieyangwannan/entity_sft_Llama-3.1-8B-Instruct_lora_0_lr_5e-06_1280_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 23 • 3
winnieyangwannan/entity_grpo_Llama-3.1-8B-Instruct_lora_0_lr_1e-05_beta_0_12800_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 20 • 4
winnieyangwannan/entity_grpo_Llama-3.1-8B-Instruct_lora_0_lr_5e-06_beta_0_12800_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 20 • 4
winnieyangwannan/entity_grpo_Llama-3.1-8B-Instruct_lora_0_lr_1e-05_beta_0_11520_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 19 • 3
winnieyangwannan/entity_grpo_Llama-3.1-8B-Instruct_lora_0_lr_5e-06_beta_0_11520_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 19 • 5
winnieyangwannan/entity_grpo_Llama-3.1-8B-Instruct_lora_0_lr_5e-06_beta_0_10240_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 19 • 4
winnieyangwannan/entity_grpo_Llama-3.1-8B-Instruct_lora_0_lr_1e-05_beta_0_10240_all_37_epoch_1_layer_all Text Generation • 8B • Updated Nov 19 • 5
winnieyangwannan/entity-visual-landmark_all_Qwen2.5-VL-7B-Instruct Viewer • Updated Sep 23 • 10k • 14
winnieyangwannan/entity-visual-worldcuisines_all_Qwen2.5-VL-7B-Instruct Viewer • Updated Sep 21 • 27k • 23