wgcyeo/ci-feedback_disallowed_ema_Llama-3.1-8B-Instruct_reverse_kl_ema0p999_ep30 Text Generation • 8B • Updated about 16 hours ago • 11
wgcyeo/ci-grpo_Llama-3.1-8B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • 8B • Updated about 19 hours ago
wgcyeo/ci-grpo_Llama-3.1-8B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • 8B • Updated about 19 hours ago
wgcyeo/ci-grpo_Llama-3.1-8B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30ref Text Generation • 8B • Updated about 19 hours ago
wgcyeo/ci-grpo_Llama-3.1-8B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30ref Text Generation • 8B • Updated about 19 hours ago
wgcyeo/ci-grpo_Olmo-3-7B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • 7B • Updated 1 day ago • 141
wgcyeo/ci-grpo_Olmo-3-7B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • 7B • Updated 1 day ago • 141
wgcyeo/ContextualIntegritySyntheticDataset_Llama-3.1-70B-Instruct_all Viewer • Updated 1 day ago • 729 • 12
wgcyeo/ContextualIntegritySyntheticDataset_Llama-3.1-70B-Instruct_all Viewer • Updated 1 day ago • 729 • 12
wgcyeo/ci-feedback_disallowed_ema_Llama-3.1-8B-Instruct_reverse_kl_ema0p999_ep30 Text Generation • 8B • Updated about 16 hours ago • 11
wgcyeo/ci-feedback_disallowed_ema_Olmo-3-7B-Instruct_reverse_kl_ema0p999_ep30 Text Generation • Updated 3 days ago • 11
wgcyeo/ci-feedback_disallowed_ema_Olmo-3-7B-Instruct_reverse_kl_ema0p999_ep30 Text Generation • Updated 3 days ago • 11