Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset Paper • 2511.15186 • Published Nov 19 • 25
When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling Paper • 2510.15346 • Published Oct 17 • 33
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning Paper • 2510.03259 • Published Sep 26 • 57
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding Paper • 2411.19527 • Published Nov 29, 2024 • 11
No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping Paper • 2509.21880 • Published Sep 26 • 52
ReviewScore: Misinformed Peer Review Detection with Large Language Models Paper • 2509.21679 • Published Sep 25 • 63
EMC2-Net: Joint Equalization and Modulation Classification based on Constellation Network Paper • 2303.10934 • Published Mar 20, 2023
SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data Augmentation Paper • 2312.05790 • Published Dec 10, 2023
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning Paper • 2410.08047 • Published Oct 10, 2024
Argunauts Training Phase III: RLVF with Hindsight Instruction Relabeling, Self-Correction and Dynamic Curriculum Article • Jul 24 • 2
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models Paper • 2505.17225 • Published May 22 • 64
syncIAL🍏: A Multi-Purpose Synthetic Debate and Argument Mapping Corpus Article • Feb 4 • 4