Stabilizing Reinforcement Learning with LLMs: Formulation and Practices • Paper • 2512.01374 • Published 10 days ago • 85
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence • Paper • 2511.18538 • Published 18 days ago • 256
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning • Paper • 2511.22570 • Published 14 days ago • 71
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length • Paper • 2512.04677 • Published 7 days ago • 166
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models • Paper • 2512.02556 • Published 9 days ago • 199
Everything is Context: Agentic File System Abstraction for Context Engineering • Paper • 2512.05470 • Published 6 days ago • 1
huawei-csl/Kimi-Linear-48B-A3B-Instruct-4bit-SINQ • Text Generation • 27B • Updated 17 days ago • 37 • 2