Peng Sun
voidmain
AI & ML interests
AI Infra
Recent Activity
upvoted
a
paper
about 1 month ago
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
authored
a paper
2 months ago
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via
Balanced Policy Optimization with Adaptive Clipping