3 11

Zhiheng Xi

WooooDyy

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

upvoted a paper 16 days ago

Memory in the Age of AI Agents

authored a paper 27 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

View all activity

Organizations

upvoted 2 papers 16 days ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published 17 days ago • 87

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 17 days ago • 120

authored a paper 27 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 28 days ago • 75

upvoted a paper 27 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 28 days ago • 75

commented a paper 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 19 •

upvoted a paper 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 19

commented 2 papers 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 19 •

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83 •

upvoted a paper 2 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83

upvoted a paper 4 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

upvoted 2 papers 6 months ago

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8, 2025 • 72

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4, 2025 • 23

authored a paper 11 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20, 2025 • 109

upvoted a paper 11 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20, 2025 • 109

authored a paper 12 months ago

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Paper • 2501.02506 • Published Jan 5, 2025 • 10

updated a dataset about 1 year ago

MathCritique/MathCritique-76k

Updated Nov 25, 2024 • 11 • 9

authored 4 papers about 1 year ago

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models

Paper • 2310.06762 • Published Oct 10, 2023 • 2

Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

Paper • 2310.11971 • Published Oct 18, 2023 • 1

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

Paper • 2305.14497 • Published May 23, 2023

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Paper • 2312.09979 • Published Dec 15, 2023 • 2

Zhiheng Xi

AI & ML interests

Recent Activity

Organizations

WooooDyy's activity