Byung-Kwan Lee

BK-Lee

https://sites.google.com/view/byungkwanlee

AI & ML interests

Vision-Language Models

Recent Activity

upvoted a paper 9 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 10 days ago • 48

upvoted 3 papers 15 days ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published 23 days ago • 143

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 16 days ago • 65

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published 17 days ago • 14

upvoted a paper about 1 month ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published Mar 2 • 7

upvoted 9 papers 3 months ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published Jan 14 • 26

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 54

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 229

SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

Paper • 2512.23162 • Published Dec 29, 2025 • 14

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Paper • 2512.20927 • Published Dec 24, 2025 • 17

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 43

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 42

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

upvoted 6 papers 4 months ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 95

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published Nov 20, 2025 • 29
