Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective Paper • 2509.22613 • Published Sep 26 • 9
Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective Paper • 2509.22613 • Published Sep 26 • 9 • 2
Complete Dictionary Learning via $\ell_p$-norm Maximization Paper • 2002.10043 • Published Feb 24, 2020
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation Paper • 2308.00906 • Published Aug 2, 2023 • 13
Sparse Mixture-of-Experts are Domain Generalizable Learners Paper • 2206.04046 • Published Jun 8, 2022 • 1
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation Paper • 2404.19394 • Published Apr 30, 2024
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models Paper • 2405.09220 • Published May 15, 2024 • 27
Can Graph Learning Improve Planning in LLM-based Agents? Paper • 2405.19119 • Published May 29, 2024
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models Paper • 2411.14982 • Published Nov 22, 2024 • 19
Omni-DNA: A Unified Genomic Foundation Model for Cross-Modal and Multi-Task Learning Paper • 2502.03499 • Published Feb 5 • 1
What Makes a Good Diffusion Planner for Decision Making? Paper • 2503.00535 • Published Mar 1 • 1
Habitizing Diffusion Planning for Efficient and Effective Decision Making Paper • 2502.06401 • Published Feb 10 • 1
SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification Paper • 2506.15569 • Published Jun 18 • 12
Multimodal-SAE Collection The collection of the sae that hooked on llava • 5 items • Updated Mar 4 • 8
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 287