CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data Paper • 2601.18026 • Published Jan 25
The PokeAgent Challenge: Competitive and Long-Context Learning at Scale Paper • 2603.15563 • Published 3 days ago • 10
Beyond One World: Benchmarking Super Heros in Role-Playing Across Multiversal Contexts Paper • 2510.14351 • Published Oct 16, 2025 • 2
Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs Paper • 2510.13586 • Published Oct 15, 2025 • 1
LLM Agent-Based Simulation of Student Activities and Mental Health Using Smartphone Sensing Data Paper • 2508.02679 • Published Jul 17, 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10, 2025 • 101