Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors Paper • 2503.22388 • Published Mar 28, 2025
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset Paper • 2402.04588 • Published Feb 7, 2024
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization Paper • 2402.11453 • Published Feb 18, 2024