BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software Paper โข 2509.25248 โข Published Sep 27, 2025 โข 3