Ebisu: Benchmarking Large Language Models in Japanese Finance • Paper • 2602.01479 • Published 3 days ago • 16
Same Claim, Different Judgment: Benchmarking Scenario-Induced Bias in Multilingual Financial Misinformation Detection • Paper • 2601.05403 • Published 27 days ago • 10
The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models • Paper • 2601.03425 • Published 29 days ago • 16