AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs Paper • 2511.20515 • Published Nov 25 • 3
AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs Paper • 2511.20515 • Published Nov 25 • 3
Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention Paper • 2509.09116 • Published Sep 11
VIR-Bench: Evaluating Geospatial and Temporal Understanding of MLLMs via Travel Video Itinerary Reconstruction Paper • 2509.19002 • Published Sep 23 • 2
Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs Paper • 2505.15075 • Published May 21 • 1