SwingBench

community

https://github.com/menik1126/Swing-Bench/

Recent Activity

kirailol authored a paper 1 day ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

kirailol authored a paper 20 days ago

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

kirailol authored a paper 20 days ago

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models

authored a paper 1 day ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 3 days ago • 138

authored 6 papers 20 days ago

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Paper • 2505.15929 • Published May 21, 2025 • 49

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models

Paper • 2508.03332 • Published Aug 5, 2025

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Paper • 2509.07403 • Published Sep 9, 2025 • 35

PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models

Paper • 2509.16989 • Published Sep 21, 2025 • 1

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Paper • 2505.23932 • Published May 29, 2025

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

submitted a paper to Daily Papers about 2 months ago

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

Paper • 2602.19895 • Published Feb 23 • 14

submitted a paper to Daily Papers 2 months ago

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Paper • 2602.06034 • Published Feb 5 • 8

updated a dataset 2 months ago

SwingBench/SwingBench

Viewer • Updated Feb 6 • 34.8k • 96

submitted a paper to Daily Papers 2 months ago

OVD: On-policy Verbal Distillation

Paper • 2601.21968 • Published Jan 29 • 4

published a dataset 2 months ago

SwingBench/SwingBench

Viewer • Updated Feb 6 • 34.8k • 96

authored 6 papers 3 months ago

ATTS: Asynchronous Test-Time Scaling via Conformal Prediction

Paper • 2509.15148 • Published Sep 18, 2025 • 1

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98

Efficient Diffusion Models: A Survey

Paper • 2502.06805 • Published Feb 3, 2025

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Paper • 2505.23932 • Published May 29, 2025

SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression

Paper • 2503.12340 • Published Mar 16, 2025

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

submitted a paper to Daily Papers 3 months ago

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

authored a paper 5 months ago

ATTS: Asynchronous Test-Time Scaling via Conformal Prediction

Paper • 2509.15148 • Published Sep 18, 2025 • 1