Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published 13 days ago • 145
AfriNLLB Collection AfriNLLB: Efficient Translation Models for African Languages • 11 items • Updated Feb 15 • 4
Nemotron-Terminal Collection We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated about 3 hours ago • 31
LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model Paper • 2603.01068 • Published 24 days ago • 22
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 26 days ago • 88
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 22 days ago • 147
Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper Paper • 2511.04583 • Published Nov 6, 2025 • 5
jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published Feb 17 • 26
IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering Paper • 2602.17687 • Published Feb 5 • 1
Article A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 • 25 days ago • 12
PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 29 days ago • 31
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 28 days ago • 100
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 27 days ago • 88
Article Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs • Jan 27 • 24