22 19 27

Cihang Xie

cihangxie

https://cihangxie.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

upvoted a paper 15 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

authored a paper 22 days ago

In-Context Reinforcement Learning for Tool Use in Large Language Models

View all activity

Organizations

authored a paper 22 days ago

In-Context Reinforcement Learning for Tool Use in Large Language Models

Paper • 2603.08068 • Published 25 days ago • 42

submitted a paper to Daily Papers 2 months ago

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Paper • 2601.15369 • Published Jan 21 • 21

authored 2 papers 7 months ago

AHELM: A Holistic Evaluation of Audio-Language Models

Paper • 2508.21376 • Published Aug 29, 2025 • 9

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published Sep 1, 2025 • 34

authored 16 papers 8 months ago

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs

Paper • 2311.16101 • Published Nov 27, 2023 • 1

Audio-Visual LLM for Video Understanding

Paper • 2312.06720 • Published Dec 11, 2023

Compress & Align: Curating Image-Text Data with Human Knowledge

Paper • 2312.06726 • Published Dec 11, 2023

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

Paper • 2312.11420 • Published Dec 18, 2023 • 2

SPFormer: Enhancing Vision Transformer with Superpixel Representation

Paper • 2401.02931 • Published Jan 5, 2024

iBOT: Image BERT Pre-Training with Online Tokenizer

Paper • 2111.07832 • Published Nov 15, 2021

Masked Autoencoders Enable Efficient Knowledge Distillers

Paper • 2208.12256 • Published Aug 25, 2022

SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training

Paper • 2211.11446 • Published Nov 21, 2022

Unleashing the Power of Visual Prompting At the Pixel Level

Paper • 2212.10556 • Published Dec 20, 2022

A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?

Paper • 2409.15277 • Published Sep 23, 2024 • 38

VHELM: A Holistic Evaluation of Vision Language Models

Paper • 2410.07112 • Published Oct 9, 2024 • 3

M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation

Paper • 2411.10433 • Published Nov 15, 2024

Cihang Xie

AI & ML interests

Recent Activity

Organizations

cihangxie's activity