Jianfei Chen

surfingtomchen

AI & ML interests

None yet

Organizations

None yet

authored a paper 7 months ago

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

Paper • 2505.19223 • Published May 25, 2025 • 9

authored a paper 8 months ago

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Paper • 2505.11594 • Published May 16, 2025 • 75

authored 2 papers 10 months ago

Identifying Sensitive Weights via Post-quantization Integral

Paper • 2503.01901 • Published Feb 28, 2025 • 8

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25, 2025 • 59

authored a paper 11 months ago

Visual Generation Without Guidance

Paper • 2501.15420 • Published Jan 26, 2025 • 8

authored 2 papers about 1 year ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 57

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Paper • 2410.02367 • Published Oct 3, 2024 • 50