DPN-LLaVA Collection Accelerating MLLMs through multi-layer dynamic pooling of experts. • 4 items • Updated 8 days ago • 1
Seedream 4.0: Toward Next-generation Multimodal Image Generation Paper • 2509.20427 • Published Sep 24, 2025 • 82
Dynamic Pyramid Network for Efficient Multimodal Large Language Model Paper • 2503.20322 • Published Mar 26, 2025 • 1
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate +2 Jun 13, 2024 • 61
CSGO: Content-Style Composition in Text-to-Image Generation Paper • 2408.16766 • Published Aug 29, 2024 • 18
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation Paper • 2407.00788 • Published Jun 30, 2024 • 23
Stable Diffusion Reference Only: Image Prompt and Blueprint Jointly Guided Multi-Condition Diffusion Model for Secondary Painting Paper • 2311.02343 • Published Nov 4, 2023 • 2