Agentic Systems in Radiology: Design, Applications, Evaluation, and Challenges Paper • 2510.09404 • Published Oct 10, 2025 • 1
Real-Time Inverse Kinematics for Generating Multi-Constrained Movements of Virtual Human Characters Paper • 2507.00792 • Published Jul 1, 2025 • 1
ImaGGen: Zero-Shot Generation of Co-Speech Semantic Gestures Grounded in Language and Image Input Paper • 2510.17617 • Published Oct 20, 2025 • 1
Conveying Meaning through Gestures: An Investigation into Semantic Co-Speech Gesture Generation Paper • 2510.17599 • Published Oct 20, 2025 • 1
Integrating Representational Gestures into Automatically Generated Embodied Explanations and its Effects on Understanding and Interaction Quality Paper • 2406.12544 • Published Jun 18, 2024
Augmented Co-Speech Gesture Generation: Including Form and Meaning Features to Guide Learning-Based Gesture Synthesis Paper • 2307.09597 • Published Jul 13, 2023
AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis Paper • 2305.01241 • Published May 2, 2023
Addressing Data Scarcity in Multimodal User State Recognition by Combining Semi-Supervised and Supervised Learning Paper • 2202.03775 • Published Feb 8, 2022
UniFusion: Vision-Language Model as Unified Encoder in Image Generation Paper • 2510.12789 • Published Oct 14, 2025 • 19
Merlin: A Vision Language Foundation Model for 3D Computed Tomography Paper • 2406.06512 • Published Jun 10, 2024 • 2
MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders Paper • 2502.14753 • Published Feb 20, 2025 • 1
SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning Paper • 2506.21355 • Published Jun 26, 2025 • 10
Expert-level validation of AI-generated medical text with scalable language models Paper • 2507.03152 • Published Jul 3, 2025
Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data Paper • 2508.16783 • Published Aug 22, 2025 • 1
Processing and acquisition traces in visual encoders: What does CLIP know about your camera? Paper • 2508.10637 • Published Aug 14, 2025 • 8
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions Paper • 2506.16679 • Published Jun 20, 2025 • 1
Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations Paper • 2303.09289 • Published Mar 16, 2023 • 2
Distilling Adversarial Prompts from Safety Benchmarks: Report for the Adversarial Nibbler Challenge Paper • 2309.11575 • Published Sep 20, 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation Paper • 2305.15296 • Published May 24, 2023 • 1