Shengfang Zhai

zsf

https://scholar.google.com/citations?user=bJYY-tIAAAAJ&hl=en

zhaisf

AI & ML interests

Trustworthy AI, Generative Models, AI Privacy, Backdoor Attacks

authored a paper 5 months ago

NaviDet: Efficient Input-level Backdoor Detection on Text-to-Image Synthesis via Neuron Activation Variation

Paper • 2503.06453 • Published Mar 9, 2025

authored a paper 8 months ago

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Paper • 2505.11049 • Published May 16, 2025 • 60

authored a paper 11 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30, 2025 • 88

authored 2 papers 12 months ago

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning

Paper • 2305.04175 • Published May 7, 2023 • 1

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Paper • 2405.14800 • Published May 23, 2024 • 1