Shengfang Zhai

zsf
https://scholar.google.com/citations?user=bJYY-tIAAAAJ&hl=en
  • zhaisf

AI & ML interests

Trustworthy AI, Generative Models, AI Privacy, Backdoor Attacks

Organizations

None yet

authored a paper 5 months ago

NaviDet: Efficient Input-level Backdoor Detection on Text-to-Image Synthesis via Neuron Activation Variation

Paper • 2503.06453 • Published Mar 9, 2025
authored a paper 8 months ago

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Paper • 2505.11049 • Published May 16, 2025 • 60
authored a paper 11 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30, 2025 • 88
authored 2 papers 12 months ago

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning

Paper • 2305.04175 • Published May 7, 2023 • 1

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Paper • 2405.14800 • Published May 23, 2024 • 1