Dataset-Tools's Collections

Models for dataset curation

updated Dec 5, 2024

  • HuggingFaceFW/fineweb-edu-classifier

    Text Classification • 0.1B • Updated Nov 17, 2024 • 27.2k • 204

    Note Classify texts based on their educational quality


  • minishlab/potion-base-8M

    Updated Sep 9, 2025 • 243k • 73

    Note A blazing-fast embedding generator


  • nvidia/domain-classifier

    Updated Sep 22, 2025 • 9.21k • 96

    Note A model to classify text according to different domains


  • nvidia/quality-classifier-deberta

    Updated Sep 22, 2025 • 1.3k • 73

    Note Classify texts based on their general quality


  • urchade/gliner_multi_pii-v1

    Token Classification • Updated Apr 20, 2024 • 49.7k • 147

    Note Identify and classify personally identifiable information (PII)


  • giacomoarienti/nsfw-classifier

    Image Classification • 85.8M • Updated Mar 26, 2025 • 20.1k • 47

  • Falconsai/nsfw_image_detection

    Image Classification • 85.8M • Updated Apr 6, 2025 • 50.5M • 955

  • PleIAs/celadon

    Text Classification • 0.1B • Updated Jun 12, 2025 • 16 • 35
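
The classifiers listed above are typically used to score each sample and keep only those above a cutoff. The following is a minimal sketch of that filtering step, not code taken from any of the model cards. `filter_by_score`, `toy_score`, and the threshold value are hypothetical names and values introduced for illustration; in practice `toy_score` would be replaced by a call to one of the listed models.

```python
from typing import Callable, List


def filter_by_score(
    texts: List[str],
    score_fn: Callable[[str], float],
    threshold: float,
) -> List[str]:
    """Keep only the texts whose score is at or above the threshold."""
    return [text for text in texts if score_fn(text) >= threshold]


def toy_score(text: str) -> float:
    # Hypothetical stand-in for a real classifier: scores a text by its
    # word count. Swap in a model-backed scorer for actual curation.
    return float(len(text.split()))


docs = [
    "short",
    "a somewhat longer educational passage about photosynthesis",
]

# Assumed cutoff for illustration only; choose one that fits the scorer.
kept = filter_by_score(docs, toy_score, threshold=3.0)
print(kept)
```

Keeping the scorer as a plain function argument means the same filter can be reused with a quality, domain, or NSFW classifier without changing the filtering logic.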