Whisper Medium Music-Speech Classifier

Bu model, Whisper Medium'un fine-tune edilmiş versiyonudur ve ses kayıtlarını müzik ve konuşma olarak sınıflandırır.

Model Detayları

  • Base Model: openai/whisper-medium
  • Task: Audio Classification (Music vs Speech)
  • Dataset: Aynursusuz/original_dataset
  • Learning Rate: 2e-4
  • Batch Size: 32
  • Epochs: 1
  • Accuracy: 1.0

Kullanım

from transformers import AutoFeatureExtractor, AutoModelForAudioClassification
import torch

feature_extractor = AutoFeatureExtractor.from_pretrained("Aynursusuz/whisper-medium-music-speech-classifier")
model = AutoModelForAudioClassification.from_pretrained("Aynursusuz/whisper-medium-music-speech-classifier")

# Ses dosyanızı yükleyin ve tahmin yapın

Eğitim Bilgileri

  • Optimizer: AdamW
  • Warmup Ratio: 0.1
  • FP16 Training: Evet
  • Gradient Accumulation Steps: 4

Label Mapping

  • 0: music
  • 1: speech
Downloads last month
-
Safetensors
Model size
0.3B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train Aynursusuz/whisper-medium-music-speech-classifier

Evaluation results