Whisper Medium Music-Speech Classifier
Bu model, Whisper Medium'un fine-tune edilmiş versiyonudur ve ses kayıtlarını müzik ve konuşma olarak sınıflandırır.
Model Detayları
- Base Model: openai/whisper-medium
- Task: Audio Classification (Music vs Speech)
- Dataset: Aynursusuz/original_dataset
- Learning Rate: 2e-4
- Batch Size: 32
- Epochs: 1
- Accuracy: 1.0
Kullanım
from transformers import AutoFeatureExtractor, AutoModelForAudioClassification
import torch
feature_extractor = AutoFeatureExtractor.from_pretrained("Aynursusuz/whisper-medium-music-speech-classifier")
model = AutoModelForAudioClassification.from_pretrained("Aynursusuz/whisper-medium-music-speech-classifier")
# Ses dosyanızı yükleyin ve tahmin yapın
Eğitim Bilgileri
- Optimizer: AdamW
- Warmup Ratio: 0.1
- FP16 Training: Evet
- Gradient Accumulation Steps: 4
Label Mapping
- 0: music
- 1: speech
- Downloads last month
- -
Dataset used to train Aynursusuz/whisper-medium-music-speech-classifier
Evaluation results
- Accuracy on original_datasetself-reported1.000