Hervé BREDIN
feat: update README
9bc673d
metadata
tags:
  - pyannote
  - pyannote-audio
  - pyannote-audio-pipeline
  - audio
  - voice
  - speech
  - speaker
  - speaker-diarization
  - speaker-change-detection
  - voice-activity-detection
  - overlapped-speech-detection

Precision-2 speaker diarization

This pipeline runs Precision-2 speaker diarization on pyannoteAI cloud.
Read the announcement blog post.

This pipeline is a stripped down version of pyannoteAI SDK that provides much more features:

  • speaker diarization optimized for speech-to-text
  • speaker voiceprinting and identification
  • confidence scores
  • and more...

A self-hosted version of Precision-2 is also available for enterprise customers.

Setup

  1. pip install pyannote.audio
  2. Create an API key on pyannoteAI dashboard (free credits included)

Usage

# initialize speaker diarization pipeline
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained(
    'pyannote/speaker-diarization-precision-2', 
    token="{pyannoteAI-api-key}")

# run speaker diarization on pyannoteAI cloud
output = pipeline("/path/to/audio.wav")

# enjoy state-of-the-art speaker diarization
for turn, speaker in output.speaker_diarization:
    print(f"start={turn.start:.1f}s stop={turn.end:.1f}s speaker_{speaker}")