speaker-diarization-precision-2 / README.md

Hervé BREDIN

metadata

tags:
  - pyannote
  - pyannote-audio
  - pyannote-audio-pipeline
  - audio
  - voice
  - speech
  - speaker
  - speaker-diarization
  - speaker-change-detection
  - voice-activity-detection
  - overlapped-speech-detection

`Precision-2` speaker diarization

This pipeline runs Precision-2 speaker diarization on pyannoteAI cloud.
Read the announcement blog post.

This pipeline is a stripped-down version of the pyannoteAI SDK, which provides many more features:

- speaker diarization optimized for speech-to-text
- speaker voiceprinting and identification
- confidence scores
- and more...

A self-hosted version of Precision-2 is also available for enterprise customers.

Setup

1. `pip install pyannote.audio`
2. Create an API key on the pyannoteAI dashboard (free credits included)
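
Once the API key exists, it is usually safer to keep it out of source code. The snippet below is a minimal sketch of one way to do that, assuming the key is exported in an environment variable. The variable name `PYANNOTEAI_API_KEY` and the helper `load_api_key` are hypothetical and are not part of pyannote.audio.

```python
import os


def load_api_key(env_var: str = "PYANNOTEAI_API_KEY") -> str:
    """Return the API key stored in `env_var`, or raise if it is missing.

    The default variable name is an assumption made for this example.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your pyannoteAI API key."
        )
    return key
```

The returned string can then be passed as the `token` argument of `Pipeline.from_pretrained` in place of a hardcoded key.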

Usage

# initialize speaker diarization pipeline
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained(
    "pyannote/speaker-diarization-precision-2",
    token="{pyannoteAI-api-key}")

# run speaker diarization on pyannoteAI cloud
output = pipeline("/path/to/audio.wav")

# enjoy state-of-the-art speaker diarization
for turn, speaker in output.speaker_diarization:
    print(f"start={turn.start:.1f}s stop={turn.end:.1f}s speaker_{speaker}")
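
A common next step is to summarize the turns printed above, for example as total speaking time per speaker. The following is a minimal sketch that works on plain `(start, end, speaker)` tuples so that it runs without an API key. The helper `speaking_time` and the example turns are hypothetical and only illustrate the idea.

```python
from collections import defaultdict


def speaking_time(turns):
    """Sum the duration of each speaker's turns.

    `turns` is any iterable of (start, end, speaker) tuples, in seconds.
    """
    totals = defaultdict(float)
    for start, end, speaker in turns:
        totals[speaker] += end - start
    return dict(totals)


# Hypothetical turns standing in for real pipeline output.
example = [(0.0, 2.5, "00"), (2.5, 4.0, "01"), (4.0, 7.0, "00")]
for speaker, seconds in sorted(speaking_time(example).items()):
    print(f"speaker_{speaker}: {seconds:.1f}s")
```

With real output, the same function can be fed from the loop shown above, for instance `speaking_time((turn.start, turn.end, speaker) for turn, speaker in output.speaker_diarization)`. Overlapping speech is counted once for each speaker involved, so the totals can add up to more than the audio duration.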
