vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Input
Configure the inputs for the AI model.

Task to perform: transcribe the audio in its spoken language, or translate it to English.

Audio file

Provide a Hugging Face access token (created at hf.co/settings/token) so that Pyannote.audio can diarise the audio clips. You must first accept the terms at 'https://huggingface.co/pyannote/speaker-diarization-3.1' and 'https://huggingface.co/pyannote/segmentation-3.0'.

Language spoken in the audio. Specify 'None' to perform language detection.

Timestamp granularity: Whisper supports both chunk-level and word-level timestamps.

Number of parallel batches to compute. Reduce this value if you run into out-of-memory (OOM) errors.

Use Pyannote.audio to diarise the audio clips. This also requires a Hugging Face token (hf_token).
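
The inputs above could be assembled programmatically along these lines. This is only a minimal sketch: the function name build_inputs, the field names (audio, task, language, timestamp, batch_size, diarise_audio, hf_token), the accepted values 'chunk' and 'word', and the default batch size of 24 are assumptions chosen for illustration, not a confirmed API of this page. It only builds and validates a dictionary and does not call any service.

```python
from typing import Any, Dict, Optional

# Assumed option names, for illustration only.
VALID_TASKS = {"transcribe", "translate"}
VALID_TIMESTAMPS = {"chunk", "word"}


def build_inputs(
    audio: str,
    task: str = "transcribe",
    language: Optional[str] = None,
    timestamp: str = "chunk",
    batch_size: int = 24,  # arbitrary illustrative default
    diarise_audio: bool = False,
    hf_token: Optional[str] = None,
) -> Dict[str, Any]:
    """Build a hypothetical input dictionary mirroring the form fields."""
    if task not in VALID_TASKS:
        raise ValueError(f"task must be one of {sorted(VALID_TASKS)}")
    if timestamp not in VALID_TIMESTAMPS:
        raise ValueError(f"timestamp must be one of {sorted(VALID_TIMESTAMPS)}")
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    # Diarisation relies on Pyannote.audio, which needs a Hugging Face token.
    if diarise_audio and not hf_token:
        raise ValueError("diarise_audio=True requires hf_token")

    inputs: Dict[str, Any] = {
        "audio": audio,
        "task": task,
        # The form uses the string 'None' to request language detection.
        "language": language if language is not None else "None",
        "timestamp": timestamp,
        "batch_size": batch_size,
        "diarise_audio": diarise_audio,
    }
    if hf_token:
        inputs["hf_token"] = hf_token
    return inputs
```

For example, build_inputs("talk.mp3", timestamp="word", batch_size=8) would request word-level timestamps with a smaller batch size, which may help if the default runs out of memory.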

Output
The generated output will appear here.

incredibly-fast-whisper - ikalos.ai