Issues with time stamps for long form transcription

#6
by Hossep - opened

I'm using the generic code from Whisper-medium to generate long-form transcription with timestamps, so that I can create subtitles

prediction = pipe(audio_path, batch_size=8, return_timestamps=True)["chunks"]

But when I use this fine-tuned model, I get the following error:

1411 if return_timestamps and not hasattr(generation_config, "no_timestamps_token_id"):
1412 raise ValueError(
1413 "You are trying to return timestamps, but the generation config is not properly set. "
1414 "Make sure to initialize the generation config with the correct attributes that are needed such as "

Sign up or log in to comment