Replies: 7 comments 71 replies
-
Try different --compute_type, --beam_size, --best_of, --initial_prompt. It is impossible for a set of parameters to be suitable for all audios. When encountering audios without punctuation marks, you can only try to adjust the parameters to achieve better results. |
Beta Was this translation helpful? Give feedback.
-
My advice: don't touch --intitial_prompt. |
Beta Was this translation helpful? Give feedback.
-
Isn't punctuation degrading after some time with your custom prompt? |
Beta Was this translation helpful? Give feedback.
-
Ok, here you go, the promised test build -> https://we.tl/t-wBSWVD8hQy New hardcoded initial_prompt presets:
So, I would want feedback on how new prompt defaults compares to the previous version, then check if |
Beta Was this translation helpful? Give feedback.
-
I tested a few dozen more audios in different languages, and also read here: https://github.com/openai/openai-cookbook/blob/main/examples/Whisper_prompting_guide.ipynb Then I feel that the sentences filled in by The impact of This is just my personal opinion after testing, there may be better solutions. I also hope there is a good way to solve the current problem. :) |
Beta Was this translation helpful? Give feedback.
-
Here is test4 -> https://we.tl/t-E0WZgFf0tR Added Test it on audio where it after some time starts transcribing with no punctuations. Check if it triggers and if punctuation recovers after it. Of course it should be used with reprompt. |
Beta Was this translation helpful? Give feedback.
-
r160.11 with new stuff is released. |
Beta Was this translation helpful? Give feedback.
-
Just as mentioned before in the subtitledit project, I encounter situations where there are no punctuation marks in a large portion of the text when transcribing some video files using Fast-whisper. I use large v2 model to transcribe only english video. the only parameter I add is “--sentence" to keep putting out compelet sentence to translate. All other parameters are set to default.
I have tried adding intitial_prompts such like --intitial_prompt "Hello, this is the first sentence. And this is the second one. A little pause ... and we are back."instructing it to add punctuation marks, but this problem still persists.
What methods or parameters need to be added to force consistent punctuation throughout the entire transcription?
Beta Was this translation helpful? Give feedback.
All reactions