Timestamp issues with using --language English to force transcription of non-English audio. #129

despairTK · 2023-12-02T02:23:42Z

despairTK
Dec 2, 2023

This problem is mainly for transcribing non-English audio. I have tested Japanese audio, Spanish audio, and Korean audio. If the language specified when generating subtitles is the same as the language of the transcribed audio (such as Japanese audio, set --language Japanese), the subtitle results will be There will be some selection errors for polysyllabic words. However, if you force the generated subtitles to be in English (such as Japanese audio, set --language English), you will get more accurate subtitle results than the original language of the audio. And the results obtained by using --language English to force the transcription of non-English audio are very different from --task translate, and are more accurate than --task translate. However, the timestamp may have incorrect positions or be of too short duration, and there may also be repeated hallucinations.

It would be even better if the timestamps could be normal when using --language English to force transcribe non-English audio. Otherwise, you can only manually correct the timestamp of each line of subtitles.

Hallucinations and timestamp issues that may occur:

Answered by Purfview

Dec 2, 2023

Not so accurate timestamps on the translations is known thing. It think I even added a warning about it.

I've no idea about your observations with --language English vs proper language and --task=translate.
Better ask there -> https://github.com/openai/whisper/discussions

View full answer

Purfview · 2023-12-02T03:20:29Z

Purfview
Dec 2, 2023
Maintainer

Not so accurate timestamps on the translations is known thing. It think I even added a warning about it.

I've no idea about your observations with --language English vs proper language and --task=translate.
Better ask there -> https://github.com/openai/whisper/discussions

1 reply

despairTK Dec 2, 2023
Author

OK, thank you very much for your answer.

Overall, my translation abilities are: Japanese>English>Korean>Other languages. It’s not that I don’t understand English, it’s just that I can’t better express some professional words in the professional field when I feedback some questions to you. After all, I rarely come into contact with such professional words in daily translation. I can still understand ordinary, common sentences and words very well.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Timestamp issues with using --language English to force transcription of non-English audio. #129

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment 1 reply

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Timestamp issues with using --language English to force transcription of non-English audio. #129

despairTK Dec 2, 2023

Replies: 1 comment · 1 reply

Purfview Dec 2, 2023 Maintainer

despairTK Dec 2, 2023 Author

despairTK
Dec 2, 2023

Replies: 1 comment 1 reply

Purfview
Dec 2, 2023
Maintainer

despairTK Dec 2, 2023
Author