Whisper was trained on a lot of YouTube videos, so it tends to output phrases like "Thanks for watching!", "Thanks!", "That's all," etc. when there is no intelligible speech in the audio. Applying VAD to remove the non-speech audio reduced the issue to a great extent, but it still appears from time to time.