Question about Silero VAD implementation and chunking #882
Unanswered
ngcheeyuan
asked this question in
Q&A
Replies: 1 comment
-
@ngcheeyuan , hello. You can enable Silero VAD by setting the option |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I can't quite find information on how VAD is implemented in faster-whipser. Some details will help.
For example :
Do you all chop up the audio into 30 seconds chunk, run VAD on each of those chunks, remove the silence portion if it's below min silenece threshold, and transcribe them?
Or is there something similar to WhisperX being performed?
https://github.com/m-bain/whisperX
Where they merge smaller chunks before transcribing them.
Thanks.
Beta Was this translation helpful? Give feedback.
All reactions