-
Notifications
You must be signed in to change notification settings - Fork 161
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The length of the audio after noise reduction is shortened #71
Comments
Here comes another problem |
用Pyaudio或者sounddevice实现,每次还是喂给模型512,每次替换512里面的256即可。 |
Hello, I am looking for a fast noise reduction model, yours just meets my requirements, thank you for your efforts.
I'm using this model to do preprocessing for my speech recognition model. My processing process is to receive part of the audio through websocket, then denoise, then perform VAD, and then splicing the entire audio for speech recognition processing, so I perform denoising. Noisy audio is only part of an entire sentence.
My requirement is that multiple segments of audio after voice segment noise reduction can be spliced together perfectly, but after I use real_time_processing_tf_lite.py to process the sound, there will be blank parts in the spliced audio, causing the sound to freeze.
In the figure below, the upper part is the audio before processing, and the lower part is the audio after processing. It can be seen that the audio after processing is the same length as the audio before processing, but the part with waveform is shorter, so the two parts after processing Audio cannot be stitched together directly.
Can multiple segments of noise-reduced audio be spliced together perfectly? I'm new to coding and I'm not very familiar with it. Can you help me realize it?
The text was updated successfully, but these errors were encountered: