Hi @juanmc2005!
I want to save the segments that the model predicts as containing speech. The model detects in real time when someone is talking, and I specifically want to save the audio of those segments the model labels as speech. Please save these detected sounds in WAV format.
The SpeakerDiarization pipeline already provides the waveform aligned with the current diarization output.
The StreamingInference class lets you execute some code whenever a new "output-audio" pair is available.
You can achieve this with the attach_hooks() method by passing a function to execute whenever a new tuple[Annotation, SlidingWindowFeature] is available. Then it's a matter of cropping the audio according to the speech regions in the annotation.
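To make this concrete, here is a minimal sketch of such a hook. It assumes diart's default 16 kHz sample rate, and the output directory name `speech_segments/` plus the helper names (`write_wav`, `save_speech`) are purely illustrative, not part of diart's API. The constructor arguments of `MicrophoneAudioSource` have changed between diart versions, so check them against the version you have installed:

```python
"""Sketch: save speech segments detected by diart as WAV files.

Assumes diart's default 16 kHz sample rate; directory and helper
names are illustrative, not part of the diart API.
"""
import wave
from pathlib import Path

import numpy as np

SAMPLE_RATE = 16000  # diart's default; adjust if you configured another rate
OUT_DIR = Path("speech_segments")  # illustrative output location
_counter = 0


def write_wav(path: Path, samples: np.ndarray, sample_rate: int = SAMPLE_RATE) -> None:
    """Write a mono float waveform in [-1, 1] as a 16-bit PCM WAV file."""
    pcm = (np.clip(samples, -1.0, 1.0) * 32767).astype(np.int16)
    with wave.open(str(path), "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(pcm.tobytes())


def save_speech(prediction) -> None:
    """Hook for attach_hooks(): crop and save each speech segment of a chunk.

    `prediction` is a tuple[Annotation, SlidingWindowFeature], i.e. the
    diarization of the current chunk plus the aligned waveform.
    """
    global _counter
    annotation, waveform = prediction
    OUT_DIR.mkdir(exist_ok=True)
    chunk_start = waveform.extent.start  # chunk start time in seconds
    audio = waveform.data[:, 0]  # mono samples of this chunk
    # Merge all speakers into plain "speech" regions, then crop each one
    for segment in annotation.get_timeline().support():
        lo = int(round((segment.start - chunk_start) * SAMPLE_RATE))
        hi = int(round((segment.end - chunk_start) * SAMPLE_RATE))
        lo, hi = max(lo, 0), min(hi, len(audio))
        if hi > lo:
            write_wav(OUT_DIR / f"speech_{_counter:05d}.wav", audio[lo:hi])
            _counter += 1


if __name__ == "__main__":
    from diart import SpeakerDiarization
    from diart.inference import StreamingInference
    from diart.sources import MicrophoneAudioSource

    pipeline = SpeakerDiarization()
    source = MicrophoneAudioSource()  # arguments depend on your diart version
    inference = StreamingInference(pipeline, source)
    inference.attach_hooks(save_speech)
    inference()  # runs until the source is exhausted or interrupted
```

One caveat with this approach: consecutive streaming chunks overlap, so the same speech region can appear in several predictions and be written more than once. Deduplicating (e.g. by only keeping the non-overlapping tail of each chunk, or by buffering and merging segments before writing) is left to the reader.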