voice_segmentor

A set of scripts that reads media -> detect voice from the media -> create wav for each sentences

Scripts

`folder_convert_media_to_wav.py`

convert all non-wav media into wav in a given folder it uses ffmpeg internally as that's performing the best (most stable, and compatible with any) ffmpeg uses default which is 16 bit PCM, 48Khz, 2ch

Usage:

python folder_convert_media_to_wav.py -f <folder name>

`segment-media-vad-onx.py`

Reads either a single wave file or all the wave files in a specific folder, and run Voice Activity Dector (with onx option enabled) to generate a folder that has each sentences as wave file it internally uses https://github.com/snakers4/silero-vad for Voice activity Detection

Usage: For a single wave file

python segment-media-vad-onx.py <wave file>

For all wave file in a folder

python segment-media-vad-onx.py -f <folder>

`labeling-with-whipser.py`

Read a folder that has list of wave files (assuming it's a segmented wav files by segment-media-vad-onx.py`,

create a excel file that has the same name as the folder
add header row of Filename (first column), Transcription (second column) to the first row
for the first column, fill up all the rows withe the wav file names
for the second column, fill up all the rows by running whisper to the corresponding wav file
once done, save that to the excel file

Usage: For all wave file in a folder

python labeling-with-whipser.py -f <folder>

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
folder_convert_media_to_wav.py		folder_convert_media_to_wav.py
labeling-with-whipser.py		labeling-with-whipser.py
segment-media-vad-onx.py		segment-media-vad-onx.py
waves-in-the-folder.py		waves-in-the-folder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

voice_segmentor

Scripts

`folder_convert_media_to_wav.py`

`segment-media-vad-onx.py`

`labeling-with-whipser.py`

About

Releases

Packages

Contributors 2

Languages

ysshin/voice_segmentor

Folders and files

Latest commit

History

Repository files navigation

voice_segmentor

Scripts

folder_convert_media_to_wav.py

segment-media-vad-onx.py

labeling-with-whipser.py

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

`folder_convert_media_to_wav.py`

`segment-media-vad-onx.py`

`labeling-with-whipser.py`

Packages