Audio transcript extraction #8
Labels
enhancement
New feature or request
good first issue
Good for newcomers
help wanted
Extra attention is needed
Looking to support
mp3
,wav
Audio is not standard in commercial multimodal models today in 2024. Because of this, I am also looking to transcribe audio to text, probably via Whisper.
The text was updated successfully, but these errors were encountered: