Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Audio transcriber #25

Open
wants to merge 80 commits into
base: dev
Choose a base branch
from
Open

Audio transcriber #25

wants to merge 80 commits into from

Conversation

damielulade
Copy link
Collaborator

In src/lib/transcriber.ts:

  • transcribe() is the exported function to integrate into the next stage of the project
  • extract_audio() used fluent-ffmpeg to take in a .mp4 video from the filesystem and output a .mp3 audio file back to the filesystem.
  • run_query() imports an audio file, and queries the Whisper-large-v3 Inference API to transcribe the video.

We need to find out the limit of file size/audio length that the fast prototyping can take in, so we can run the API in chunks (this slows down the code significantly). The other options are: paying for the development API to run longer videos; or taking the API as a python module (ask @VrishYT for more details on that).

rishi-khiroya and others added 30 commits January 15, 2024 15:51
Add textarea for transcript input, add summarise button which extracts transcript. Currently only alerts, summary not yet implemented.
Connect GPT 3.5 API in orde to mae summary requests. Not yet linked to frontend.
openai-test.js can now be run with nodes openai-test.js "your-transcript-here" to output the summary of the argument given.
Extract the message content from the completion returned by the GPT API rather than the entire response
Added an abstract class that is extended by the classes that contain transcripts of the whole lecture and transcripts split by slides
define plan for document format based on classes in IR branch. (These classes such as Transcript, Slide etc. are not present yet on this branch.)

classes Slide, Transcript etc. are not yet on this branch
Move openai-test.js code into Summariser class to use with summariser tests
Change Summariser argument type from string to String
Resolve summary in summariser_tests.js and set to new variable if it isn't null (initialised to empty string).
@damielulade damielulade self-assigned this Jan 19, 2024
Copy link

vercel bot commented Jan 19, 2024

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
summentia ❌ Failed (Inspect) Jan 22, 2024 1:37pm

This was linked to issues Jan 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In review
Development

Successfully merging this pull request may close these issues.

Implement audio transcriber Split video into audio and video
4 participants