Context - You have been hired by a company that analyzes internet data in order to monetize it. They have asked you to build a solution that analyzes YouTube videos of panel discussions. Specifically, you have to run speech-to-text on the audio to obtain a transcript and then summarize that transcript.

snehvora/YouTube-panel-discussions-summariser

YouTube panel discussions summariser

Problem-4 (NLP: Panel discussion summarization)

Context:

You have been hired by a company that analyzes internet data in order to monetize it. They have asked you to build a solution that analyzes YouTube videos of panel discussions. Specifically, you have to run speech-to-text on the audio to obtain a transcript and then summarize that transcript.

Technical details:

Input: YouTube URL of a panel discussion

Output: Textual summary of the discussion

Required Libraries :

--> !pip install SpeechRecognition // converts audio to text

--> !pip install youtube_dl // downloads the audio track of the YouTube video

--> !pip install pydub // converts the mp3 file to wav format

--> os // part of the Python standard library, so no pip install is needed; used to save, rename, or delete the mp3 file

--> !pip3 install git+https://github.com/ernie-mlg/rpunct.git // adds punctuation to the transcript

--> !pip install gensim // summarizes the whole text (the gensim.summarization module was removed in gensim 4.0, so a 3.x release is needed if the code relies on it)
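
Before running the notebook it can help to confirm that the packages above are importable. The snippet below is a minimal sketch, not part of the original code: the helper names `missing_modules` and `ffmpeg_available` are hypothetical, and the module names are the import names these packages normally expose (for example, SpeechRecognition is imported as `speech_recognition`). The ffmpeg check is included on the assumption that pydub and youtube_dl need it to handle mp3 files.

```python
import importlib.util
import shutil

# Import names (not pip names) of the third-party packages listed above.
REQUIRED_MODULES = ["speech_recognition", "youtube_dl", "pydub", "rpunct", "gensim"]


def missing_modules(names):
    """Return the names that cannot be found on the current Python path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def ffmpeg_available():
    """Return True if an ffmpeg executable is on the PATH."""
    return shutil.which("ffmpeg") is not None


# Example use in a notebook cell:
# print("Missing packages:", missing_modules(REQUIRED_MODULES))
# print("ffmpeg found:", ffmpeg_available())
```

An empty list from `missing_modules(REQUIRED_MODULES)` means every package was found; any name it returns still needs to be installed.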

Approach :

--> I have divided this project into four phases:
- Download the audio in mp3 format.
- Convert the audio to text using the SpeechRecognition library.
- Add punctuation to the transcript.
- Use a summarizer to summarize the whole transcript.
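
The last three phases can be sketched roughly as follows. This is an illustration under stated assumptions rather than the notebook's actual code: the function names are hypothetical, the audio is assumed to be split into one-minute chunks before recognition (long recordings are commonly split this way), Google's free web recognizer is assumed as the speech backend, and the summarizer is assumed to be `gensim.summarization.summarize` from gensim 3.x. The rpunct fork is assumed to expose the same `RestorePuncts` class as the upstream package.

```python
def chunk_bounds(total_ms, chunk_ms=60_000):
    """Return (start, end) millisecond pairs that cover the whole recording."""
    if chunk_ms <= 0:
        raise ValueError("chunk_ms must be positive")
    return [(start, min(start + chunk_ms, total_ms))
            for start in range(0, total_ms, chunk_ms)]


def transcribe(mp3_path, wav_path="chunk.wav", chunk_ms=60_000):
    """Phase 2: convert the mp3 to wav chunk by chunk and recognize each chunk."""
    # Imported lazily so chunk_bounds works without these packages installed.
    import os
    import speech_recognition as sr
    from pydub import AudioSegment

    recognizer = sr.Recognizer()
    audio = AudioSegment.from_mp3(mp3_path)  # decoding mp3 needs ffmpeg
    pieces = []
    for start, end in chunk_bounds(len(audio), chunk_ms):
        # pydub slices by milliseconds; each slice is written as a temporary wav.
        audio[start:end].export(wav_path, format="wav")
        with sr.AudioFile(wav_path) as source:
            clip = recognizer.record(source)
        try:
            pieces.append(recognizer.recognize_google(clip))
        except sr.UnknownValueError:
            # Nothing intelligible in this chunk (silence, music, crosstalk).
            continue
    if os.path.exists(wav_path):
        os.remove(wav_path)
    return " ".join(pieces)


def punctuate_and_summarize(raw_text, ratio=0.2):
    """Phases 3 and 4: restore punctuation, then extract a summary."""
    from rpunct import RestorePuncts
    from gensim.summarization import summarize  # assumption: gensim 3.x

    punctuated = RestorePuncts().punctuate(raw_text)
    return summarize(punctuated, ratio=ratio)
```

Punctuation comes before summarization because an extractive summarizer needs sentence boundaries to pick sentences from, and raw speech recognition output has none. The `ratio` value of 0.2 is only a placeholder and would need tuning per video.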

How to run the code :

--> Open this file in Google Colab and run each cell in order.

How to test the code :

--> To test with a different video, replace the URL in this line with the link to your video: "ydl.download(['https://www.youtube.com/watch?v=xb98qYIfNZ4'])".
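
For context, a download cell built around that `ydl.download` call might look like the sketch below. The option values follow the audio-extraction pattern documented for youtube_dl, but the exact options in the notebook are not shown here, so treat them as assumptions; `build_ydl_options` and `download_audio` are hypothetical helper names, and the mp3 post-processing step assumes ffmpeg is installed.

```python
def build_ydl_options(output_stem="audio"):
    """Options asking youtube_dl for the best audio stream, converted to mp3."""
    return {
        "format": "bestaudio/best",
        "outtmpl": output_stem + ".%(ext)s",
        "postprocessors": [{
            "key": "FFmpegExtractAudio",   # requires ffmpeg
            "preferredcodec": "mp3",
            "preferredquality": "192",
        }],
    }


def download_audio(video_url, output_stem="audio"):
    """Phase 1: download the audio track and return the expected mp3 path."""
    import youtube_dl  # imported lazily; needs network access when called

    with youtube_dl.YoutubeDL(build_ydl_options(output_stem)) as ydl:
        ydl.download([video_url])
    return output_stem + ".mp3"


# Example use in a notebook cell:
# mp3_path = download_audio("https://www.youtube.com/watch?v=xb98qYIfNZ4")
```

Wrapping the call in a function means the video link is passed as an argument, so testing another panel discussion only requires changing one string.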
