Skip to content

pkirti33/video_classification_pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

video_classification_pytorch

This repository features a Google Colab notebook where I explore multi-class video classification using the UCF11 video dataset. The dataset is available here on Kaggle.

In this project, I built a mini 3D Convolutional Neural Network inspired by Pytorch's 18-layer 3D Resnet Model and trained it using two different methods:

  1. Custom Dimensions Transformation: All videos were resized to 100x100 pixels. This approach achieved a 32% test accuracy over 10 epochs, taking approximately 1 hour.
  2. Inward Cropping Transformation: Videos were cropped inward by 10% of their original length to focus on the most crucial parts. This method resulted in a 54% test accuracy over 10 epochs, taking about 1.4 hours.

Next Steps

Future steps include experimenting with other transformations, tuning hyperparameters, and training over more epochs.

Graphs

Accuracy and Loss Time

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published