This program takes factors such as actors, directors, genres, IMDB ratings, release year, and budget, and predicts an estimate of the revenue a movie can collect in the country where it is released.

The data science part is written in Python. It uses random forest regression to predict revenue and success.
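As a rough illustration of random forest regression on this kind of problem, here is a minimal sketch. It assumes scikit-learn and NumPy are installed, which the README does not state, and it uses synthetic numbers in place of the real dataset. The feature choice (budget, rating, year) and the formula used to generate revenue are made up purely so the example runs on its own.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (NOT the IMDB dataset): budget, rating, release year.
rng = np.random.default_rng(0)
n = 500
budget = rng.uniform(1e6, 2e8, n)
rating = rng.uniform(1.0, 10.0, n)
year = rng.integers(1980, 2017, n)
X = np.column_stack([budget, rating, year])

# Hypothetical relationship, chosen only so the model has something to learn.
revenue = budget * (0.5 + 0.2 * rating) + rng.normal(0, 1e6, n)

# Hold out 20% of the rows to check how well the model generalizes.
X_train, X_test, y_train, y_test = train_test_split(
    X, revenue, test_size=0.2, random_state=0
)

model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# For a regressor, score() returns the R^2 coefficient on the held-out rows.
r2 = model.score(X_test, y_test)

# Predict revenue for a single held-out movie.
predicted = model.predict(X_test[:1])
print(f"R^2 on held-out data: {r2:.3f}")
print(f"Predicted revenue for one movie: {predicted[0]:.0f}")
```

Categorical inputs such as actors, directors, and genres would need to be encoded as numbers before they could be passed to a model like this; that step is left out of the sketch.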

The preprocessingAndAccuracy.py file cleans and modifies the IMDB metadata dataset, then trains the model and reports its accuracy.
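The kind of cleaning described here might look something like the sketch below. It is not taken from preprocessingAndAccuracy.py: the column names (`budget`, `gross`, `imdb_score`), the helper `clean_rows`, and the sample rows are all assumptions for illustration. It drops rows whose numeric fields are missing or unparseable and converts the rest to floats.

```python
import csv
import io

# Assumed column names; the real dataset's columns may differ.
NUMERIC_COLUMNS = ("budget", "gross", "imdb_score")


def clean_rows(rows):
    """Keep rows whose numeric columns all parse, converting them to float."""
    cleaned = []
    for row in rows:
        try:
            values = {col: float(row[col]) for col in NUMERIC_COLUMNS}
        except (KeyError, TypeError, ValueError):
            # Missing column, empty cell, or non-numeric text: skip the row.
            continue
        cleaned.append({**row, **values})
    return cleaned


# Made-up sample rows standing in for the real CSV file.
SAMPLE_CSV = """movie_title,budget,gross,imdb_score
Example A,1000000,2500000,7.1
Example B,,400000,5.0
Example C,2000000,not available,6.3
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
cleaned = clean_rows(rows)
print(f"kept {len(cleaned)} of {len(rows)} rows")
```

Dropping incomplete rows is the simplest policy; filling gaps with a median or similar value is another common option when too many rows would otherwise be lost.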

The main.py file takes the cleaned data and, based on the input values, predicts the revenue in the released country. It reports a success if the predicted revenue is at least 50% more than the budget.
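The success rule can be written as a one-line comparison. The function name `is_success` and the check for a positive budget are assumptions for this sketch, not code taken from main.py.

```python
def is_success(predicted_revenue, budget):
    """Return True when revenue is at least 50% more than the budget."""
    if budget <= 0:
        raise ValueError("budget must be positive")
    return predicted_revenue >= 1.5 * budget


# A movie with a budget of 100 needs revenue of at least 150 to count.
print(is_success(150, 100))
print(is_success(149, 100))
```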

The datasets were obtained from Kaggle and were scraped from IMDB. The data has 5000 entries.