This was the second task completed as a Python for ML/AI Intern at Elite Techno Groups.
- The "Summer Olympics Dataset" covers the years 1896-2012.
- It contains data about athletes, such as their `Name`, `Gender`, the `Country` they represent, the `Medal` (Gold/Silver/Bronze) they won, and the `Sport`, `Discipline` and `Event` in which they won it.
- It also contains the `City` names and the `Year` in which each Olympics was held.
- The dataset was analysed, and the 8 major questions asked by the instructor were answered using two major Python analysis libraries, `NumPy` and `Pandas`. The questions that ask for a Top 5 were also answered with bar plots, using two major Python visualization libraries, `Matplotlib` and `Seaborn`.
- The eight major questions answered by the data analysis are:
- In how many cities have the Summer Olympics been held so far?
- Which sport has the most Gold Medals so far? (Top 5)
- Which sport has the most medals so far? (Top 5)
- Which player has won the most medals? (Top 5)
- Which player has won the most Gold Medals? (Top 5)
- In which year did India win its first Gold Medal in the Summer Olympics?
- Which event is the most popular in terms of number of players? (Top 5)
- Which sport has the most female Gold Medalists? (Top 5)
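
Questions of this kind can be sketched with a few Pandas calls. The snippet below is a minimal illustration and not the notebook's actual code. The column names (`Year`, `City`, `Sport`, `Athlete`, `Country`, `Gender`, `Event`, `Medal`), the `Men`/`Women` gender labels, the helper `top_n` and every row of the toy table are assumptions made up for the example.

```python
import pandas as pd

# Toy rows invented purely for illustration; they are NOT from the real dataset.
# The column names and the "Men"/"Women" labels are assumptions as well.
df = pd.DataFrame(
    [
        (1900, "City A", "Sport X", "Athlete 1", "AAA", "Men", "Event P", "Gold"),
        (1900, "City A", "Sport X", "Athlete 2", "BBB", "Men", "Event P", "Silver"),
        (1904, "City B", "Sport Y", "Athlete 3", "AAA", "Women", "Event Q", "Gold"),
        (1904, "City B", "Sport X", "Athlete 1", "AAA", "Men", "Event P", "Gold"),
        (1908, "City A", "Sport Y", "Athlete 4", "CCC", "Women", "Event Q", "Bronze"),
        (1908, "City A", "Sport X", "Athlete 1", "AAA", "Men", "Event R", "Silver"),
    ],
    columns=["Year", "City", "Sport", "Athlete", "Country", "Gender", "Event", "Medal"],
)


def top_n(frame, column, n=5):
    """Hypothetical helper: the n most frequent values in `column`, with counts."""
    return frame[column].value_counts().head(n)


# Number of distinct host cities.
n_cities = df["City"].nunique()

# Keep only the gold-medal rows; several questions start from this subset.
gold = df[df["Medal"] == "Gold"]

# Sports with the most gold medals, and with the most medals overall.
top_gold_sports = top_n(gold, "Sport")
top_sports = top_n(df, "Sport")

# Athletes with the most medals, and with the most gold medals.
top_athletes = top_n(df, "Athlete")
top_gold_athletes = top_n(gold, "Athlete")

# Earliest year in which a given country (placeholder code "AAA") won gold.
first_gold_year = gold.loc[gold["Country"] == "AAA", "Year"].min()

# Events ranked by the number of distinct athletes who won a medal in them.
top_events = (
    df.groupby("Event")["Athlete"].nunique().sort_values(ascending=False).head(5)
)

# Sports with the most female gold medalists.
top_female_gold_sports = top_n(gold[gold["Gender"] == "Women"], "Sport")
```

With the real data, `df` would instead be loaded from the dataset file and `"AAA"` replaced by whatever value that file uses for India. Each Top 5 result is a Pandas `Series`, so its index and values can be passed directly to a Matplotlib or Seaborn bar plot.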
- Python
- Libraries Used
  - NumPy
  - Pandas
  - Matplotlib
  - Seaborn
- Jupyter Notebook: For data analysis
- Git: To push the project file to GitHub