This file contains a detailed data dictionary for a Spotify Most Streamed Songs dataset, providing information on track details, streaming metrics across platforms, and musical attributes. It serves as a guide for analyzing trends in music popularity and composition. This project not only explores the depths of 2023's chart-toppers but also leverages Python and the Spotify Web APIs.Here Python script is used that interacts with the Spotify Web API and a pandas DataFrame to retrieve track details including the cover URL for a list of tracks provided in an Excel file.Using a pandas DataFrame, we retrieve intricate track details, from beats and danceability to album cover In this project, we meticulously calculate average streams per year, dissect streams by track and artist names, and analyze trends based on release dates, top songs, their danceability, valence, energy levels, and instrumental content🎵📊🔍
This dataset contains comprehensive information on some of the most streamed songs on Spotify, enriched with additional insights from other popular streaming platforms like Apple Music, Deezer, and Shazam. It is ideal for music analysts, data scientists, and machine learning enthusiasts who are interested in exploring trends and characteristics of popular music tracks.
Here is the meaning of each column in the dataset:
- track_name: Name of the song.
- artist(s)_name: Name of the artist(s) performing the song.
- artist_count: Number of artists contributing to the song.
- released_year, released_month, released_day: Release date details.
- in_spotify_playlists: Number of Spotify playlists the song is featured in.
- in_spotify_charts: Rank of the song on Spotify charts.
- streams: Total number of streams on Spotify.
- in_apple_playlists, in_apple_charts: Presence in Apple Music playlists and charts.
- in_deezer_playlists, in_deezer_charts: Presence in Deezer playlists and charts. 10.in_shazam_charts: Rank on Shazam charts.
- bpm: Beats per minute, representing the tempo of the song.
- key: Key of the song.
- mode: Indicates whether the song is in a major or minor mode.
- danceability_%: Suitability of the song for dancing.
- valence_%: Positivity of the song’s musical content.
- valence_%: Positivity of the song’s musical content.
- acousticness_%: Acoustic sound presence in the song.
- instrumentalness_%: Proportion of instrumental content in the track.
- liveness_%: Presence of live performance elements.
- speechiness_%: Amount of spoken words in the song.
- cover_url: Here we can see the url of the song.
Visual Studio Code (VS Code).
- Importing Dataset.
- Exploratory Data Analysis(EDA)
- Trating Null Values
- Seperating Numerical and Categorical Columns in Dataset.
- Description of Dataset.
- Identifying The Outliers.
- Removing Outliers.
- Checking Skewness
- Training Testing Dataset.
- Regression Algorithms