Freeglisse.com positions itself as the leading second-hand ski resale website in France. It offers an extensive catalog of equipment, categorized by type of use, quality, brand, etc. Our objective is to analyze the products available for resale on the site to assess its positioning and product offerings. This initiative is being conducted in preparation for the acquisition of the site by a third-party company (Olist).
To achieve this, we will first collect product information through web scraping with Python (using BeautifulSoup and Selectolax) and then analyze the gathered data using a Power BI dashboard.
Note : This project is an exercise as part of a data analysis training program, designed to teach web scraping techniques with Python and to master dashboard design using Power BI.
Since the products on freeglisse.com are categorized into three quality levels (A, B, and C), we perform web scrapping from the base URLs stored in the config.py file :
BASE_URLS = {
"A": "https://freeglisse.com/fr/12-ski-occasion/s-1/etat_du_materiel-qualite_a?page=",
"B": "https://freeglisse.com/fr/12-ski-occasion/s-1/etat_du_materiel-qualite_b?page=",
"C": "https://freeglisse.com/fr/12-ski-occasion/s-1/etat_du_materiel-qualite_c?page=",
}
Clone this repository, create a virtual environment, and install dependencies :
git clone https://github.com/cyrilgrv/freeglisse-olist/
cd freeglisse-olist
python3 -m venv venv
source venv/bin/activate #Unix
venv\Scripts\activate #Windows
pip install -r requirements.txt
python main.py
Note : The data displayed in this demo dashboard was fetched on October 12, 2024.