- Live Dashboard
- Video Presentation
- Linkedin Post Link
- Details about complete challenge
- Dashboard PDF version
This project focuses on analyzing the operations and performance of GoodCabs, a cab service company, through data-driven insights. The goal is to understand key business metrics, identify growth opportunities, and optimize services based on data analysis.
GoodCabs is facing challenges in key areas like revenue growth, repeat passenger rate, and operational efficiency. The business requires actionable insights to improve performance, enhance customer experience, and adapt to emerging trends.
- Identify patterns in revenue, trips, passengers, and ratings.
- Analyze city-specific performance to find growth opportunities.
- Establish correlations between customer behavior, trip distance, and fare.
- Enhance the decision-making process with interactive dashboards.
-
Business Problem Understanding:
- Gathered insights from stakeholders to understand key objectives and requirements.
-
Exploratory Data Analysis (EDA):
- Conducted a thorough EDA using Power Query and SQL to explore the data, identify patterns, and summarize key findings.
-
Ad-Hoc Analysis:
- Performed ad-hoc data analysis using SQL queries to answer specific business questions.
- Visualized results using Excel for easy interpretation.
-
Data Cleaning & Transformation:
- Cleaned and transformed raw data to ensure data integrity and consistency.
- Applied Star and Snowflake schemas to establish relationships between different datasets.
-
Dashboard Creation:
- Created an interactive Power BI dashboard to visualize key performance indicators (KPIs) such as revenue, trip types, passenger ratings, etc.
-
Results Presentation:
- Compiled key findings and insights into a PowerPoint presentation for stakeholders.
This project utilizes data from two primary databases: trips_db
and targets_db
. These databases contain detailed and aggregated data on Goodcabs' operations, including trip information, passenger behavior, city-specific performance, and monthly targets for growth.
This database stores detailed and aggregated data on trips, passenger types, and repeat trip behavior across Goodcabs’ operations in tier-2 cities. The data is organized by city, month, and day type (weekday or weekend) for comprehensive analysis of travel patterns, demographics, and repeat usage trends.
-
dim_city
- Purpose: Provides city-specific details for location-based analysis of trips and passenger behavior.
- Key Columns:
city_id
: Unique city identifier (e.g., RJ01 for Jaipur).city_name
: City name (e.g., Jaipur, Lucknow).
-
dim_date
- Purpose: Provides date-specific details for time-based grouping and analysis of trip patterns across days, months, and weekends vs weekdays.
- Key Columns:
date
: The specific date of the entry (YYYY-MM-DD).start_of_month
: The first day of the respective month.month_name
: Name of the month.day_type
: Weekday or weekend indicator.
-
fact_passenger_summary (Aggregated Data)
- Purpose: Provides aggregated passenger counts (new and repeat) for each city by month.
- Key Columns:
month
: Start date of the month.city_id
: City identifier.total_passengers
: Total count of all passengers.new_passengers
: Count of new passengers.repeat_passengers
: Count of repeat passengers.
-
dim_repeat_trip_distribution (Aggregated Data)
- Purpose: Provides a breakdown of repeat trip behavior, categorized by trip frequency (up to 10 trips per month) for each city.
- Key Columns:
month
: Start date of the month.city_id
: City identifier.trip_count
: Number of trips taken by repeat passengers.repeat_passenger_count
: Count of repeat passengers for each trip frequency.
-
fact_trips
- Purpose: Provides detailed trip-level data, including distance, fare, and ratings for each trip.
- Key Columns:
trip_id
: Unique trip identifier.date
: Date of the trip.city_id
: City identifier.passenger_type
: New or repeat passenger.distance_travelled (km)
: Distance of the trip in kilometers.fare_amount
: Fare amount paid.passenger_rating
: Passenger rating (1-10).driver_rating
: Driver rating (1-10).
This database contains monthly targets for each city, including trip counts, new passenger acquisition, and average passenger ratings. It helps in evaluating Goodcabs' performance against the company's established goals.
-
city_target_passenger_rating
- Purpose: Stores target average passenger ratings for each city.
- Key Columns:
city_id
: Unique city identifier.target_avg_passenger_rating
: Target average passenger rating.
-
monthly_target_new_passengers
- Purpose: Stores the target number of new passengers to acquire for each city in a given month.
- Key Columns:
month
: Start date of the target month (YYYY-MM-DD).city_id
: City identifier.target_new_passengers
: Target new passenger count.
-
monthly_target_trips
- Purpose: Stores the target number of total trips to achieve for each city and month.
- Key Columns:
month
: Start date of the target month (YYYY-MM-DD).city_id
: City identifier.total_target_trips
: Target total trips count.
-
Optimize Fare Strategy:
- Introduce dynamic pricing models to attract passengers during off-peak hours.
- Maintain competitive pricing to retain repeat customers.
-
Boost Passenger Satisfaction:
- Address key pain points: driver behavior, and repeat passenger rate.
- Introduce driver training programs and incentivize quality service.
-
Targeted Marketing:
- Promote services during tourism seasons and local events to boost demand.
- Run campaigns to attract new passengers while offering loyalty programs to retain repeat users.
-
Partnership Opportunities:
- Collaborate with local businesses (hotels, malls, events) to drive demand in high-footfall areas.
-
Data Collection & Analysis:
- Implement feedback mechanisms to gather more insights on passenger preferences.
- Analyze socio-economic patterns to tailor operations city-wise.
The project provided actionable insights to optimize Goodcabs' operations, improve customer satisfaction, and increase repeat passenger rates. The recommendations aim to drive sustainable growth and enhance operational efficiency across cities through data-driven decisions.
- SQL: For data extraction and ad-hoc analysis.
- Power Query: For data cleaning and transformation.
- Excel: For visualizing analysis results.
- Power BI: For interactive dashboard creation.
- Canva: For presenting insights to stakeholders.
- Flaticon: For icons
- Adobe Color & Coolors: For generating color palette
- Adobe Stock & shutterctock: For images
- Data Analysis
- Visualization
- Critical Thinking
- Business Strategy
- Communication