Bike store sales
This is an article that records I use Excel for data cleaning and dashboard creation process.
Dataset: Kaggle: Bike Buyers 1000
1. Overview and Cleaning data
In this case, we got 13 columns:
| ID | Marital Status | Gender | Income | Children | Education | Occupation | Home Owner | Cars | Commute Distance | Region | Age | Purchased Bike |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29447 | S | F | $10,000.00 | 2 | Bachelors | Clerical | No | 1 | 2-5 Miles | Europe | 68 | No |
| 29424 | M | M | $10,000.00 | 0 | Partial High School | Manual | Yes | 2 | 0-1 Miles | Europe | 32 | No |
| 29380 | M | F | $20,000.00 | 3 | High School | Manual | Yes | 0 | 0-1 Miles | Europe | 41 | Yes |
| … | … | … | … | … | … | … | … | … | … | … | … | … |
The last column “Purchased Bike” is extremely important, this is going to tell us whether they did or did not buy a bike. I’m going to use it a lot in the next step.
Before cleaning data step, create 3 sheets:
- Working sheet: a copy of original data. Modify data in this sheet in case I mess something up or any issue happens.
- Pivot table:
- Dashboard:
First of all, I would like to remove duplicates data, just want to see if there’s any useless duplicated data that I do not need.
What I did for the data cleaning process:
-
2. Making Pivot Tables
3. Creating a Dashboard
4. Adding Filter Buttons
Project document: Bike sales analysis
Bike store sales
http://jknuts.github.io/2022/09/26/Bike-store-sales/