Bike store sales

This is an article that records I use Excel for data cleaning and dashboard creation process.


Dataset: Kaggle: Bike Buyers 1000

1. Overview and Cleaning data

In this case, we got 13 columns:

ID Marital Status Gender Income Children Education Occupation Home Owner Cars Commute Distance Region Age Purchased Bike
29447 S F $10,000.00 2 Bachelors Clerical No 1 2-5 Miles Europe 68 No
29424 M M $10,000.00 0 Partial High School Manual Yes 2 0-1 Miles Europe 32 No
29380 M F $20,000.00 3 High School Manual Yes 0 0-1 Miles Europe 41 Yes

The last column “Purchased Bike” is extremely important, this is going to tell us whether they did or did not buy a bike. I’m going to use it a lot in the next step.

Before cleaning data step, create 3 sheets:

  • Working sheet: a copy of original data. Modify data in this sheet in case I mess something up or any issue happens.
  • Pivot table:
  • Dashboard:

First of all, I would like to remove duplicates data, just want to see if there’s any useless duplicated data that I do not need.

What I did for the data cleaning process:

-

2. Making Pivot Tables

3. Creating a Dashboard

4. Adding Filter Buttons


Project document: Bike sales analysis


Bike store sales
http://jknuts.github.io/2022/09/26/Bike-store-sales/
Author
Haohui Jin
Posted on
September 26, 2022
Licensed under