Skip to content

gokcengiz/Shopping-data-analysis

Repository files navigation

Customer Shopping Data Analysis

alt text

About Dataset

The project is based on the Customer Shopping Dataset - Retail Sales Data provided by Kaggle. We will do data visualization studies with PowerBI using this data set.

Content

The dataset includes shopping information from 10 different shopping centers between the years 2021-2023. Data is available from various age groups and genders to provide a comprehensive view of shopping habits in Istanbul. The dataset includes basic information such as invoice numbers, customer IDs, age, gender, payment methods, product categories, quantity, price, order dates, and mall locations.

The dataset contains 10 columns in customer_shopping_data.csv

  • invoice_no: Invoice number. Nominal. A combination of the letter 'I' and a 6-digit integer uniquely assigned to each operation.
  • customer_id: Customer number. Nominal. A combination of the letter 'C' and a 6-digit integer uniquely assigned to each operation.
  • gender: String variable of the customer's gender.
  • age: Positive Integer variable of the customers age.
  • category: String variable of the category of the purchased product.
  • quantity: The quantities of each product (item) per transaction. Numeric.
  • price: Unit price. Numeric. Product price per unit in Turkish Liras (TL).
  • payment_method: String variable of the payment method (cash, credit card or debit card) used for the transaction.
  • invoice_date: Invoice date. The day when a transaction was generated.
  • shopping_mall: String variable of the name of the shopping mall where the transaction was made.