The data is retail sales data at a fictional superstore located in the Sample - Superstore csv. Here's the key:
Row ID => Unique ID for each row.
Order ID => Unique Order ID for each Customer.
Order Date => Order Date of the product.
Ship Date => Shipping Date of the Product.
Ship Mode=> Shipping Mode specified by the Customer.
Customer ID => Unique ID to identify each Customer.
Customer Name => Name of the Customer.
Segment => The segment where the Customer belongs.
Country => Country of residence of the Customer.
City => City of residence of of the Customer.
State => State of residence of the Customer.
Postal Code => Postal Code of every Customer.
Region => Region where the Customer belong.
Product ID => Unique ID of the Product.
Category => Category of the product ordered.
Sub-Category => Sub-Category of the product ordered.
Product Name => Name of the Product
Sales => Sales of the Product.
Quantity => Quantity of the Product.
Discount => Discount provided.
Profit => Profit/Loss incurred.
Describe the dataset given with three or more data visualizations. These can be maps, histograms, line graphs, combinations of those, or anything else. It can be a time-series, or an interactive plot.
Think, if you could only show someone these graphs to describe most of the data, what graphs would you choose.
Design matters, making this beautiful matters.
The sky is the limit!
Part 2 is more free-form, and allows you to showcase YOUR specific skillset.
Here's a chance to showcase your data science skills!
Model this data! Predict some outcome, make some claims, show your work, analyse it statistically, and tell us your thinking all the way through. You can use machine learning, deep learning, statistical analysis, linear models, or anything else that you want to use. Whatever interests you, go for it!
Be creative, think big, and report your findings in a clean, clear way!
Email [email protected] with questions!