Identify a question
Develop a hypothesis
Tell the story with your data
Brainstorm project ideas
Selected a scenario (question) that you will answer for project 00 (problem set 4), applying the D3M process, principles of data storytelling, and guidelines for creating effective data visualizations.
We will spend Friday’s lab working on building your data story to answer the question you choose.
A question might be driven by a business decision or goal. Examples:
We need to cut costs. Can we reallocate labor more efficiently?
We want to increase profits. Can we increase marketing efforts on profitable product lines?
Ask yourself:
supermarket_sales
dataset?supermarket_sales
dataset?What do you think the answer is?
Why? \(\rightarrow\) Describe the mechanism.
The why part of your answer will help guide your analysis of the data.
Think through how you can analyze the data to answer your question.
The evidence should support your conclusions.
We have a limited budget for a promotional campaign.
Questions:
Which product(s) have a high customer rating by customer type?
What day of the week do members buy those products?
What kind of visuals will help you answer these questions?
Which product(s) have a high customer rating by customer type?
What day of the week do members buy those products?
What else might you investigate during your exploratory data analysis (EDA)?
Does this vary by city/branch?
Does this vary by gender?
Does this vary by payment type?
Keep a notebook as you explore your dataset, especially in Tableau.
Document your thought process–this will help you tell your data story.
Data analysis can be fun–you may be tempted to keep checking new things.
Stay focused on your question and quit when you have answered it.
Develop a narrative.
Create effective visuals.
Start with the question–Sometimes you won’t have a clear question, but you are communicating an insight that requires audience engagement.
Characters and setting–Who and what is involved?
Preliminaries–Provide supporting evidence before revealing the main finding.
Answer (or insight)–Clearly show your main result.
Next steps–Suggest recommendations or further investigation.
Limitations–Understand and communicate the limitations of your analysis.
Which product(s) have a high customer rating by customer type?
What day of the week do members buy those products?
Who are the characters?
What is the setting?
Sales in Q1 2019 at three branch locations
Transaction-level data, where customers rate the product and buying experience after a purchase
You will often need to set up your main point with supporting evidence.
Then, drill down into the details.
Your key insights is the most important part of your story.
Make sure the visual is effective.
Does your visual clearly illustrate the point?
Members rate health and beauty products highest among categories
Members spend more on health and beauty products on the weekend
Make recommendations (if appropriate)
Identify additional analyses if needed.
Example analysis:
Understanding the limitations of your analysis, and communicating them, helps ensure transparency and prevents overgeneralization of your findings.
Are there any data limitations (e.g., missing variables, small sample size, outdated information)?
Could there be biases in the data collection process or analysis?
What external factors might influence the results that were not accounted for?
Here are some example project questions:
Understanding Customer Behavior
Sales & Revenue Analysis
Customer Segmentation
Spending Methods
Customer Satisfaction Analysis
Dykes, B. (2019). Effective Data Storytelling: How to Drive Change with Data, Narrative and Visuals (1st edition). Wiley.