This is a self-curated project using real world data - the "Iowa Liquor Sale".
This project started from a big ambiguous question:
"How to optimize sale portfolio in Iowa for a vendor?"
I queried the dataset in Google #Bigquery, and picked Jimbeam as the vendor in this project. By the data that is currently available, I built this mock business recommendations on how to optimize their product portfolio in Iowa.
The material and the information contained in this presentation and in any documentation attached to it are for my personal case study purpose only. It does not constitute an offer, true business recommendation to any person and/ or organization.
For distribution channels of grocery, liquor and convenience stores in Iowa, the business recommendation I have for Jim Beam Brands are:
For Whiskey products: Maintain the current amount of investment in this category and generate as much profit as possible. And use the capital in other areas such as diversification in the production line.
For Tequila: Invest more in this category and try to get as much market share before the category market growth slows down.
For Brandy, Cocktails/ RTD, Cordials/ Liqueurs, Vodka, Rum and Gin: Unless there are some other strategic aims, I recommend that Jim Beam Brands start to gradually divest from these markets and re-allocate the capital to other categories like Tequila.
Especially Rum and Gin, as they have had the least growth rate (near 0%) since 2012.
This dataset contains the spirits purchase information of Iowa Class “E” liquor licensees by product and date of purchase from January 1, 2012 to September 30th, 2022.
There are 24.8 million rows with 24 columns, each row is an individual product purchase.
You can find it on Bigquery public dataset or access the same via the link: https://data.iowa.gov/Sales-Distribution/Iowa-Liquor-Sales/m3tr-qhgy
I will break this big ambiguous business problem into small pieces and use data to answer each small question. In the end put them together to answer the big question.
The framework I drew and going to use in the analysis:
To save time, I will answer the quantitative questions and then investigate further on qualitative questions in a top-down way.
Data exploration, extraction, processing and analyzing are done by SQL queries in Google Bigquery sandbox. As I used the free version of Bigquery, you might find the data cleaning process unlike normal practice as the DML is not available in the free version.
Visualization: Microsoft Power BI
This analysis is for the sale portfolio in Iowa area only.
As the dataset only contains spirits purchase information of Iowa Class “E” liquor licensees, this means the distribution channel only include: grocery, liquor and convenience store and their wholesale sales to on-premise class "A", "B", "C" and "D". The dataset does not include on-premise sales (licensees "A", "B", "C" and "D"), i.e. members and guests of non-profit clubs, hotels and motels, taverns, bars, restaurants, trains, airplanes and watercrafts etc.
The dataset contains the sales record of liquor in Iowa from 2012-01-03 to 2022-09-30. There were 24,842,520 records in the dataset, 24,847,481 (-4,961 rows) after cleaning.
There are 24 fields in this dataset, as per the dimensions that the data is stored, I categorized them into below 4 categories:
As the dimension "Store" is not in the scope to solve the business problem, to save time, I cleaned other fields for analysis.
[bottles_sold], [sale_dollars]
[category_name], [item_description]
For those that are not categorized as above or not listed, assign to “OTHERS”
[state_bottle_retail]
[vendor_name]
In fact, in the beginning, the big ambiguous question I asked in this project was "How to optimize the sale portfolio for a vendor?" There wasn't "the who" at the beginning. So, I took a look to see the ranking of vendors (by sales through 2012/01/01 and 2022/09/30)
The top 3 are Diageo, Sazerac Company and then JimBeam.
After going over the ranking, I want to go with the analysis to help Jim Beam Brands, the 3rd largest vendor by market share. The reason is: They have room to grow, and the NO.4 Pernod Ricard is so close behind. (or maybe simply because I like their whiskey)
As I picked Jim Beam Brands as client in this project, defining the market for most of their products is my first priority (an incorrectly defined market can lead to poor classification in liquor market)
I classified the liquor market (in price) into these buckets:
Based on the buckets, the price distribution of Jim Beam Brands Products shows as:
Most of Jim Beam Brands products (98.43%) are less than and equal to USD 100. Therefore, I will proceed the analysis with the market that the price per bottle is ≤ USD 100.
Plus, I want to see where Jim Beam Brands stands in each category, so I will be putting each category in a growth-share matrix. Therefore, I want to get “Market Growth” of each category and “Relative Market Share” next.
I used CAGR rather than YoY% to calculate market growth because it gives me an idea of general growth rate.
As the max date in dataset that currently available is 2022-09-30 (end of Q3), to calculate the CAGR over the past 10 years, I used 1st, October 2012 as starting date.
Firstly, in order to draw the middle line in the growth rate axis in the matrix, I used the overall CAGR of Iowa liquor market as that data point.
The overall CAGR of Iowa liquor market is: 5.55% (Note: this is for market that each bottle ≤ USD 100, from Q4 2012 to Q3 2022)
As the top 10 market share holder in each category control the big majority liquor market in each category, I will use the average relative market share of top 10 vendors as the middle data point to draw the line.
The average relative market share is: 0.208
Note: as the leading rival’s relative market share is 1, so every leading rival’s relative market share is excluded (only 9 relative market share in top 10 in each category is used to count the average)
After collecting all the data points necessary, the growth-share matrix can be drawn as following: (Note: As category: “OTHERS” is not listed in specific market. And “SPECIALTY” mostly contains special packages that are not regular product. I didn’t dive into these 2 categories for the time being.)
What the matrix tells:
Whiskey product is Jim Beam Brands’ cash cow that is the most competitive and generating the most revenue among all the categories.
Tequila product is Jim Beam Brands’ star as the growth rate of the Tequila market is far greater than the overall liquor market and Jim Beam Brands control a competitive amount of market share.
Other categories are relatively plain in terms of growth rate and Jim Beam Brands' relative market share.
In order to strengthen what the growth-share matrix tells, by finding out the growing momentum of each category will help to make decision.
To get the growing momentum of each category, I queried the CAGR change from long term to short term (from a 9 years interval to 1 year interval) to see the trend.
We will find that in a 3 years interval, other than "Tequila", the growth rate of other categories started to decline and some even turn negative in a 2 years interval.
The main market (in price) of Jim Beam Brands is below and equals to USD 100 per bottle with 98.43% in total.
The overall CAGR of Iowa liquor market from Q4 2012 to Q3 2022 (for market that each bottle ≤ USD 100) is 5.55%
Jim Beam Brands most competitive category by relative market share is “Whiskey” (64%), then “Tequila” (23%) follows.
Tequila is the most active market with CAGR of 11.67% over Q3 2012- Q3 2022 and meantime has the best momentum in growth rate compared to all other categories.
For next step, I'd like to look into three areas:
Profitability: I will look into the profitability of each category and narrow it down to “which” product to identify which exact product(s) to invest in or divest. Then it can be further identified how much the profit would be increased by investing more or divesting from specific categories.
Sales Trend: After the profitability check, I will dive into the sales trend of each product in each category to support the decision-making of which exact product(s) to invest more or divest.
Risks: Look into whether divesting the categories will negatively impact the brand name. And investigate whether it will give too much share to competitors.