This dataset has information about the customer and its location. Use it to identify unique customers in the orders dataset and to find the orders delivery location.
Here is the Data that we're going to analyze.
After we create this report, we can clearly observe each part and feature of all columns in dataset. Y can see that it not only evaluates the overall data but also divides each column and analyzes parameters such as distinct rows, missing values, the memory size, or many types of data, mean value, median value....
And if y want to know any informations for the column you choose, the report has a more details
button for you under each the description of column
From the color bar, which column is the more blue it has, the higher its value and otherwise, the column with more red color has lower value. We use heatmap because colors are usually used to represent the magnitude of the values in each cell. This makes it easier to visually compare values across different columns or rows, as the eye can quickly pick up on differences in color intensity.
We use the data of the first column in cohort_data
to see the big change by each period of the year. The beginning of Dec 2021 with the highest value of 948, but from Jan 2022 to the end, we see the uneven distribution that indicates the customer retention rate is very low and the policy don't apply well. Clients are decreasing and very few are returning.
In general, the range starts from Jun 2022 to the end, that doesn't have too many changes but from June 2022 onwards, the total of return clients is quite even. This is the perfect time to encourage customers to come back and make a purchase. Look at this chart, we can see how many percentages of people return at each period, and after 12 months of the year, how much is the change? The later, the fewer customers come back to buy, and the fewer new clients we attract. From there, we need to reconsider our product or evaluate whether the department of customer service is actually effective...
The chart shows the increase and decrease in the average order value per month, but we don't know whether these changes are due to spending by new or existing customers. We also don't know if the rise in AOV is due to higher revenue or an increase in the number of clients. Instead of seeing the AOV monthly chart, we try to check the Revenue & Customer by month chart below
From Sep, Oct and Nov 2022 in The Revenue & Customer per month chart
we can see revenue and no.customer in these columns that are very high and rise strongest but in The AOV chart
, we see the average order value is very low and decrease more than other column. So we don't know the real how the our client's spending impacts to the AOV
Insight here
new customer
has tends to reduce and returning customer
is quite a fewIn CohortIndexMonth
= 1, that means the number of new customers. In CohortIndexMonth
from 2 to 13, that means the number of returning clients. After 1 month - Dec 202, we just have 1131 existing clients left and the no. returning users is reducing as time goes on. After 12 month - 1 year, we only retains 260 customers
We can see that in the period from Dec 2021 to Dec 2022, we have 4371 new users
but by the end of the research phase, we just retain 260 `users', corresponding to 5.95%. This is a relatively low percentage for e-commerce sites. If the product has long depreciation period, we will need to choose a longer research period.
After 12 months, the overall customer churn up to 90% in the research period. We need to get goals to retain customers and keep the churn ratio as low as possible. The lower retention rate and the more churn rate that indicate your products are low quality, website isn't attractive and convenient for purchasers, or the customer service is substandard