Live training sessions are designed to mimic the flow of how a real data scientist would address a problem or a task. As such, a session needs to have some “narrative” where learners are achieving stated learning objectives in the form of a real-life data science task or project. For example, a data visualization live session could be around analyzing a dataset and creating a report with a specific business objective in mind (ex: analyzing and visualizing churn), a data cleaning live session could be about preparing a dataset for analysis etc ...
As part of the 'Live training Spec' process, you will need to complete the following tasks:
Edit this README by filling in the information for steps 1 - 4.
This part of the 'Live training Spec' process is designed to help guide you through session design by having you think through several key questions. Please make sure to delete the examples provided here for you.
mlxtend
.seaborn
and matplotlib
.numpy
pandas
matplotlib
seaborn
mlxtend
Terms like "beginner" and "expert" mean different things to different people, so we use personas to help instructors clarify a live training's audience. When designing a specific live training, instructors should explain how it will or won't help these people, and what extra skills or prerequisite knowledge they are assuming their students have above and beyond what's included in the persona.
Check all that apply.
Reasoning: Market Basket Analysis has limited overlap with popular methods in machine learning and data science (e.g. deep learning, gradient boosting, clustering, etc.). As such, learning the basics of Market Basket Analysis will open up an entirely new toolset for many data analysts, data scientists, statisticians, and machine learning scientists.
List three or more examples of skills that you expect learners to have before beginning the live training
array
in numpy
.DataFrame
in pandas
, create columns, and apply basic methods, such as .mean()
and .sum()
..apply()
and lambda
functions to transform columns in a DataFrame
.matplotlib
.List any prerequisite courses you think your live training could use from. This could be the live session’s companion course or a course you think students should take before the session. Prerequisites act as a guiding principle for your session and will set the topic framework, but you do not have to limit yourself in the live session to the syntax used in the prerequisite courses.
Market Basket Analysis in Python
A live training session usually begins with an introductory presentation, followed by the live training itself, and an ending presentation. Your live session is expected to be around 2h30m-3h long (including Q&A) with a hard-limit at 3h30m. You can check out our live training content guidelines here.
Introduction Slides
Data Preparation
pd.read_csv()
.pandas
and numpy
methods.Association Rules, Metrics, and Pruning
TransactionEncoder
from mlxtend
..mean()
to compute support for
individual items..mean()
to compute support for
itemsets.lambda
function.matplotlib
and seaborn
.The Apriori Algorithm
apriori
and association_rules
from mlxtend
.min_support
, max_len
, and min_threshold
to
perform pruning over itemsets and association rules.matplotlib
and seaborn
.Ending Slides
To get yourself started with setting up your live session, follow the steps below:
data
folder.assets
folder.notebooks
folder, and keep the template you want for your session while deleting all remaining ones.You can author and save your progress on your notebook using either of these methods.
How to author your notebook: By directly saving into GitHub
File
, Save a copy in GitHub
and follow remaining prompts.session_name_solution.ipynb
create an empty version of the Notebook to be filled out by you and learners during the session, end the file name with session_name.ipynb
. How to author your notebook: By uploading notebook into GitHub
file
- download .ipynb
file - and overwrite the notebook by uploading it into GitHub. session_name_solution.ipynb
create an empty version of the Notebook to be filled out by you and learners during the session, end the file name with session_name.ipynb
. You can check out either of those methods in action using this recording.