anton-dergunov / ml-playground

Personal repository for Machine Learning projects
1 stars 0 forks source link

ml-playground

Personal repository of small notebooks for Machine Learning projects.

These notebooks are intended to be my playground with various tools and techniques, so they can have mistakes or incomplete code. I would be grateful for any comments which you can submit using GitHub Issues.

You may need to provide configuration for some notebooks. For that, copy .env.template file to .env and adjust some parameters.

Blog

blog/univariate-feature-selection-draft.ipynb

Draft of a blog post on univariate feature selection.

Courses

courses/fastai/fastai_01_hw_birds.ipynb

Exercise for FastAI DL course: classification of bird images using CNN and transfer learning (executed on Azure).

courses/fastai/fastai_01_hw_simpsons.ipynb

Exercise for FastAI DL course: classification of Simpson character images using CNN and transfer learning (executed on Azure).

courses/fastai/fastai_01_hw_simpsons_colab.ipynb

Exercise for FastAI DL course: classification of Simpson character images using CNN and transfer learning (executed on Colab).

courses/fastai/fastai_03_hw_quora.ipynb

Exercise for FastAI DL course: Quora insincere question classification using language model + LSTM (executed on Azure, didn't work well).

courses/handson_ml/handson_ml_01.ipynb

Exercise from book "Hands-On Machine Learning": basic data analysis of life satisfaction dataset.

courses/handson_ml/handson_ml_02.ipynb

Exercise from book "Hands-On Machine Learning": basic data analysis of housing dataset, plotting data on a map.

courses/handson_ml/handson_ml_09.ipynb

Exercise from book "Hands-On Machine Learning": basic experiments with TensorFlow API, TensorFlow graph.

courses/handson_ml/handson_ml_10.ipynb

Exercise from book "Hands-On Machine Learning": basic MNIST with DNN via TensorFlow.

LangChain

langchain/chatbot.ipynb

Basic chatbot using LangChain.

langchain/simple_llm_application.ipynb

Making a simple LLM call using LangChain.

PySpark

pyspark/pyspark-snippets.ipynb

Snippets showing basic usage of PySpark.

PyTorch

pytorch/basics.ipynb

Basics of PyTorch: tensors, etc.

pytorch/cifar10.ipynb

CIFAR10 via Simple CNN in PyTorch.

pytorch/mnist.ipynb

MNIST with PyTorch using simple CNN: from scratch, then using higher-level APIs.

pytorch/rnn.ipynb

Learning several sentences using Elman RNN.

pytorch/lstm.ipynb

Sentiment analysis (IMDB) using LSTM.

 Snippets

snippets/remove-presentation-toolbar.ipynb

Little snippet to remove toolbar from FastAI notebooks on Azure.

Tasks

tasks/quora/01-LogisticRegression.ipynb

Quora insincere question classification: logistic regression with hashing trick.

tasks/quora/02-SplitData.ipynb

Quora insincere question classification: splitting train/test/valid data using stratified sampling.

tasks/quora/03-LogisticRegression-Analysis.ipynb

Quora insincere question classification: analysis of important words for logistic regression classification.

tasks/quora/05-RNN.ipynb

Quora insincere question classification: simplest RNN (unfinished).

tasks/jigsaw-edl.ipynb

Jigsaw toxic comments dataset: simple exploratory data analysis.

tasks/spam_classifier.ipynb

Spam classification using Spacy, various classifiers.

tasks/titanic.ipynb

Classic titanic dataset: data analysis, feature engineering, training models (logistic regression, SVM, decision tree, random forest, gradient boosting, including XGBoost, LightGBM, CatBoost).

Techniques

techniques/maximal_marginal_relevance.ipynb

Maximal marginal relevance.

Visualization

visualization/matplotlib.ipynb

Examples of helpful matplotlib graphs (to be extended).