noklam / blog_archive_fastpage

Nok's data science blog
https://noklam.github.io/blog
Apache License 2.0
2 stars 1 forks source link
blog data data-science machine-learning python sceince

description: My learning notes. Just in time (JIT) is better than Just in Case

README

Introduction

https://noklam.ml

All things data

I am a data scientist. Recently, I find myself studying database, data structure, data pipeline way more than machine learning. To build a good model, I found the importance of writing good code to produce data with quality often triumphs a SOTA model.

Delivering the model is the job of a data scientist. Inevitably, every data scientist should somewhat be a "full-stack" data scientist.

This is a central repository for my blogs and notes

Resource

I am generally interested in tools that increase productivity, please let me know if you have any recommendations. Here is a list of software/topics that I found useful.

Uncertainty Estimation

Uncertainty Quantification in Deep Learning

Visualization

Visualization (University of Washington)

Custom Matplotlib style for Presentation (Larger font size)

https://raw.githubusercontent.com/noklam/mediumnok/master/_demo/python-viz/presentation.mplstyle

my_style = 'https://raw.githubusercontent.com/noklam/mediumnok/master/_demo/python-viz/presentation.mplstyle'

with plt.style.context(['ggplot', my_style]):
    make_scatter_plot()
    make_line_plot()

Useful Python Tools

Syntax Highlight

A catalog of various machine learning topics.

Graph Neural Network Basics

Understand What is the weird D-1/2LD-1/2

  1. spectral graph theory - Why Laplacian Matrix need normalization and how come the sqrt of Degree Matrix? - Mathematics Stack Exchange
  2. spectral graph theory - Why Laplacian Matrix need normalization and how come the sqrt of Degree Matrix? - Mathematics Stack Exchange
  3. What's the intuition behind a Laplacian matrix? I'm not so much interested in mathematical details or technical applications. I'm trying to grasp what a laplacian matrix actually represents, and what aspects of a graph it makes accessible. - Quora

Supplement Chinese Reading

  1. Heat Diffusion
  2. GCN use edge to agg node information
  3. How to do batch training with GCN

Time Series Forecast

Motivation

While neural network has gain a lot of success in NLP and computer vision, there are relatively less changes for traditional time series forecasting. This repository aims to study the lastest practical technique for time series prediction, with either statistical method, machine learning, or deep neural network.

Forecasting Methods

Statistical Method

Machine Learning

Deep Neural Network

Gramian Angular Field : Transform time series into an image and use transfer learning with CNN

Prediction Interval

While forecasting accuracy is important, the prediction interval is also important and it is an area that the machine learning world has less focus on.

Python Time Series Forecasting Library

Prophet (Facebook): Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth. It has build-in modeling for the Holiday effect.

pyts : state-of-the-art algorithms for time-series transformation and classification

Contribution

Feel free to send a PR or discuss by starting an issue.😁

powered by fastpages

fastpages allow me to blog directly in Notebook, so I don't have to worry how to convert into markdown anymore. I simple code and write.