Open philusnarh opened 5 years ago
House-prices https://github.com/itsmuriuki/Predicting-House-prices https://www.dataquest.io/blog/machine-learning-tutorial/ https://www.kaggle.com/gopalchettri/usa-housing-machine-learning-linear-regression https://github.com/topics/housing-prices https://www.kaggle.com/lianglirong/tensorflow-predict-house-prices https://github.com/wqxu/kaggle/tree/master/House_Prices-Advanced_Regression_Techniques
Ghana data sites https://meqasa.com/house-for-sale-at-East-Legon-080024?y=1206773335
7 Types of Regression Techniques you should know! https://www.analyticsvidhya.com/blog/2015/08/comprehensive-guide-regression/
A Complete Tutorial to Learn Data Science with Python from Scratch https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-learn-data-science-python-scratch-2/ https://www.analyticsvidhya.com/blog/2015/10/regression-python-beginners/ https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-ridge-lasso-regression-python/ http://flennerhag.com/2017-04-18-introduction-to-ensembles/
X-ray Predictions https://www.kaggle.com/paultimothymooney/predicting-pathologies-in-x-ray-images https://github.com/brucechou1983/CheXNet-Keras https://www.kaggle.com/giuliasavorgnan/start-here-beginner-intro-to-lung-opacity-s1/notebook https://mc.ai/detecting-pneumonia-with-deep-learning/ https://medium.com/datadriveninvestor/detecting-pneumonia-with-deep-learning-a-soft-introduction-to-convolutional-neural-networks-b3c6b6c23a88 https://mc.ai/detecting-pneumonia-with-deep-learning/ https://pythonprogramming.net/convolutional-neural-network-kats-vs-dogs-machine-learning-tutorial/ https://www.kaggle.com/mallela432/cats-vs-dogs-cnn-implementation-with-keras https://github.com/halfbloodprince16/Meow-v-s-BhowBhow/tree/master/train http://mc.ai/image-classification-foundation-in-keras-with-python/
** Convolution Animation https://medium.com/x8-the-ai-community/cnn-9c5e63703c3f
image segementation https://github.com/Borda/pyImSegm
detecting-pneumonia-in-x-ray-images https://www.kaggle.com/paultimothymooney/detecting-pneumonia-in-x-ray-images
pip install tensorflow-gpu https://github.com/liuzhuang13/DenseNet https://www.tensorflow.org/install/source#common_installation_problems https://github.com/keras-team/keras/releases?after=1.0.7 step by step building of deep-learning https://machinelearningmastery.com/tutorial-first-neural-network-python-keras/
tensor flow installation http://deeplearning.lipingyang.org/2017/01/19/install-tensorflow-for-python-2-7-and-3-5-on-one-machine/ https://stackoverflow.com/questions/48580703/downsizing-a-densenet121-under-keras
densenet https://towardsdatascience.com/densenet-2810936aeebb https://github.com/thtang/CheXNet-with-localization https://github.com/topics/chexnet https://github.com/jrzech/reproduce-chexnet
Complete densenet package https://github.com/liuzhuang13/DenseNet https://innolitics.com/articles/pretrained-models-with-keras/
Softmax vs Sigmoid function in Logistic classifier? https://stats.stackexchange.com/questions/233658/softmax-vs-sigmoid-function-in-logistic-classifier
Natural Language processing https://www.kaggle.com/ashishpatel26/practice-tutorial-for-toxic-classification https://github.com/tensorflow/workshops/blob/master/extras/keras-bag-of-words/keras-bow-model.ipynb https://ai.stanford.edu/~amaas/data/sentiment/ https://www.kaggle.com/thechosanone/donald-trump-tweets https://github.com/mukesh-mehta/VDCNN https://github.com/mukesh-mehta/VDCNN/blob/master/toxic.ipynb http://dsbyprateekg.blogspot.com/2017/12/can-you-build-model-to-predict-toxic.html https://www.kaggle.com/fizzbuzz/bi-lstm-conv-layer-lb-score-0-9840/code https://machinelearningmastery.com/develop-n-gram-multichannel-convolutional-neural-network-sentiment-analysis/ https://www.kaggle.com/tannergi/text-generation-using-an-lstm-in-keras
Simulating Text With Markov Chains in Python
https://towardsdatascience.com/simulating-text-with-markov-chains-in-python-1a27e6d13fc6
Detecting Insults in Social Commentary https://www.kaggle.com/c/detecting-insults-in-social-commentary/data
Basic Of Naive Bayes https://www.analyticsvidhya.com/blog/2017/09/naive-bayes-explained/
Neural Network https://hackernoon.com/neural-networks-introduction-6048f69b68b0 https://towardsdatascience.com/build-your-own-convolution-neural-network-in-5-mins-4217c2cf964f
Road sign classification
https://chsasank.github.io/keras-tutorial.html
face recognition http://scikit-learn.org/stable/auto_examples/applications/plot_face_recognition.html#sphx-glr-auto-examples-applications-plot-face-recognition-py
Solving A Simple Classification Problem with Python — Fruits Lovers’ Edition https://towardsdatascience.com/solving-a-simple-classification-problem-with-python-fruits-lovers-edition-d20ab6b071d2
Plotting Feature Importances https://www.kaggle.com/grfiv4/plotting-feature-importances
Your First Machine Learning Model https://python-course.eu/expectation_maximization_and_gaussian_mixture_models.php https://www.kaggle.com/dansbecker/your-first-machine-learning-model https://www.kaggle.com/kanncaa1/machine-learning-tutorial-for-beginners
Training, validation & test https://en.wikipedia.org/wiki/Training,_test,_and_validation_sets
cats and dog using cnn https://pythonprogramming.net/convolutional-neural-network-kats-vs-dogs-machine-learning-tutorial/ https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python-totally-from-a-beginners-perspective-be8dbaf22dd8 Understanding CNN http://cs231n.github.io/convolutional-networks/ https://github.com/venkateshtata https://medium.com/@rohanthomas.me/convolutional-networks-for-everyone-1d0699de1a9d https://hackernoon.com/visualizing-parts-of-convolutional-neural-networks-using-keras-and-cats-5cc01b214e59 http://www.picnet.com.au/blogs/guido/2016/05/16/review-of-keras-deep-learning-core-layers/
flight delays https://perso.telecom-paristech.fr/qleroy/aml/lab3.html https://www.datasciencecentral.com/profiles/blogs/predicting-flights-delay-using-supervised-learning https://www.kaggle.com/fabiendaniel/predicting-flight-delays-tutorial https://github.com/Sudeepv5/flight-delay-prediction https://www.kaggle.com/levaniz/machine-learning-analysis-of-flights-data
Research potential https://www.kaggle.com/farazrahman/predicting-star-galaxy-quasar-with-svm/notebook?utm_medium=email&utm_source=mailchimp&utm_campaign=datanotes-20181004
Face Recognition on Olivetti Dataset https://www.kaggle.com/serkanpeldek/face-recognition-on-olivetti-dataset
Datascientist_selary_analysis https://www.kaggle.com/scarecrow88/datascientist-selary-analysis
Anomaly Detection - Credit Card Fraud Analysis https://www.kaggle.com/pavansanagapati/anomaly-detection-credit-card-fraud-analysis
YouTube Trending Videos Analysis https://www.kaggle.com/ammar111/youtube-trending-videos-analysis
Your First Machine Learning Model https://www.kaggle.com/dansbecker/your-first-machine-learning-model
HousePrice_Predic https://www.kaggle.com/klauslyu/houseprice-predict
Intro to Deep Learning and Computer Vision https://www.kaggle.com/dansbecker/intro-to-deep-learning-and-computer-vision
Film recommendation engine https://www.kaggle.com/fabiendaniel/film-recommendation-engine
Seaborn Tutorial for Beginners https://www.kaggle.com/kanncaa1/seaborn-tutorial-for-beginners https://www.kaggle.com/pankajsoni12/plotting-with-seaborn-part3
Deep Learning Tutorial for Beginners https://www.kaggle.com/kanncaa1/deep-learning-tutorial-for-beginners
Python Programming from A to Z https://www.kaggle.com/dark4user/python-programming-from-a-to-z numpy polyfit https://www.kaggle.com/lianglirong/numpy-polyfit https://www.kaggle.com/dromosys/numpy-notes
A Tensorflow Keras CNN approach https://www.kaggle.com/amneves/a-tensorflow-keras-cnn-approach/notebook https://www.kaggle.com/easter3163/basic-classification-using-tensorflow-tutorial
Honey Bee health detection with CNN https://www.kaggle.com/dmitrypukhov/honey-bee-health-detection-with-cnn?utm_medium=email&utm_source=mailchimp&utm_campaign=datanotes-20181004
Predicting flight delays [Tutorial] https://www.kaggle.com/fabiendaniel/predicting-flight-delays-tutorial https://www.kaggle.com/niranjan0272/us-flight-delay
Telecom Customer Churn Prediction https://www.kaggle.com/pavanraj159/telecom-customer-churn-prediction/data
kaggle kernels https://www.kaggle.com/kernels
Ensembles Techniques https://machinelearningmastery.com/ensemble-machine-learning-algorithms-python-scikit-learn/ https://www.analyticsvidhya.com/blog/2018/06/comprehensive-guide-for-ensemble-models/ https://www.packtpub.com/mapt/book/big_data_and_business_intelligence/9781783555130/7/ch07lvl1sec44/implementing-a-simple-majority-vote-classifier https://www.dataquest.io/blog/introduction-to-ensembles/
Financial Distress Prediction: Bankruptcy Prediction https://www.kaggle.com/saurabhharsh/financial-distress-prediction/notebook https://www.kaggle.com/eric2396/financial-distress-prediction-using-ann https://www.kaggle.com/caspitush/playing-with-pca https://www.kaggle.com/keigito/financial-distress-prediction-with-regression https://www.kaggle.com/adityasheth/financial-distress-prediction https://www.kaggle.com/azurtheowl/distress-prediction
weather https://github.com/prl900/DeepWeather https://jaxenter.com/convolutional-lstm-deeplearning4j-146157.html CNN Architectures: VGG, Resnet, Inception, Alex
CNN Architectures: VGG, Resnet, Inception, Alex https://www.kaggle.com/shivamb/cnn-architectures-vgg-resnet-inception-alex
radar https://github.com/wqxu/ConvLSTM
EM tutorials https://mk-minchul.github.io/EM/
Markov Chains in Python: Beginner Tutorial https://www.datacamp.com/community/tutorials/markov-chains-python-tutorial https://stats.stackexchange.com/questions/165/how-would-you-explain-markov-chain-monte-carlo-mcmc-to-a-layperson/51467
Implementing a basic CNN using tensorflow in python https://stackoverflow.com/questions/41611510/implementing-a-basic-cnn-using-tensorflow-in-python
https://keras.io/getting-started/sequential-model-guide/
Tutorial DL using breast cancer https://www.kaggle.com/thebrownviking20/intro-to-keras-with-breast-cancer-data-ann/notebook more on DL https://www.kaggle.com/itratrahman/convolutional-neural-net-tutorial-tensorflow/notebook
Bank_costumer_prediction https://github.com/venkateshtata/Bank_costumer_prediction https://github.com/venkateshtata/Bank_costumer_prediction
Crime Detection Using ML https://www.kaggle.com/fahd09/eda-of-crime-in-chicago-2005-2016 https://www.kaggle.com/ravitejayerra/chicago-crime-data-analysis https://www.kaggle.com/currie32/crimes-in-chicago/kernels https://www.kaggle.com/threadid/chicago-crimes-regression-neural-network/data
Build Your First Deep Learning Classifier using TensorFlow: Dog Breed Example https://towardsdatascience.com/build-your-first-deep-learning-classifier-using-tensorflow-dog-breed-example-964ed0689430
Data Exploration https://www.kaggle.com/artgor/exploration-of-data-step-by-step
Linear Regression from Scratch https://www.kaggle.com/kennethjohn/linear-regression-from-scratch
A Comprehensive ML Workflow with Python https://www.kaggle.com/mjbahmani/a-comprehensive-ml-workflow-with-python
Adding an xgboost model https://www.kaggle.com/apapiu/regularized-linear-models
Machine Learning Tutorial for Beginners https://www.kaggle.com/kanncaa1/machine-learning-tutorial-for-beginners
Deep Learning Tutorial for Beginners https://www.kaggle.com/kanncaa1/deep-learning-tutorial-for-beginners
Estimating Rainfall From Weather Radar Readings Using Recurrent Neural Networks https://simaaron.github.io/Estimating-rainfall-from-weather-radar-readings-using-recurrent-neural-networks/
Multivariate Time Series Forecasting with LSTMs in Keras https://machinelearningmastery.com/multivariate-time-series-forecasting-lstms-keras/
Student Performance Analysis https://www.kaggle.com/jshen97/student-performance-analysis-dt-updated/data
New Weather Forecasting -- Prediction of Rainfall https://www.kaggle.com/nasirmeh/prediction-of-rainfall https://www.kaggle.com/shwetabh123/rainfall-prediction
Variational GP https://gpflow.readthedocs.io/en/develop/notebooks/vgp_notes.html https://github.com/GPflow/GPflow
Introduction I’ve always had a passion for learning and consider myself a lifelong learner. Being at SAS, as a data scientist, allows me to learn and try out new algorithms and functionalities that we regularly release to our customers. Often times, the algorithms are not technically new, but they’re new to me which makes it a lot of fun.
Recently, I had the opportunity to learn more about t-Distributed Stochastic Neighbor Embedding (t-SNE). In this post I’m going to give a high-level overview of the t-SNE algorithm. I’ll also share some example python code where I’ll use t-SNE on both the Digits and MNIST dataset.
What is t-SNE? t-Distributed Stochastic Neighbor Embedding (t-SNE) is an unsupervised, non-linear technique primarily used for data exploration and visualizing high-dimensional data. In simpler terms, t-SNE gives you a feel or intuition of how the data is arranged in a high-dimensional space. It was developed by Laurens van der Maatens and Geoffrey Hinton in 2008.
t-SNE vs PCA If you’re familiar with Principal Components Analysis (PCA), then like me, you’re probably wondering the difference between PCA and t-SNE. The first thing to note is that PCA was developed in 1933 while t-SNE was developed in 2008. A lot has changed in the world of data science since 1933 mainly in the realm of compute and size of data. Second, PCA is a linear dimension reduction technique that seeks to maximize variance and preserves large pairwise distances. In other words, things that are different end up far apart. This can lead to poor visualization especially when dealing with non-linear manifold structures. Think of a manifold structure as any geometric shape like: cylinder, ball, curve, etc.
t-SNE differs from PCA by preserving only small pairwise distances or local similarities whereas PCA is concerned with preserving large pairwise distances to maximize variance. Laurens illustrates the PCA and t-SNE approach pretty well using the Swiss Roll dataset in Figure 1 [1]. You can see that due to the non-linearity of this toy dataset (manifold) and preserving large distances that PCA would incorrectly preserve the structure of the data.
Read more ... https://www.kdnuggets.com/2018/08/introduction-t-sne-python.html
Visualizing PCA with Leaf Dataset https://www.kaggle.com/selfishgene/visualizing-pca-with-leaf-dataset
Introduction to Decision Trees (Titanic dataset) https://www.kaggle.com/dmilla/introduction-to-decision-trees-titanic-dataset
Regression Types https://mindmajix.com/lasso-regression
Supervised Machine Learning: Classification https://towardsdatascience.com/supervised-machine-learning-classification-5e685fe18a6d
PyAstronomy https://www.hs.uni-hamburg.de/DE/Ins/Per/Czesla/PyA/PyA/funcFitDoc/tutorialMCMC.html
EMCEE http://dfm.io/emcee/current/user/line/
An Introduction to Feature Selection https://machinelearningmastery.com/an-introduction-to-feature-selection/
https://www.kaggle.com/elmadj/detect-credit-card-fraud-using-logistic-regression/notebook https://www.kaggle.com/theonarh/kernels/notebooks/new?forkParentScriptVersionId=2167747 https://www.data-blogger.com/2017/06/15/fraud-detection-a-simple-machine-learning-approach/