Closed caglarmert closed 4 years ago
CLUSTERING
Unsupervised learning method
The task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group than those in other groups. In simple words, the aim is to segregate groups with similar traits and assign them into clusters.
Clustering is very much important as it determines the intrinsic grouping among the unlabeled data present. There are no criteria for a good clustering. It depends on the user, what is the criteria they may use which satisfy their need.
Divided into two subgroups :
Hard Clustering: In hard clustering, each data point either belongs to a cluster completely or not.
Soft Clustering: In soft clustering, instead of putting each data point into a separate cluster, a probability or likelihood of that data point to be in those clusters is assigned.
Clustering Methods :
Clustering Algorithms :
K-means clustering algorithm : It is the simplest unsupervised learning algorithm that solves clustering problem. K-means algorithm partition n observations into k clusters where each observation belongs to the cluster with the nearest mean serving as a prototype of the cluster .
1. Specify the desired number of clusters K
2. Randomly assign each data point to a cluster
3. Compute cluster centroids
4. Re-compute cluster centroids
5. Repeat steps 4 and 5 until no improvements are possible
CLASSIFICATION
Naive Bayes Naive Bayes is a probabilistic classifier in Machine Learning which is built on the principle of Bayes theorem. Naive Bayes classifier makes an assumption that one particular feature in a class is unrelated to any other feature and that is why it is known as naive.
Decision Tree Decision tree, as the name states, is a tree-based classifier in Machine Learning. You can consider it to be an upside-down tree, where each node splits into its children based on a condition.
Logistic Regression Logistic regression is a binary classification algorithm which gives out the probability for something to be true or false.
Support Vector Machine (SVM)
Yöntemleri için kullanılan algoritmaların tek cümlelik özetleri hazırlanacak.