You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The machine learning algorithm learns on a labeled dataset
152
155
We know the labels from the training set
153
156
157
+
KNN (k-nearest neighbors) and Support vector classifier (SVC) are supervised learning algorithms for classification.
158
+
154
159
## machine learning model
155
160
156
161
Once a machine learning model is built with a training set, it can be used to process new data points to make predictions or decisions
@@ -304,8 +309,6 @@ epsilon and minPoints remain the same while the algorithm is running.
304
309
305
310

306
311
307
-

308
-
309
312
## k-means clustering
310
313
311
314
k-means clustering splits N data points into K groups (called clusters).
@@ -332,8 +335,9 @@ The objective is to minimize the variance within each cluster.
332
335
Clusters are well separated from each other.
333
336
It maximizes the average inter-cluster distance.
334
337
338
+
k-means clusters tend to be of the same size. size refers to the area. size doesnt refer to the number od elements. Two clusters of the same area do not have to have the same number of elements (except if your data set has the same density)
335
339

336
-
340
+
The tendency of k-means to produce equal-sized clusters leads to bad results here
0 commit comments