Classification

Process of grouping unlabeled data based on differences and similarities

Classifications is about classifying labelled data, clustering is about finding the similarities and classifying them ad-hoc

Algorithms

  • K-means
  • DBScan
  • K-medoids
  • Density based
  • Hierarchical

Association

Process of finding relationship between variables via association, such as finding most bought together items.

Algorithms

  • Apriori
  • Euclat
  • Fp-Growth

Dimentionality Reduction

Often used in pre-processing state, reducing the amount of data while retaining integrity

Algorithms Principal Component Analysis (PCA) Linear Discriminant Analysis (LDA) Generalized Discriminant Analysis (GDA) Singular Value Decomposition (SVD) Latent Dirichlet allocation (LDA) Latent Semantic Analysis (LSA, pLSA, GLSA) t-SNE