Categories of Machine Learning systems

#supervised #learning #unsupervised #semisupervised #reinforcement

Akash Deep Dec 28 2021 · 2 min read
Share this

Machine Learning systems can be classified according to the amount and type of supervision they get during training.

There are four major categories:

  • supervised learning
  • unsupervised learning
  • semisupervised learning
  • Reinforcement Learning
  • Supervised learning

      In supervised learning, the training set you feed to the algorithm includes the desired solutions, called labels.

    A labeled training set for spam classification (an example of supervised learning)

      A typical supervised learning task is classification.

    Example:

    The spam filter is a good example of this: it is trained with many example emails along with their class (spam or ham), and it must learn how to classify new emails.  

      Another typical task is to predict a target numeric value, such as the price of a car, given a set of features (mileage, age, brand, etc.) called predictors. This sort of task is called regression .

    To train the system, you need to give it many examples of cars, including both their predictors and their labels (i.e., their prices).  

      Note that some regression algorithms can be used for classification as well, and vice versa.  

      Here are some of the most important supervised learning algorithms:

  • k-Nearest Neighbors
  • Linear Regression
  • Logistic Regression
  • Support Vector Machines (SVMs)
  • Decision Trees and Random Forests
  • Neural networks
  • Unsupervised learning

    In unsupervised learning, the training data is unlabeled.

    An unlabeled training set for unsupervised learning

      Here are some of the most important unsupervised learning algorithms :

  • ClusteringK-MeansDBSCANHierarchical Cluster Analysis (HCA)
  • K-Means
  • DBSCAN
  • Hierarchical Cluster Analysis (HCA)
  • Anomaly detection and novelty detectionOne-class SVMIsolation Forest
  • One-class SVM
  • Isolation Forest
  • Visualization and dimensionality reductionPrincipal Component Analysis (PCA)Kernel PCALocally Linear Embedding (LLE)t-Distributed Stochastic Neighbor Embedding (t-SNE)
  • Principal Component Analysis (PCA)
  • Kernel PCA
  • Locally Linear Embedding (LLE)
  • t-Distributed Stochastic Neighbor Embedding (t-SNE)
  • Association rule learningAprioriEclat
  • Apriori
  • Eclat
  • Example:

    Important unsupervised task is anomaly detection—for example, detecting unusual credit card transactions to prevent fraud, catching manufacturing defects, or automatically removing outliers from a dataset before feeding it to another learning algorithm. The system is shown mostly normal instances during training, so it learns to recognize them; then, when it sees a new instance, it can tell whether it looks like a normal one or whether it is likely an anomaly .

    Semisupervised learning

      Since labeling data is usually time-consuming and costly, you will often have plenty of unlabeled instances, and few labeled instances. Some algorithms can deal with data that’s partially labeled. This is called semisupervised learning  

    Example:

      Some photo-hosting services, such as Google Photos, are good examples of this. Once you upload all your family photos to the service, it automatically recognizes that the same person A shows up in photos 1, 5, and 11, while another person B shows up in photos 2, 5, and 7. This is the unsupervised part of the algorithm (clustering). Now all the system needs is for you to tell it who these people are. Just add one label per person and it is able to name everyone in every photo, which is useful for searching photos.  

    Reinforcement Learning

      Reinforcement Learning is a very different beast.

    The learning system, called an agent in this context, can observe the environment, select and perform actions, and get rewards in return or penalties in the form of negative rewards, as shown in figure. It must then learn by itself what is the best strategy, called a policy, to get the most reward over time. A policy defines what action the agent should choose when it is in a given situation.  

    Reinforcement Learning

    Example:

    Many robots implement Reinforcement Learning algorithms to learn how to walk.

    DeepMind’s AlphaGo program is also a good example of Reinforcement Learning: it made the headlines in May 2017 when it beat the world champion Ke Jie at the game of Go. It learned its winning policy by analyzing millions of games, and then playing many games against itself. Note that learning was turned off during the games against the champion; AlphaGo was just applying the policy it had learned.

    Reference:-

    Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition

    By Aurélien Géron

    * * *
    Comments
    Read next