Categories: Big Data

Data Science – List of Key Machine Learning Algorithms

This article represents a list of key machine learning algorithms which are most widely used by data scientists while doing data analysis. Please feel free to comment/suggest if I missed to mention one or more important points. Also, sorry for the typos.
The list of machine learning algorithms presented below covers some of the most important and widely used algorithms which could set a stage for one to get started with data science/analytics and create models for predictions. Following are two high level classifications in which these machine learning algorithms fall under:
  • Supervised learning
  • Unsupervised learning

Following are some of the key tasks that are performed by machine learning algorithms falling under supervised and unsupervised learning:

  • Classification: Place input data in a set of discreet categories. For example, whether an input data related cancerous cells represents benign or malignant cancer, or whether a hockey team will win or not, or whether a person will default on loan or not. Read further about classification on wikipedia page for classification
  • Regression: Predict numeric value of target feature based on input values of other features. Read further details on wikipedia page for regression analysis
  • Clustering: Divide input dataset into one or more homogenous groups. This is sometimes used for segmentation analysis which identifies groups of individuals with similar purchasing, donating, or demographic information such that promotion campaigns can be tailored to particular group of people. Read greater details about cluste analysis on wikipedia page
  • Pattern Detection: Focuses on the recognition of patterns and regularities in data

Following is a list of algorithms listed under above categories:

  • Supervised learning: Following algorithms can be used for performing classification and numerical value prediction tasks:
    • Classification tasks:
      • Nearest Neighbor
      • naive Bayes
      • Decision trees
      • Classification rule learners
      • Neural networks
      • Support vector machine
    • Numeric prediction tasks:
      • Linear regression
      • Regression trees
      • Model trees
      • Neural networks
      • Support vector machine
  • Unsupervised learning: Following algorithms could be used for patterm detection and clustering tasks.
    • Association rules (pattern detection)
    • k-means clustering (clustering)
Ajitesh Kumar

I have been recently working in the area of Data analytics including Data Science and Machine Learning / Deep Learning. I am also passionate about different technologies including programming languages such as Java/JEE, Javascript, Python, R, Julia, etc, and technologies such as Blockchain, mobile computing, cloud-native technologies, application security, cloud computing platforms, big data, etc. For latest updates and blogs, follow us on Twitter. I would love to connect with you on Linkedin. Check out my latest book titled as First Principles Thinking: Building winning products using first principles thinking. Check out my other blog, Revive-n-Thrive.com

Recent Posts

Feature Selection vs Feature Extraction: Machine Learning

Last updated: 2nd May, 2024 The success of machine learning models often depends on the…

18 hours ago

Model Selection by Evaluating Bias & Variance: Example

When working on a machine learning project, one of the key challenges faced by data…

24 hours ago

Bias-Variance Trade-off in Machine Learning: Examples

Last updated: 1st May, 2024 The bias-variance trade-off is a fundamental concept in machine learning…

2 days ago

Mean Squared Error vs Cross Entropy Loss Function

Last updated: 1st May, 2024 As a data scientist, understanding the nuances of various cost…

2 days ago

Cross Entropy Loss Explained with Python Examples

Last updated: 1st May, 2024 In this post, you will learn the concepts related to…

2 days ago

Logistic Regression in Machine Learning: Python Example

Last updated: 26th April, 2024 In this blog post, we will discuss the logistic regression…

7 days ago