Category Archives: Data Science

Gradient Boosting Algorithm: Concepts, Example

If you are a data scientist or machine learning engineer, you likely know the gradient boosting algorithm (GBA) as one of the most powerful algorithms for predicting results from data. It often improves prediction accuracy over a single model and is becoming increasingly popular among data scientists. Let’s take a closer look at GBA and explore how it works with an example.

What is a Gradient Boosting Algorithm?

The gradient boosting algorithm is a machine learning technique used to build predictive models. It creates an ensemble of weak learners, meaning that it combines several smaller, simpler models in order to obtain a more accurate prediction than what an …

Posted in Data Science, Machine Learning.
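
The ensemble-of-weak-learners idea in the gradient boosting excerpt above can be sketched in plain Python. This is a minimal illustration, not the post's own example: it assumes a one-dimensional regression problem, squared-error loss, and decision stumps as the weak learners, and the names `fit_stump`, `fit_gradient_boosting`, and `predict_boosted` as well as the toy data are hypothetical.

```python
def fit_stump(xs, residuals):
    """Fit a one-split regression stump that minimizes squared error."""
    best = None
    # Skip the largest value so the right-hand side is never empty.
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = (sum((r - left_mean) ** 2 for r in left)
                 + sum((r - right_mean) ** 2 for r in right))
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    return best[1:]  # (threshold, left_mean, right_mean)


def predict_stump(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean


def fit_gradient_boosting(xs, ys, n_rounds=50, learning_rate=0.1):
    """Start from the mean, then repeatedly fit a stump to the residuals."""
    base = sum(ys) / len(ys)
    predictions = [base] * len(ys)
    stumps, errors = [], []
    for _ in range(n_rounds):
        # For squared-error loss, the negative gradient is the residual.
        residuals = [y - p for y, p in zip(ys, predictions)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        predictions = [p + learning_rate * predict_stump(stump, x)
                       for p, x in zip(predictions, xs)]
        errors.append(sum((y - p) ** 2 for y, p in zip(ys, predictions)) / len(ys))
    model = {"base": base, "learning_rate": learning_rate, "stumps": stumps}
    return model, errors


def predict_boosted(model, x):
    return model["base"] + model["learning_rate"] * sum(
        predict_stump(stump, x) for stump in model["stumps"])


# Hypothetical toy data: low targets for x <= 4, high targets for x > 4.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [1.0, 1.2, 0.9, 1.1, 3.0, 3.2, 2.9, 3.1]
model, errors = fit_gradient_boosting(xs, ys)
print(errors[0], errors[-1])  # training error at the first and last round
```

Each round adds a small correction fitted to what the current ensemble still gets wrong, which is why the training error in `errors` shrinks as rounds accumulate.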

Drivetrain Approach for Machine Learning

In this post, you will learn about a popular methodology called the Drivetrain approach, coined by Jeremy Howard. The approach provides steps for designing data products that deliver actionable outcomes using one or more machine learning models. It is useful for data scientists and machine learning enthusiasts at all levels, and it is an especially good guide for data science architects whose key responsibilities include designing data products. Without further ado, let’s do a deep dive.

Why Drivetrain Approach?

Before getting into the Drivetrain approach and its basic concepts, let’s understand why the Drivetrain approach is needed in the first place. …

Posted in Data Science, Machine Learning.

Machine Learning Models Evaluation Techniques

Machine learning is a powerful technique for developing predictive models for different types of data. It has become the backbone of many intelligent applications, and evaluating model performance at regular intervals is key to the success of such applications. A machine learning model’s performance depends on several factors, including the type of algorithm used and how well it was trained. In this blog post, we will discuss essential techniques for evaluating machine learning model performance, along with some best practices for working with machine learning models. The following are different techniques that can be used for evaluating machine learning …

Posted in Data Science, Machine Learning.

Data Preprocessing Steps in Machine Learning

Data preprocessing is an essential step in any machine learning project. By cleaning and preparing your data, you can ensure that your machine learning model is as accurate as possible. In this blog post, we’ll cover some of the most important and common data preprocessing steps that every data scientist should know.

Replace/remove missing data

Before building a machine learning model, it is important to preprocess the data and remove or replace any missing values. Missing data can cause problems with the model, such as biased results or inaccurate predictions. There are a few different ways to handle missing data, but the best approach depends on the situation. In some …

Posted in Data Science, Machine Learning.
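
The two options named in the preprocessing excerpt above, removing or replacing missing values, can be sketched in plain Python. This is only an illustration under stated assumptions: missing values are represented as `None`, replacement uses the column mean (one of several possible strategies), every column has at least one observed value, and the helper names and sample rows are hypothetical.

```python
def drop_missing(rows):
    """Remove every row that contains at least one missing value."""
    return [row for row in rows if None not in row]


def impute_mean(rows):
    """Replace each missing value with the mean of its column's observed values."""
    n_cols = len(rows[0])
    means = []
    for col in range(n_cols):
        observed = [row[col] for row in rows if row[col] is not None]
        means.append(sum(observed) / len(observed))
    return [[means[col] if value is None else value
             for col, value in enumerate(row)]
            for row in rows]


# Hypothetical rows of (age, income); None marks a missing value.
rows = [[25, 50000.0], [None, 62000.0], [35, None], [45, 58000.0]]
complete_rows = drop_missing(rows)
imputed_rows = impute_mean(rows)
print(len(complete_rows), imputed_rows[1][0])
```

Dropping rows is simple but discards data, while mean imputation keeps every row at the cost of shrinking the column's variance, which is one reason the best approach depends on the situation.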

Resume Screening using Machine Learning & NLP

In today’s job market, there are many qualified candidates vying for the same position. So, how do you weed out the applicants who are not a good fit for your company? One way is to use machine learning and natural language processing (NLP) to screen resumes, which lets you more efficiently identify candidates who have the skills and qualifications you are looking for. In this blog, we will learn about different aspects of screening and shortlisting candidates for further processing using machine learning and NLP techniques.

Key Challenges for Resume Screening / Shortlisting

Resume screening is the process …

Posted in Data Science.

Bagging vs Boosting Machine Learning Methods

In machine learning, there are a variety of methods that can be used to improve the performance of your models. Two of the most popular methods are bagging and boosting. In this blog post, we’ll take a look at what these methods are and how they work with the help of examples.

What is Bagging?

Bagging, short for “bootstrap aggregating”, is a method that can be used to improve the accuracy of your machine learning models. The idea behind bagging is to train multiple models on different subsets of the data and then combine the predictions of those models. The data is split into a number of smaller datasets, or …

Posted in Data Science, Machine Learning.
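
The bagging idea in the excerpt above, training several models on different subsets and combining their predictions, can be sketched in plain Python. This is a minimal illustration with assumptions of its own: each subset is a bootstrap sample drawn with replacement (the "bootstrap" in bootstrap aggregating), the base model is a least-squares line, and predictions are combined by averaging. The function names and data points are hypothetical.

```python
import random


def fit_line(points):
    """Least-squares line through (x, y) points; returns (slope, intercept)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in points)
    if var_x == 0:
        # Degenerate sample (all x equal): fall back to a constant model.
        return 0.0, mean_y
    slope = sum((x - mean_x) * (y - mean_y) for x, y in points) / var_x
    return slope, mean_y - slope * mean_x


def fit_bagging(points, n_models=25, seed=0):
    """Train one model per bootstrap sample (same size, drawn with replacement)."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        sample = [rng.choice(points) for _ in points]
        models.append(fit_line(sample))
    return models


def predict_bagging(models, x):
    """Combine the individual models by averaging their predictions."""
    return sum(slope * x + intercept for slope, intercept in models) / len(models)


# Hypothetical noisy points scattered around y = 2x + 1.
points = [(0, 1.2), (1, 2.9), (2, 5.1), (3, 6.8), (4, 9.3), (5, 10.9)]
models = fit_bagging(points)
print(predict_bagging(models, 6))
```

Averaging over many resampled fits reduces the variance of the final prediction, which is the main benefit bagging offers for unstable base models.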

Healthcare Claims Processing AI Use Cases

In recent years, artificial intelligence (AI) and machine learning (ML) have begun to revolutionize many industries, and healthcare is no exception. Hospitals and insurance companies are now using AI to automate various tasks in the healthcare claims processing workflow. Claims processing is a complex and time-consuming task that often requires manual intervention. By using AI to automate claims processing, healthcare organizations can reduce costs, improve accuracy, and speed up the claims adjudication process. In this blog post, we will explore some of the most common AI / machine learning use cases for healthcare claims processing.

Automated Data Entry

One of the most time-consuming tasks in the claims process is …

Posted in AI, Data Science, Healthcare, Machine Learning.

Checklist for Training Deep Learning Models

Deep learning is a powerful tool for solving complex problems, but it can be difficult to get started. In this blog post, we’ll provide a checklist of things to keep in mind when training and evaluating deep learning models and deciding whether they are suitable to deploy in production. By following this checklist, you can ensure that your models are well-trained and ready to tackle real-world tasks.

Validation of data distribution

The distribution of data can have a significant impact on the performance of deep learning models. When training a model, it is important to ensure that the training data is representative of the distribution of the data that …

Posted in Data Science, Deep Learning, Machine Learning.

List of Machine Learning Topics for Learning

Are you looking for a list of machine learning topics to learn more about? If so, you’ve come to the right place. In this post, we will share a variety of machine learning topics that you can explore to boost your knowledge and skills. So, whether you’re a data scientist or machine learning engineer, there’s something here for everyone. The following is a list of topics which can be taken up for learning and mastering artificial intelligence / machine learning: Introduction to data science; Introduction to machine learning (check out this detailed post on machine learning concepts & examples); Introduction to deep learning; Introduction to reinforcement learning; Introduction to linear …

Posted in Books, Career Planning, Data Science, Machine Learning.

What are Features in Machine Learning?

Machine learning is a field concerned with the design and development of algorithms and models that allow computers to learn without being explicitly programmed. It has many applications, including regression, classification, clustering, natural language processing, audio and video processing, and computer vision. Machine learning requires training one or more models using different algorithms. Check out this detailed post on machine learning concepts – What is Machine Learning? Concepts & Examples. One of the most important aspects of building a machine learning model is identifying the features that will help create a great model, that is, a model that performs well on unseen data. …

Posted in Data Science, Machine Learning.

K-Nearest Neighbors (KNN) Python Examples

If you’re working on data analytics projects that include building machine learning (ML) models, you’ve probably heard of the K-nearest neighbors (KNN) algorithm. But what is it, exactly? And more importantly, how can you use it in your own AI / ML projects? In this post, we’ll take a closer look at the KNN algorithm and walk through a simple Python example using Sklearn. The K-nearest neighbors algorithm is used for solving both classification and regression machine learning problems.

Introduction to K-Nearest Neighbors (K-NN) Algorithm

K-nearest neighbors is a supervised machine learning algorithm for classification and regression. In both cases, the input consists …

Posted in Data Science, Machine Learning, Python.
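
The KNN entry above refers to a Sklearn example that the excerpt does not include. As a library-free illustration of the underlying idea for classification, the sketch below assumes Euclidean distance and a simple majority vote among the `k` closest training points. The function name `knn_predict` and the sample points are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical 2-D points forming two well-separated groups.
train_points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.5),
                (6.0, 6.0), (6.5, 7.0), (7.0, 6.5)]
train_labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(train_points, train_labels, (1.8, 1.6)))
print(knn_predict(train_points, train_labels, (6.2, 6.4)))
```

For regression, the same neighbor search applies, but the prediction would be the average of the neighbors' target values rather than a vote.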

Predicting Customer Churn with Machine Learning

Customer churn, also known as customer attrition, is a major problem for businesses that rely on recurring revenue. Customer churn costs businesses billions of dollars every year, and it’s only getting worse as customers become more and more fickle. In fact, it’s been estimated that the average company loses 10-15% of its customers each year. That number may seem small, but it can have a huge impact on a company’s bottom line. Fortunately, there’s a way to combat churn: by using machine learning to predict which customers are likely to churn. In this blog post, we’ll discuss how customer churn prediction works and why it’s so important. We’ll also provide …

Posted in AI, Data Science, Machine Learning.

Stacking Classifier Sklearn Python Example

In this blog post, we will go over a very simple example of how to train a stacking classifier in Python using the Sklearn library, and learn the concepts behind stacking classifiers. A stacking classifier is an ensemble learning method that combines multiple classification models to create one “super” model. This can often lead to improved performance, since the combined model can learn from the strengths of each individual model.

What are Stacking Classifiers?

Stacking is a machine learning ensemble technique that combines multiple models to form a single powerful model. The individual models are trained on different subsets of the data using some type of …

Posted in Data Science, Machine Learning, Python.

Decision Tree Hyperparameter Tuning Grid Search Example

The output prints the grid search results across different hyperparameter values, the model score with the best hyperparameters, and the best hyperparameter values. In the above code, the decision tree model is trained and evaluated for each combination of hyperparameter values, and the combination that results in the best performance is chosen. In this case, “best performance” could be defined as either accuracy or AUC (area under the curve). Once we’ve found the best-performing combination of hyperparameters, we can then train our final model using those values and deploy it to production.

Conclusion

In this blog post, we explored how to use grid search to tune the hyperparameters of a Decision …

Posted in Data Science, Machine Learning, Python.
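
The code that the grid search excerpt above refers to is not part of this listing. The search loop it describes, scoring every combination of hyperparameter values and keeping the best one, can be sketched generically in plain Python. This is an assumption-laden illustration: `grid_search` and `toy_evaluate` are hypothetical names, the grid values are made up, and `toy_evaluate` is a stand-in formula where a real run would train a decision tree and return its accuracy or AUC.

```python
from itertools import product


def grid_search(param_grid, evaluate):
    """Score every combination in `param_grid` and return the best one."""
    names = list(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = evaluate(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_evaluate(params):
    # Hypothetical stand-in for "train a decision tree and score it".
    # It simply peaks at max_depth=4 and min_samples_leaf=5.
    return (1.0
            - 0.02 * abs(params["max_depth"] - 4)
            - 0.01 * abs(params["min_samples_leaf"] - 5))


# Illustrative grid over two commonly tuned decision tree hyperparameters.
param_grid = {"max_depth": [2, 4, 6, 8], "min_samples_leaf": [1, 5, 10]}
best_params, best_score = grid_search(param_grid, toy_evaluate)
print(best_params, best_score)
```

The number of evaluations is the product of the list lengths (here 4 × 3 = 12), so the cost of an exhaustive grid search grows quickly as more hyperparameters are added.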

Reinforcement Learning Real-world examples

In this blog post, we’ll learn about some real-world examples of reinforcement learning, one of the main approaches to machine learning alongside supervised and unsupervised learning. Reinforcement learning is a type of machine learning that enables a computer system to learn how to make choices by being rewarded for its successes. This can be an extremely powerful tool for optimization and decision-making, and it is one of the most popular machine learning methods used today. Before looking into the real-world examples, let’s quickly understand what reinforcement learning is.

Introduction to Reinforcement Learning (RL)

Reinforcement learning is an approach to machine learning in which the agents …

Posted in Data Science, Machine Learning.

Passive Aggressive Classifier: Concepts & Examples

The passive aggressive classifier is a machine learning algorithm that is used for classification tasks. This algorithm is a modification of the standard Perceptron algorithm. The passive aggressive classifier was first proposed in 2006 by Crammer et al. as a way to improve the performance of the Perceptron algorithm on linearly separable data sets. In this blog, we will learn about the basic concepts and principles behind the passive aggressive classifier, as well as some examples of its use in real-world applications.

What is the passive aggressive classifier and how does it work?

The passive aggressive classifier algorithm falls under the category of online learning algorithms, can handle large datasets, …

Posted in Data Science.
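
The online, one-example-at-a-time behavior of the passive aggressive classifier described above can be sketched in plain Python. This is a minimal illustration rather than the post's own code: it assumes binary labels in {-1, +1}, the hinge loss, and the PA-I variant in which the step size is capped by an aggressiveness parameter `C`. The function names and the small example stream are hypothetical, and each input carries a constant 1.0 as a bias feature.

```python
def pa_update(weights, x, y, C=1.0):
    """One passive aggressive (PA-I) update for a single example (x, y)."""
    margin = y * sum(w * xi for w, xi in zip(weights, x))
    loss = max(0.0, 1.0 - margin)  # hinge loss
    if loss == 0.0:
        return weights  # passive: margin is already large enough
    norm_sq = sum(xi * xi for xi in x)
    tau = min(C, loss / norm_sq)  # aggressive: step size capped at C
    return [w + tau * y * xi for w, xi in zip(weights, x)]


def pa_predict(weights, x):
    return 1 if sum(w * xi for w, xi in zip(weights, x)) >= 0 else -1


# Hypothetical stream of (features, label); the last feature is a bias term.
stream = [([2.0, 1.0, 1.0], 1), ([-1.5, -2.0, 1.0], -1),
          ([1.0, 2.5, 1.0], 1), ([-2.0, -1.0, 1.0], -1)]

weights = [0.0, 0.0, 0.0]
for _ in range(5):  # a few passes over the stream
    for x, y in stream:
        weights = pa_update(weights, x, y)

print([pa_predict(weights, x) for x, _ in stream])
```

Because each update touches only the current example, the model never needs the whole dataset in memory, which is what makes this family of algorithms suitable for large or streaming datasets.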