Categories: Big Data

Data Science – Quick Start Guide for Machine Learning

This article represents a very high-level information on different aspects of machine learning with an objective to present a quick-start read/guide for the data science beginners. One could grab one or more books on Machine Learning to learn the subject in detail. Please feel free to comment/suggest if I missed to mention one or more important points. Also, sorry for the typos.
Following are the key points described later in this article:
  • What is machine learning?
  • Key phases of machine learning
  • Prediction API model of machine learning

 

What is Machine Learning?

Simply speaking, Machine Learning is a set of artifical intelligence techniques which are used to solve one of the following problems based on the examples in hand:

  • Classification problems: Problems having “which type” as a question. For example, “which type” of email is this? (Spam or ham). In classification problem, one out of fixed number of answers are chosen.
  • Regression problems: Problems having “how much” as a question. For example, “how much” should be the price of a house in a given locality? Simply speaking, regression problems are related with numeric answer.

Machine learning is a key aspect of data science. It allows data scientist to apply existing data sets to one of the machine learning algorithm and predict based on that. In other words, a person wanting to become a data scientist must learn machine learning algorithms to be able to predict/recommend.

 

Key Phases of Machine Learning

Following are key steps in machine learning:

  • Training: Train the model
  • Prediction: Predict using the model, given an input data-set

 

Prediction API Model of Machine Learning

Above steps of machine learning could be represented using following, from API perspective. Thus, whether using R, or pyhton APIs, following is how the API structure would look like:

# Model created based on a given data set
model = createModelAPI(existingDataSet)
# Model is fed with new dataset, newDataSet which gives predicted output, predictedOutput
predictedOutput = createPredictionAPI( model, newDataSet )

Lets take an example of linear regression using R programming console. Look at the code below:

# Linear regression model
model = lm( price ~ carat, data=diamonds )
price = predict( model, newData )
# Multiple linear regression model
model = lm( price ~ carat + cut + color, data=diamonds )
price = predict( model, newData )

 

Ajitesh Kumar

I have been recently working in the area of Data analytics including Data Science and Machine Learning / Deep Learning. I am also passionate about different technologies including programming languages such as Java/JEE, Javascript, Python, R, Julia, etc, and technologies such as Blockchain, mobile computing, cloud-native technologies, application security, cloud computing platforms, big data, etc. I would love to connect with you on Linkedin. Check out my latest book titled as First Principles Thinking: Building winning products using first principles thinking.

Recent Posts

Agentic Reasoning Design Patterns in AI: Examples

In recent years, artificial intelligence (AI) has evolved to include more sophisticated and capable agents,…

1 month ago

LLMs for Adaptive Learning & Personalized Education

Adaptive learning helps in tailoring learning experiences to fit the unique needs of each student.…

2 months ago

Sparse Mixture of Experts (MoE) Models: Examples

With the increasing demand for more powerful machine learning (ML) systems that can handle diverse…

2 months ago

Anxiety Disorder Detection & Machine Learning Techniques

Anxiety is a common mental health condition that affects millions of people around the world.…

2 months ago

Confounder Features & Machine Learning Models: Examples

In machine learning, confounder features or variables can significantly affect the accuracy and validity of…

2 months ago

Credit Card Fraud Detection & Machine Learning

Last updated: 26 Sept, 2024 Credit card fraud detection is a major concern for credit…

2 months ago