Tag Archives: Data Science

Beta Distribution Example for Cricket Score Analysis

December 10, 2019 by Ajitesh Kumar · Leave a comment

virat kohli score probability using beta distribution

This post represents a real-world example of Binomial and Beta probability distribution from the sports field. In this post, you will learn about how the run scored by a Cricket player could be modeled using Binomial and Beta distribution. Ever wanted to predict the probability of Virat Kohli scoring a half-century in a particular match. This post will present a perspective on the same by using beta distribution to model the probability of runs that can be scored in a match. If you are a data scientist trying to understand beta and binomial distribution with a real-world example, this post will turn out to be helpful. First and foremost, let’s identify the random variable that we would like …

Continue reading →

Posted in AI, Data Science, Machine Learning, statistics. Tagged with ai, Data Science, machine learning, statistics.

How to Print Unique Values in Pandas Dataframe Columns

December 7, 2019 by Ajitesh Kumar · Leave a comment

print unique column values in Pandas dataframe

A quick post representing code sample on how to print unique values in Dataframe columns in Pandas. Here is a data frame comprising of oil prices on different dates which column such as year comprising of repeated/duplicate value of years. In the above data frame, the requirement is to print the unique value of year column. Here is the code for same. Note the method unique()

Posted in AI, Data Science, Machine Learning, News, Python. Tagged with Data Science, machine learning, python.

Pandas – How to Extract Month & Year from Datetime

December 7, 2019 by Ajitesh Kumar · Leave a comment

how to extract month and year from datetime

This is a quick post representing code sample related to how to extract month & year from datetime column of DataFrame in Pandas. The code sample is shown using the sample data, BrentOilPrices downloaded from this Kaggle data page. Here is the code to load the data frame. Check the data type of the data using the following code: The output looks like the following: Date object Price float64 dtype: object Use the following command to change the date data type from object to datetime and extract the month and year. Printing data using head command would print the following:

Posted in Data Science, Machine Learning, News. Tagged with ai, Data Science, machine learning.

Difference between Machine Learning & Traditional Software

October 30, 2019 by Ajitesh Kumar · Leave a comment

difference traditional software machine learning

In this post, we will understand what are some of the key differences between machine learning models and traditional/conventional software. S.No Traditional Software Machine Learning 1 In traditional software, the primary objective is to meet functional and non-functional requirements. In machine learning models, the primary goal is to optimize the metric (accuracy, precision/recall, RMSE, etc) of the models. Every 0.1 % improvement in the model metrics could result in significant business value creation. 2 The quality of the software primary depends on the quality of the code. The quality of the model depends upon various parameters which are mainly related to the input data and hyperparameters tuning. 3 Traditional software …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, Data Science, machine learning.

Neural Network Architecture for Text-to-Speech Synthesis

July 29, 2019 by Ajitesh Kumar · Leave a comment

In this post, you would learn about a neural network reference solution architecture which could be used to convert the text to speech. The neural network solution architecture given in this post is based on deep learning (autoencoder network (encoder-decoder) with attention). Neural Network Reference Architecture for Text-to-Speech Synthesis In the solution architecture diagram (figure 1) depicted below, the following is described: Sentences are first converted into character embeddings. Character embeddings are numeric representations of words. Numeric representations of each of the words could be used to create numeric representations of higher-level representations like sentences/paragraphs/documents/etc. Character embeddings are next fed into recurrent sequence-to-sequence feature prediction network with attention. The sequence-to-sequence …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, artificial intelligence, Data Science, machine learning.

Reverse Image Search using Deep Learning (CNN)

July 27, 2019 by Ajitesh Kumar · Leave a comment

In this post, you will learn about a solution approach for searching similar images out of numerous images matching an input image (query) using machine learning / deep learning technology. This is also called a reverse image search. The image search is generally searching for images based on keywords. Here are the key components of the solution for reverse image search: A database of storing images with associated numerical vector also called embeddings. A deep learning model based on convolutional neural network (CNN) for creating numerical feature vectors (aka embeddings) for images A module which searches embeddings of an input image (query) from the image database based on the nearest neighbor …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, artificial intelligence, Data Science, machine learning.

Why Data Scientists Must Learn Statistics?

July 27, 2019 by Ajitesh Kumar · Leave a comment

In order to understand the need for data scientists to be very good at the statistical concepts, one needs to clearly understand some of the following: Who are data scientists? What is the need for statistics in data scientists’ day-to-day work? Who are Data Scientists? Data Scientists are the primarily Scientists who do experiments to find some of the following: Whether there exists a relationship between data Whether the function approximated (machine learning or statistical learning model) from a given sample of data could be generalized for the entire population In case there are multiple function approximations for predicting outcomes given a set of input, which one of the function approximation …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, artificial intelligence, Data Science, machine learning.

When not to use F-Statistics for Multi-linear Regression

July 16, 2019 by Ajitesh Kumar · Leave a comment

In this post, you will learn about the scenario in which you may NOT want to use F-Statistics for doing the hypothesis testing on whether there is a relationship between response and predictor variables in the multilinear regression model. Multilinear regression is a machine learning / statistical learning method which is used to predict the quantitative response variable and also understand/infer the relationship between the response and multiple predictor variables. We will look into the following topics: Background When not to use F-Statistics for Multilinear Regression Model Background F-statistics is used in hypothesis testing for determining whether there is a relationship between response and predictor variables in multilinear regression models. Let’s consider …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, Data Science, machine learning.

Machine Learning – Cloud-native Model Deployments

June 29, 2019 by Ajitesh Kumar · Leave a comment

In this post, we are going to learn about the cloud-native machine learning model deployments. Cloud-native Deployments First and foremost, let’s understand the meaning of cloud-native deployments? If we are building an application or a service and we can deploy this application or the service on any cloud platform without much ado, it could be said as cloud-native deployment. And the way it is made possible is through the container technologies such as Dockers. What basically is required to be done is to wrap the applications or the services within the containers and move the containers images onto the cloud services such as AWS ECS, AWS EKS or Google Kubernetes …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with AWS, cloud computing, cloud native, Data Science, machine learning.

What, When & Why of Regularization in Machine Learning?

June 2, 2019 by Ajitesh Kumar · Leave a comment

In this post, we will try and understand some of the following in relation to regularizing the regression machine learning models to achieve higher accuracy and stable models: Background What is regularization? Why & when does one need to adopt/apply the regularization technique? Background At times, when one is building a multi-linear regression model, one uses the least squares method for estimating the coefficients of determination or parameters for features. As a result, some of the following happens: Often, the regression model fails to generalize on unseen data. This could happen when the model tries to accommodate for all kind of changes in the data including those belonging to both …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, artificial intelligence, Data Science, machine learning.

Unit Tests & Data Coverage for Machine Learning Models

May 11, 2019 by Ajitesh Kumar · Leave a comment

Unit testing for Machine Learning Models

This post represents thoughts on what would it look like planning unit tests for machine learning models. The idea is to perform automated testing of ML models as part of regular builds to check for regression related errors in terms of whether the predictions made by certain set of input data vectors does not match with expected outcomes. This brings up some of the following topics for discussion: Why unit testing for machine learning models? What would unit tests for machine learning models mean? Data coverage or code coverage? Why unit testing for Machine Learning models? Once a model is built, the challenge is to monitor the performance metrics of the models …

Continue reading →

Posted in AI, Data Science, Machine Learning, QA, Testing. Tagged with ai, artificial intelligence, Data Science, machine learning, Unit Testing.

Machine Learning Cheat sheet (Stanford)

March 23, 2019 by Ajitesh Kumar · Leave a comment

Here is a great set of cheat sheet on some of the following topics: Supervised learning Unsupervised learning Deep learning Probability and statistics Linear algebra Tips and tricks including performance metrics https://stanford.edu/~shervine/teaching/cs-229/ Hope you liked the cheat sheets on different topics of machine learning and data science.

Posted in AI, Machine Learning. Tagged with Data Science, machine learning.

Machine Learning Models used in Facebook

March 3, 2019 by Ajitesh Kumar · Leave a comment

This post quickly represents machine learning projects and related machine learning models. The above diagram represents the usage of the following learning algorithms: Support Vector Machines (SVM) Gradient-boosted decision trees Multi-layer Perceptron (MLP): Used for ranking and personalizing news feeds, ads, search etc. Convolutional neural networks (CNN): Recurrent neural networks (RNN): Used for language translation, speech recognition, content understanding References

Posted in AI, Data Science, Machine Learning. Tagged with ai, Data Science, machine learning.

13 Programming Languages used for Machine Learning

January 6, 2019 by Ajitesh Kumar · Leave a comment

In this post, you will learn about different programming languages which can be used to create (train) machine learning models to solve supervised and unsupervised learning problems. Here are the top 13 programming languages used for machine learning: R Language: R is one of the most popular programming language and environment for statistical computing and graphics. Python: There are some of the following Python libraries which makes it easy to create machine learning/deep learning models: Scikit-learn library (Classical machine learning models): Packages such as NumPy, SciPy, Pandas are very useful and helpful in creating supervised and unsupervised learning models. Deep learning models using python libraries provided by Tensorflow, PyTorch, Theanos, CNTK, …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, artificial intelligence, Data Science, machine learning.

Andrew NG Machine Learning Coursera Videos

December 23, 2018 by Ajitesh Kumar · Leave a comment

In this post, you will get to know the list of Andrew NG Machine Learning Coursera Videos. Here is the information: Youtube playlist of machine learning videos which are same as that of Andrew NG machine learning course on Coursera. One could use Internet Download Manager (IDM) to download these videos. Use Coursera-dl script found on Github to download the machine learning course. The script makes it easier to batch download lecture resources (e.g., videos, ppt, etc) for Coursera classes. Given one or more class names and account credentials, it obtains week and class names from the lectures page, and then downloads the related materials into appropriately named files and directories. Use AcademicTorrents website …

Continue reading →

Posted in AI, Data Science, Machine Learning. Tagged with ai, artificial intelligence, Data Science, machine learning.

I found it very helpful. However the differences are not too understandable for me

Very Nice Explaination. Thankyiu very much,

in your case E respresent Member or Oraganization which include on e or more peers?

Such a informative post. Keep it up

Thank you....for your support. you given a good solution for me.

Tag Archives: Data Science

Beta Distribution Example for Cricket Score Analysis

How to Print Unique Values in Pandas Dataframe Columns

Pandas – How to Extract Month & Year from Datetime

Difference between Machine Learning & Traditional Software

Neural Network Architecture for Text-to-Speech Synthesis

Reverse Image Search using Deep Learning (CNN)

Why Data Scientists Must Learn Statistics?

When not to use F-Statistics for Multi-linear Regression

What, When & Why of Regularization in Machine Learning?

Unit Tests & Data Coverage for Machine Learning Models

Machine Learning Cheat sheet (Stanford)

Machine Learning Models used in Facebook

13 Programming Languages used for Machine Learning

Top 5 Machine Learning Introduction Slides for Beginners

Andrew NG Machine Learning Coursera Videos

Recent Posts

Data Science / AI Trends

Free Online Tools

Newsletter

Recent Comments

Tag Archives: Data Science

Recent Posts

Data Science / AI Trends

Free Online Tools

Newsletter

Tag Cloud

Recent Comments