Author Archives: Ajitesh Kumar

Ajitesh Kumar

I have been recently working in the area of Data analytics including Data Science and Machine Learning / Deep Learning. I am also passionate about different technologies including programming languages such as Java/JEE, Javascript, Python, R, Julia, etc, and technologies such as Blockchain, mobile computing, cloud-native technologies, application security, cloud computing platforms, big data, etc. I would love to connect with you on Linkedin. Check out my latest book titled as First Principles Thinking: Building winning products using first principles thinking.

SVM RBF Kernel Parameters: Python Examples

SVM RBF Kernel Parameters - Gamma and C values

Support vector machines (SVM) are a popular and powerful machine learning technique for classification and regression tasks. SVM models are based on the concept of finding the optimal hyperplane that separates the data into different classes. One of the key features of SVMs is the ability to use different kernel functions to model non-linear relationships between the input variables and the output variable. One such kernel is the radial basis function (RBF) kernel, which is a popular choice for SVMs due to its flexibility and ability to capture complex relationships between the input and output variables. The RBF kernel has two important parameters: gamma and C (also called regularization parameter). …

Continue reading

Posted in AI, Data Science, Machine Learning. Tagged with , , .

How to Identify Analytics Use Cases for Solving Business Problems

business problems to analytics use cases - Decisions - actions - output

In today’s data-driven world, data analytics has become a key aspect of business decision making. Organizations are increasingly relying on data analytics to gain insights into their operations and customers, in order to drive growth and profitability. However, the challenge for many businesses is not in understanding the importance of analytics, but in identifying the right use cases for their particular business problems, execute those use cases and deliver in a timely manner. This is where a structured approach to identifying analytics use cases becomes critical. In this blog post, we will explore how product managers and data scientists can work with business owners and identify analytics use cases that …

Continue reading

Posted in Data, Data analytics. Tagged with .

A/B Testing & Data Science Projects: Examples

A/B Testing in Data Science Projects

Today, when organization is aiming to become data-driven, it is imperative that their data science and product management teams understand the importance of using A/B testing technique for validating or supporting their decisions. A/B testing is a powerful technique that allows product management and data science teams to test changes to their products or services with a small group of users before implementing them on a larger scale. In data science projects, A/B testing can help measure the impact of machine learning models and the content driven based on the their predictions, and other data-driven changes. This blog explores the principles of A/B testing and its applications in data science. …

Continue reading

Posted in Data Science. Tagged with .

Data Science Careers: India’s Job Market & AI Growth

Aspiring data scientists and AI enthusiasts in India have a plethora of opportunities in store, thanks to the country’s booming AI, machine learning (ML), and big data analytics industry. According to a recent report by NASSCOM, India boasts the second-largest talent pool globally in these fields, with a remarkable AI skill penetration score of 3.09 [1]. The nation’s rapid growth in AI talent concentration and scientific publications underscores the immense potential for individuals looking to build a successful data science career in India. As the demand for skilled professionals surges, multiple factors contribute to the thriving industry. The higher-than-average compensation and growth prospects in the field make it an attractive …

Continue reading

Posted in AI, Data Science, Machine Learning, News. Tagged with , .

Data Analytics Explained: What, Why & How?

forms of data analytics

Data analytics has become a buzzword in the business world today, and for all good reasons indeed as it brings competitive advantage to the business if leveraged in the most appropriate manner. The ability to collect, process, and analyze large amounts of data in order to solve business problems has given organizations unprecedented insights into their operations, customers, and markets. By leveraging these insights, businesses can make informed decisions also called as data-driven decisions, identify new opportunities, and drive growth. But what exactly is data analytics? What are the different forms of data analytics? Why is it so important? And how can businesses leverage it to their advantage? How can …

Continue reading

Posted in Data, Data analytics. Tagged with , .

Quiz #86: Large Language Models Concepts

machine learning interview questions

In the ever-evolving field of data science, large language models (LLMs) have become a crucial component in natural language processing (NLP) and AI applications. As a data scientist, keeping up with the latest developments and understanding the core concepts of LLMs can give you a competitive edge, whether you’re working on cutting-edge projects or preparing for job interviews. In this quiz, we have carefully curated a set of questions that cover the essentials of large language models, including their purpose, architecture, types, applications, and more. By attempting this quiz, you’ll not only test your current knowledge but also solidify your understanding of LLM concepts. This will prove valuable when discussing …

Continue reading

Posted in Career Planning, Data Science, Interview questions, Quiz. Tagged with , , , .

Generative AI: Scaling Techniques for LLM Models

Scaling techniques for foundational LLMs

In the rapidly evolving world of artificial intelligence, large language models (LLMs) have emerged as a game-changing force, revolutionizing the way we interact with technology and transforming countless industries. These powerful models can perform a vast array of tasks, from text generation and translation to question-answering and summarization. However, unlocking the full potential of these LLMs requires a deep understanding of how to effectively scale these LLMs, ensuring optimal performance and capabilities. In this blog post, we will delve into the crucial concept of scaling techniques for LLM models and explore why mastering this aspect is essential for anyone working in the AI domain. As the complexity and size of …

Continue reading

Posted in AI, Deep Learning, Generative AI, Machine Learning, NLP.

Meta Unveils SAM and Massive SA-1B Dataset to Advance Computer Vision Research

segment anything model - segment anything dataset

Meta Researchers have, yesterday, unveiled a groundbreaking new model, namely Segment Anything Model (SAM), alongside an immense dataset, the Segment Anything Dataset (SA-1B), which together promise to revolutionize the field of computer vision. SAM’s unique architecture and design make it efficient and effective, while the SA-1B dataset provides a powerful resource to fuel future research and applications. The Segment Anything Model is an innovative approach to promptable segmentation that combines an image encoder, a flexible prompt encoder, and a fast mask decoder. Its design allows for real-time, interactive prompting in a web browser on a CPU, opening up new possibilities for computer vision applications. One of the key challenges SAM …

Continue reading

Posted in Deep Learning, Machine Learning, News. Tagged with , , .

Quiz #85: MSE vs R-Squared?

Python interview questions and answers

Regression models are an essential tool for data scientists and statisticians to understand the relationship between variables and make predictions about future outcomes. However, evaluating the performance of these models is a crucial step in ensuring their accuracy and reliability. Two commonly used metrics for evaluating regression models are Mean Squared Error (MSE) and R-squared. Understanding when to use each metric and how they differ can greatly improve the quality of your analyses. Check out my related blog on this topic – Mean Squared Error vs R-Squared? Which one to use? To help you test your knowledge on MSE and R-squared (also known as coefficient of determination), we have created …

Continue reading

Posted in Career Planning, Data Science, Interview questions, Machine Learning, statistics. Tagged with , , , .

Machine Learning: Identify New Features for Disease Diagnosis

learning-new-features-from-deep-learning-

When diagnosing diseases that require X-rays and image-based scans, such as cancer, one of the most important steps is analyzing the images to determine the disease stage and to characterize the affected area. This information is central to understanding clinical prognosis and for determining the most appropriate treatment. Developing machine learning (ML) / deep learning (DL) based solutions to assist with the image analysis represents a compelling research area with many potential applications. Traditional modeling techniques have shown that deep learning models can accurately identify and classify diseases in X-rays and image-based scans and can even predict patient prognosis using known features, such as the size or shape of the …

Continue reading

Posted in Deep Learning, Healthcare, Machine Learning. Tagged with , , .

Data Storytelling Explained with Examples

data storytelling key components

Have you ever told a story to someone, but they just didn’t seem to understand it? They might have been confused about the plot or why the characters acted in certain ways. If this has happened to you before, then you are not alone. Many people struggle with storytelling or rather data storytelling because they do not know how to communicate their data effectively to tell an engaging story. Data storytelling is a powerful tool that can be used to educate, inform or persuade an audience by using different kinds of narration. By using charts, graphs, images and other visuals, data can be made more interesting and engaging. Data storytelling …

Continue reading

Posted in Data Science. Tagged with .

Quiz: Linear Regression & F-Statistics

Interview questions

Linear Regression is one of the most widely used statistical methods for predictive modeling in various fields such as finance, marketing, and engineering. It involves fitting a linear equation to a set of data points, which can be used to make predictions about new data. One important aspect of linear regression is the use of F-Statistics, which is a statistical test used to determine the significance of the regression model. If you’re looking to test your knowledge of Linear Regression and F-Statistics, you’ve come to the right place! It will also be helpful if you are preparing for data science interviews. In this capsule quiz, we’ve compiled 10 questions that …

Continue reading

Posted in Career Planning, Data Science, Interview questions, Machine Learning. Tagged with .

Meta AI Announces Advancements in Embodied AI: Artificial Visual Cortex & Adaptive Skill Coordination

embodied ai - artificial visual cortext - adaptive skill coordination

Meta AI has announced two major advancements in general-purpose embodied AI agents, focusing on challenging sensorimotor skills. These advancements include an artificial visual cortex called VC-1 and a new approach called Adaptive Skill Coordination (ASC). Both developments offer valuable benefits to data scientists and researchers in the field of AI. Embodied AI is field of AI focused on agents that can perceive, understand, and interact with their environment through sensorimotor experiences. It aims to create AI systems that can perform tasks in the physical world, bridging the gap between abstract thought and reasoning, and physical actions. VC-1 is a single perception model that supports a diverse range of sensorimotor skills, …

Continue reading

Posted in AI, News, robotics. Tagged with .

Python – Draw Confusion Matrix using Matplotlib

Classification models are a fundamental part of machine learning and are used extensively in various industries. Evaluating the performance of these models is critical in determining their effectiveness and identifying areas for improvement. One of the most common tools used for evaluating classification models is the confusion matrix. It provides a visual representation of the model’s performance by displaying the number of true positives, false positives, true negatives, and false negatives. In this post, we will explore how to create and visualize confusion matrices in Python using Matplotlib. We will walk through the process step-by-step and provide examples that demonstrate the use of Matplotlib in creating clear and concise confusion …

Continue reading

Posted in Data Science, Machine Learning, Python. Tagged with , , , .

Degree of Freedom in Statistics: Meaning & Examples

degrees of freedom in statistics - meaning and examples

The degree of freedom (DOF) is a term that statisticians use to describe the degree of independence in statistical data. A degree of freedom can be thought of as the number of variables that are free to vary, given one or more constraints. When you have one degree, there is one variable that can be freely changed without affecting the value for any other variable. As a data scientist, it is important to understand the concept of degree of freedom, as it can help you do accurate statistical analysis and  validate the results. In this blog, we will explore the meaning of degree of freedom in statistics, its importance in …

Continue reading

Posted in Data Science, statistics. Tagged with , .

Different types of Time-series Forecasting Models

different types of time-series forecasting

Forecasting is the process of predicting future events based on past and present data. Time-series forecasting is a type of forecasting that predicts future events based on time-stamped data points. Time-series forecasting models are an essential tool for any organization or individual who wants to make informed decisions based on future events or trends. From stock market predictions to weather forecasting, time-series models help us to understand and forecast changes over time. However, with so many different types of models available, it can be challenging to determine which one is best suited for a particular scenario. There are many different types of time-series forecasting models, each with its own strengths …

Continue reading

Posted in Data Science, Machine Learning. Tagged with , .