Data Science

How to use Sklearn Datasets For Machine Learning

In this post, you wil learn about how to use Sklearn datasets for training machine learning models. Here is a list of different types of datasets which are available as part of sklearn.datasets
  • Iris (Iris plant datasets used – Classification)
  • Boston (Boston house prices – Regression)
  • Wine (Wine recognition set – Classification)
  • Breast Cancer (Breast cancer wisconsin diagnostic – Classification)
  • Digits (Optical recognition of handwritten digits dataset – Classification)
  • Linnerud (Linnerrud dataset – Classification)
  • Diabetes (Diabetes – Regression)
The following command could help you load any of the datasets:
from sklearn import datasets
iris = datasets.load_iris()
boston = datasets.load_boston()
breast_cancer = datasets.load_breast_cancer()
diabetes = datasets.load_diabetes()
wine = datasets.load_wine()
datasets.load_linnerud()
digits = datasets.load_digits()

All of the datasets come with the following and are intended for use with supervised learning:
  • Data (to be used for training)
  • Labels (Target)
  • Labels attriibute
  • Description of the dataset
The following command can be used for accessing the value of above:
# Let's use IRIS as an example for reading different aspects of data
iris.data
iris.target
iris.target_names
print(iris.DESCR)

Ajitesh Kumar

I have been recently working in the area of Data analytics including Data Science and Machine Learning / Deep Learning and BI. I would love to connect with you on Linkedin. Check out my books titled as Designing Decisions, and First Principles Thinking.

Recent Posts

The Watermelon Effect: When Green Metrics Lie

We’ve all been in that meeting. The dashboard on the boardroom screen is a sea…

3 weeks ago

Coefficient of Variation in Regression Modelling: Example

When building a regression model or performing regression analysis to predict a target variable, understanding…

3 months ago

Chunking Strategies for RAG with Examples

If you've built a "Naive" RAG pipeline, you've probably hit a wall. You've indexed your…

3 months ago

RAG Pipeline: 6 Steps for Creating Naive RAG App

If you're starting with large language models, you must have heard of RAG (Retrieval-Augmented Generation).…

3 months ago

Python: List Comprehension Explained with Examples

If you've spent any time with Python, you've likely heard the term "Pythonic." It refers…

4 months ago

Large Language Models (LLMs): Four Critical Modeling Stages

Large language models (LLMs) have fundamentally transformed our digital landscape, powering everything from chatbots and…

6 months ago