Python – How to Create a Dictionary Using Pandas Series

In this post, you will learn about one of the fundamental Pandas data structures, the Series, and how it can be used as a dictionary. It will be useful for beginner data scientists who want to understand the Pandas Series object.

A dictionary is a structure that maps arbitrary keys to a set of arbitrary values.

A Pandas Series is a one-dimensional array of indexed data. It can be created from a list, a NumPy array, or a dictionary. A Series can be thought of as a specialization of a Python dictionary: it is a structure that maps typed keys to a set of typed values.
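The dictionary analogy can be made concrete with a small sketch. The marks below are made-up values used only for illustration, and the Series is built from a plain dict so that the keys become the index labels.

```python
import pandas as pd

# Hypothetical marks, used only for illustration.
marks = pd.Series({'Mathematics': 82, 'Science': 92, 'English': 64})

# Dictionary-style operations act on the index labels (the "keys").
has_science = 'Science' in marks    # membership test on the labels
subjects = list(marks.keys())       # keys() returns the index
pairs = list(marks.items())         # (label, value) pairs, like dict.items()

# Unlike a plain dict, all values share a single dtype.
value_dtype = marks.dtype
```

The shared dtype is what the phrase "typed values" refers to: the values are stored in a typed array rather than as arbitrary Python objects.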

Here are three ways to create a dictionary-like Series object:

Series like a one-dimensional NumPy array

import pandas as pd
data = pd.Series(data=[85, 65, 92, 44])
Fig 1. Pandas Series with default numeric indices, similar to a one-dimensional NumPy array

In the above Series object, the index defaults to the integers 0 to 3. Values can be accessed by index: data[0] is 85 and data[3] is 44. The values and the index can be inspected using the attributes data.values and data.index.
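As a minimal sketch, the accesses described above look like this when run against the same four values (the variable names other than data are chosen here for illustration):

```python
import pandas as pd

data = pd.Series(data=[85, 65, 92, 44])

first = data[0]        # 85
last = data[3]         # 44

values = data.values   # the underlying NumPy array of values
index = data.index     # the default integer index covering 0 to 3
```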

It may look like the Series object is basically interchangeable with a one-dimensional NumPy array. The essential difference is the presence of the index: while the NumPy array has an implicitly defined integer index used to access the values, the Pandas Series has an explicitly defined index associated with the values.
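The difference between an implicit and an explicit index can be illustrated with a short sketch. The non-sequential integer labels below are arbitrary and chosen only to show that a Series looks values up by label rather than by position.

```python
import numpy as np
import pandas as pd

scores = np.array([85, 65, 92, 44])

# NumPy array: the positions 0..3 are implicit and cannot be changed.
second_by_position = scores[1]

# Series: the index is stored explicitly and can hold other labels,
# here arbitrary non-sequential integers chosen for illustration.
labelled = pd.Series(scores, index=[2, 5, 3, 7])
second_by_label = labelled[5]   # looks up the label 5, not position 5
```

Both lookups return the same value, 65, but the Series reaches it through the label 5 that was attached to it explicitly.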

Series with an explicitly defined index with values of any type

The following is another way to create a dictionary-like Series:

data = pd.Series(data=[85, 65, 92, 44], index=['Mathematics', 'English', 'Science', 'Hindi'])
Fig 2. Pandas Series with explicitly defined indices

In the above Series object, the index is explicitly defined, which gives the Series object additional capabilities. The index need not consist of integers; it can hold values of any hashable type, and those values need not be unique. The object supports both label-based indexing, such as data['Science'], and integer position-based indexing, such as data.iloc[0].
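A brief sketch of these capabilities follows, reusing the Series defined above. The second Series, named repeated, is a hypothetical example added here to show a non-unique index.

```python
import pandas as pd

data = pd.Series(data=[85, 65, 92, 44],
                 index=['Mathematics', 'English', 'Science', 'Hindi'])

# Label-based access, dictionary style or via .loc.
by_label = data['Science']           # 92
by_label_loc = data.loc['English']   # 65

# Position-based access via .iloc.
by_position = data.iloc[0]           # 85

# Label slices include both endpoints, unlike positional slices.
label_slice = data.loc['Mathematics':'Science']

# A non-unique index is allowed; a repeated label returns every match.
repeated = pd.Series([70, 75, 90], index=['English', 'English', 'Science'])
both_english = repeated['English']   # a Series holding 70 and 75
```

Using .loc and .iloc explicitly keeps label lookups and position lookups unambiguous, which matters most when the index itself contains integers.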

Series with an explicitly defined index with values of any type – II

A Pandas Series can also be created directly from a Python dictionary, in which case the dictionary keys become the index:

data = pd.Series(data={'Mathematics': 82, 'Science': 92, 'English': 64})
Fig 3. Pandas Series created from a dictionary, with the keys used as the index
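As a rough sketch of how the dictionary form behaves, the example below builds the same Series, selects a subset of keys through the index argument, and converts the Series back to a plain dict. The names marks, subset, and round_trip are illustrative only.

```python
import pandas as pd

marks = {'Mathematics': 82, 'Science': 92, 'English': 64}

# The dictionary keys become the index, in insertion order.
data = pd.Series(data=marks)

# Passing index alongside a dict keeps only the listed keys, in that order.
subset = pd.Series(data=marks, index=['English', 'Mathematics'])

# to_dict() converts the Series back into an ordinary Python dictionary.
round_trip = data.to_dict()
```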
Ajitesh Kumar