
Learn R – How to Add New Column to Data Frame

This article presents concepts and code samples showing how to add new columns to a data frame using the R programming language. Please feel free to comment or suggest if I missed one or more important points.

Let's create a student data frame. The code is as follows:

# Names of the students
x <- c("Calvin", "Chris", "Raj")
# Ages of the students
y <- c(10, 25, 19)
# Create a non-empty data frame with column names:
# x becomes the "First Name" column and y becomes the "Age" column
student <- data.frame("First Name" = x, "Age" = y, stringsAsFactors = FALSE)
# Print the data frame
student

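The student data frame is created and the following is printed. Note that the column name "First Name" appears as First.Name, because data.frame() makes column names syntactically valid by default (check.names = TRUE):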

  First.Name Age
1     Calvin  10
2      Chris  25
3        Raj  19
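
As a minimal sketch of why the header reads First.Name rather than "First Name": data.frame() has a check.names argument that defaults to TRUE and passes the names through make.names(). The variable names student_default and student_raw below are illustrative and not part of the original example.

```r
x <- c("Calvin", "Chris", "Raj")
y <- c(10, 25, 19)

# Default behaviour: check.names = TRUE, so the space is replaced by a dot
student_default <- data.frame("First Name" = x, "Age" = y,
                              stringsAsFactors = FALSE)
names(student_default)

# With check.names = FALSE the name is kept exactly as written
student_raw <- data.frame("First Name" = x, "Age" = y,
                          stringsAsFactors = FALSE, check.names = FALSE)
names(student_raw)

# A name containing a space needs backticks when used with $
student_raw$`First Name`
```

Keeping the default is usually more convenient, since student$First.Name can then be written without backticks.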

 

How to Add a New Column

With the above student data frame, a column can be added in any of the following ways.

Approach 1: Use the $ operator with the data frame and assign a vector

# Add "Location" column
student$Location <- c("Wilmington", "San Francisco", "Dayton")
# Print student
student

Notice that the student data frame now has an additional “Location” column:

  First.Name Age      Location
1     Calvin  10    Wilmington
2      Chris  25 San Francisco
3        Raj  19        Dayton
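
The $ assignment in Approach 1 also accepts a single value or a vector computed from an existing column, but it rejects a vector whose length does not fit the number of rows. The sketch below rebuilds the student data frame so it runs on its own; the Enrolled and Is.Adult columns, the age threshold of 18, and the result variable are hypothetical and chosen only for illustration.

```r
student <- data.frame(First.Name = c("Calvin", "Chris", "Raj"),
                      Age = c(10, 25, 19),
                      stringsAsFactors = FALSE)

# A single value is recycled to every row
student$Enrolled <- TRUE

# A column can be computed from an existing column
student$Is.Adult <- student$Age >= 18

# A vector with 2 values does not fit 3 rows, so the assignment fails
result <- tryCatch({
  student$Bad <- c("a", "b")
  "assigned"
}, error = function(e) "error")

result
student
```

Because the failed assignment raises an error, the data frame is left without a "Bad" column.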

Approach 2: Use matrix-like notation and assign a vector. Note that the column name “Smoke” must be quoted, because it is passed as a character string.

student[, "Smoke"] <- c( "yes", "no", "yes")

Printing student now gives the following:

  First.Name Age      Location Smoke
1     Calvin  10    Wilmington   yes
2      Chris  25 San Francisco    no
3        Raj  19        Dayton   yes
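
One practical consequence of the quoted form in Approach 2 is that the column name can come from a variable. A minimal sketch, assuming the same student data, in which new_col is a hypothetical variable name:

```r
student <- data.frame(First.Name = c("Calvin", "Chris", "Raj"),
                      Age = c(10, 25, 19),
                      stringsAsFactors = FALSE)

# The column name is held in a variable and used inside the brackets
new_col <- "Smoke"
student[, new_col] <- c("yes", "no", "yes")

# By contrast, $ takes the name literally: this adds a column called "new_col"
student$new_col <- c("yes", "no", "yes")

names(student)
```

This makes the bracket form the more natural choice when column names are generated in a loop or passed into a function.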

Approach 3: Use array/map-like (list-style) notation and assign a vector. Note that the column name “Working” must again be quoted.

student["Working"] <- c("no", "yes", "yes")

Printing student now gives the following:

  First.Name Age      Location Smoke Working
1     Calvin  10    Wilmington   yes      no
2      Chris  25 San Francisco    no     yes
3        Raj  19        Dayton   yes     yes
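
To check that the three approaches are interchangeable, the sketch below applies each one to a copy of the same starting data frame and compares the results. The names base_df, vals, by_dollar, by_matrix and by_list are illustrative only.

```r
base_df <- data.frame(First.Name = c("Calvin", "Chris", "Raj"),
                      Age = c(10, 25, 19),
                      stringsAsFactors = FALSE)
vals <- c("no", "yes", "yes")

# Approach 1: $ operator
by_dollar <- base_df
by_dollar$Working <- vals

# Approach 2: matrix-like notation
by_matrix <- base_df
by_matrix[, "Working"] <- vals

# Approach 3: list-style notation
by_list <- base_df
by_list["Working"] <- vals

# Compare the three results pairwise
isTRUE(all.equal(by_dollar, by_matrix))
isTRUE(all.equal(by_matrix, by_list))
```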

Author: Ajitesh Kumar

