Categories: Big Data

Big Data is NOT Just about Hadoop Stack Implementation

That is something any one can with a decent technical skill and Java experience could do it.

Big Data has lot to do with Data science. And, to stand out as a Big Data solution provider in the IT marketplace, one needs to have a team of Data scientist who work with technologist to implement Big data solution suggested by them.

Thus, following is how the Big Data team may look like?

  • Project/Delivery Manager
  • Data Scientist
  • Technical Architect (Hadoop)
  • Technical team including team/tech lead, developers, testers etc
  • Build/Configuration Engineer: This may be important owing to the Big Data typical cluster configurations requirement and the complexities surrounding it.

What is a Data Scientist?

Basically, a data scientist is someone who could extract meaning from data and create/propose new data products or enhancement to existing data products. Some of the pre-requisite skills for a data scientist are following:

  • Data Engineering
  • Pattern recognition and learning
  • Visualization
  • Data warehousing
  • Maths & Statistics

Take a look at picture below to get a visualization of skills.

Data Science

Thus, if you as a software provider have not yet started on this, its not too late at all. Plan to hire one or get a person in house to develop the skills on data science. Both may be harder to achieve as this job requirement is pretty new. However, the job need for a data scientist is only going to grow. So better late than never. Go and get one for your Big Data team to start with.

And one another thing. While talking to a new prospect on Big Data, get Data scientist involved as well along with a Hadoop technical architect. Chances of impressing the prospect and closing the deal will be higher.

Ajitesh Kumar

I have been recently working in the area of Data analytics including Data Science and Machine Learning / Deep Learning. I am also passionate about different technologies including programming languages such as Java/JEE, Javascript, Python, R, Julia, etc, and technologies such as Blockchain, mobile computing, cloud-native technologies, application security, cloud computing platforms, big data, etc. For latest updates and blogs, follow us on Twitter. I would love to connect with you on Linkedin. Check out my latest book titled as First Principles Thinking: Building winning products using first principles thinking. Check out my other blog, Revive-n-Thrive.com

Recent Posts

Feature Engineering in Machine Learning: Python Examples

Last updated: 3rd May, 2024 Have you ever wondered why some machine learning models perform…

41 mins ago

Feature Selection vs Feature Extraction: Machine Learning

Last updated: 2nd May, 2024 The success of machine learning models often depends on the…

19 hours ago

Model Selection by Evaluating Bias & Variance: Example

When working on a machine learning project, one of the key challenges faced by data…

1 day ago

Bias-Variance Trade-off in Machine Learning: Examples

Last updated: 1st May, 2024 The bias-variance trade-off is a fundamental concept in machine learning…

2 days ago

Mean Squared Error vs Cross Entropy Loss Function

Last updated: 1st May, 2024 As a data scientist, understanding the nuances of various cost…

2 days ago

Cross Entropy Loss Explained with Python Examples

Last updated: 1st May, 2024 In this post, you will learn the concepts related to…

2 days ago