In this blog, we will learn about the differences between complex event processing (CEP) and traditional database querying with the…
Hypothesis testing is a technique that helps scientists, researchers, or for that matter, anyone test the validity of their claims…
This post represents a comprehensive list of 85+ free books/ebooks and courses on machine learning, deep learning, data science, optimization,…
This blog represents high-level concepts on HBase architecture components. Following diagram represents the same: HBase Architecture Components - Key Building…
This article represents detailed view on what happens when a driver program (spark application) is started on one of the…
This article presents instructions and code samples for Docker enthusiasts to quickly get started with setting up Apache Spark standalone cluster…
This article represents top Linux foundation projects in relation with IOT, Cloud and Big Data. With the convergence of these…
This article represents top 5 pages listing global big data conferences coming up in 2016. Please feel free to comment/suggest…
This blog represents my notes on how data is read and written from/to HDFS. Please feel free to suggest if it…
This article represents top 5 usecases for using Solr to power your web and mobile search. Note that in case of mobile search requirements,…
This article represents key topics that one would want to learn in order to become a Hadoop Developer. One may…
This article intends to present dummies notes on how distributed computing works using Hadoop. As Hadoop is inspired by Google…
This article represents different document search architectural models using which one could create a search architecture that could search through 100s…
This article represents some of the top learning resources (webpages, videos etc) on my frequent visit list. Please feel free…
This article presents URL and short description of around 175 probability & statistics objective questions which could prove very useful…
This article represents quick details on some of the key open-source technologies (tools & frameworks) associated with Big Data. The…