Machine Learning – How to Predict Software Developer Productivity

This article presents my thoughts on how machine learning techniques could be used to solve one of the most common problems in the software industry: determining whether a software developer is productive. After all the effort I have made to solve this problem using traditional rules-based programming techniques, I can say that there is no definitive way of finding a concrete solution. As a matter of fact, I created a tool, AgileSQM, to capture software quality metrics (SQM) such as code coverage, duplication, and complexity, and to infer from the trending data whether a software developer is productive. However, I soon hit a roadblock in terms of acceptance …

Posted in Big Data.

Java – Top 10 Java-based Web Frameworks for 2014-2015

This article presents an analysis of the Java-based web frameworks that emerged in the top 10 list this year (2014) and are worth considering for your next project starting this year or next year (2015). I have done the data analysis based on the following: job openings (as of today) on a very popular website, indeed.com; discussion threads (for this year) on a very popular Q&A website, stackoverflow.com; and responses on a very popular social bookmarking website, reddit.com. Based on the analysis of the top 10 frameworks, I have listed the top 5 frameworks that emerged as clear winners. Please feel free to comment or suggest if I have missed one or more important frameworks. Also, …

Posted in Java, Web.

How to use Sonar Dashboard to Report on Software Code Quality

This article presents methods one could adopt to read the Sonar dashboard and gather data appropriately to monitor and control software code quality. The primary reason I am writing this blog is that I have come across several team leads who asked me questions about the data on the Sonar dashboard and what they could do with it. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are three different aspects of software quality that could be tracked using the Sonar dashboard: maintainability (testability, reusability, and modularity), usability (readability and understandability), and security. Following are …

Posted in Software Quality.

Productivity – Top 3 JavaScript IDEs/Text Editors to Consider

This article presents the top 3 JavaScript IDEs/text editors that you may want to explore for your next JavaScript development project. The top 3 editors were chosen by the number of votes (thumbs up) given by users in a JavaScript IDE/text editor discussion thread on a very popular social bookmarking website, as of today. Interestingly, newer editors such as Brackets and Atom are catching the attention of some users. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are different JavaScript text editors (at times, also called …

Posted in Javascript, Web.

Data Science – Examples of Machine Learning Problems

This article presents different classifications of machine learning problems along with some examples taken from real-world problems. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the categories that cover 80% of machine learning problems: classification, clustering, and regression.

Machine Learning – Classification Problems

Simply speaking, if the answer to a problem consists of discrete values such as the following, the problem can be termed a classification problem. Such problems are commonly solved with techniques such as logistic regression. The discrete values may be yes or no (e.g., 1 or 0), or a finite set of values representing a multi-class classification problem. Mathematically speaking, if “h(x)” …
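
The difference between a discrete answer (classification) and a continuous answer (regression) can be sketched in a few lines of Python. This is only an illustration: the logistic (sigmoid) function, the 0.5 threshold, and the names `sigmoid` and `classify` are assumptions made for the example and are not taken from the article.

```python
import math


def sigmoid(z: float) -> float:
    """Map any real-valued score to a value strictly between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def classify(score: float, threshold: float = 0.5) -> int:
    """Turn a continuous score into a discrete label: 1 (yes) or 0 (no).

    The 0.5 threshold is an assumption chosen for this sketch.
    """
    return 1 if sigmoid(score) >= threshold else 0


# Hypothetical scores, e.g. the output of some already-trained linear model.
for score in (-2.0, 0.0, 2.0):
    print(score, round(sigmoid(score), 3), classify(score))
```

The value returned by `sigmoid` is continuous, while `classify` returns only one of two labels, which is what makes the problem a classification problem rather than a regression problem.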

Posted in Big Data.

Top 7 Data Science Subreddits to Follow

This article lists the top subreddits related to data science on reddit.com that data science aspirants or professionals could watch on a regular basis for news, stories, and discussions. Generally, I find reddit.com very useful for staying in touch with the latest interesting stories and keeping myself up to date. For those unaware of what a subreddit is: simply speaking, a subreddit is a topic-based group on reddit.com comprising users who want to publish or discuss news or stories related to that topic. For data science, there are multiple groups, each focused on a single topic, such as those mentioned below. Please feel free to comment or suggest if I have missed one or more important …

Posted in Big Data.

JavaScript Frameworks Job Pattern with IT Companies in India

This article presents my analysis of job patterns vis-a-vis JavaScript frameworks at IT companies in India. It may also be read as the pattern of adoption of JS frameworks by the developer community at IT companies in India. The analysis is based on the number of job openings related to the JS frameworks listed below that were posted last month (October 2014) on one of the top job portals catering to the Indian IT market. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the top 7 JavaScript frameworks which are the most sought-after skill sets in potential UI-related candidates, from …

Posted in Javascript, Web.

AngularJS – Top 6 Concepts That Developers Loved

This article presents the top 6 AngularJS topics that have been used most by the AngularJS developer community to date. The inference is derived from the number of tagged discussions on Stack Overflow. Clearly, “Directive” is the winner and attracts the most discussion of them all. The article presents my thoughts on why these topics have been the most popular. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following is the list of the top 6 popular topics: Directives, Scope Object, Ng-repeat, Angular UI & Bootstrap, Routing, and Service. The following plot demonstrates the popularity of different features/topics in relation to AngularJS. …

Posted in Web.

Data Science – Quick Start Guide for Machine Learning

This article presents very high-level information on different aspects of machine learning, with the objective of providing a quick-start guide for data science beginners. One could pick up one or more books on machine learning to learn the subject in detail. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the key points described later in this article: what machine learning is, the key phases of machine learning, and the prediction API model of machine learning.

What is Machine Learning?

Simply speaking, machine learning is a set of artificial intelligence techniques which are used to solve one of …

Posted in Big Data.

Data Scraping – Top 5 Reasons for using Import.io Tool

This article presents my thoughts on why one would want to use the web data scraping tool named import.io. I must say that I am glad I found this tool for data scraping. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the key points described later in this article: key aspects of Import.io, reasons why one must try Import.io for their next data scraping project, and use cases where the Import.io scraping tool could be used.

Key Aspects of Import.io Tool

Import.io is a cloud-based web scraping tool that could be a boon for those looking …

Posted in Big Data.

Data Science – 8 Steps to Perform Regression Analysis using R

This article presents my thoughts on the steps that may be required to perform regression analysis (linear or multiple) using the R programming language on a given data set where the response variable is primarily a continuous variable. Remember that continuous variables are those that can take any numeric value, unlike discrete variables, which can take only a limited set of values. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the key steps described later in this article: load the data, observe the data, clean the data, explore the data visually, fit the linear or multiple regression model …
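
The first few steps above (load, observe, clean, fit) can be sketched on a toy data set. Note that the article works in R; the sketch below deliberately uses plain Python with only the standard library, and the data values, the variable names, and the `fit_line` helper are all made up for illustration.

```python
from statistics import mean

# Load: a hypothetical data set of (predictor, response) pairs.
# None marks a missing response value.
raw = [(1.0, 2.1), (2.0, 3.9), (3.0, None), (4.0, 8.2), (5.0, 9.8)]

# Observe: count how many rows have a missing response.
n_missing = sum(1 for _, y in raw if y is None)

# Clean: drop the rows with a missing response.
rows = [(x, y) for x, y in raw if y is not None]


def fit_line(points):
    """Fit y = intercept + slope * x by ordinary least squares."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in points)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Fit: a simple linear regression with a single predictor.
slope, intercept = fit_line(rows)
print(f"missing rows: {n_missing}, slope: {slope:.2f}, intercept: {intercept:.2f}")
```

The response here is continuous, which is why a regression model is the appropriate choice; a multiple regression would extend the same idea to several predictors.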

Posted in Big Data.

Data Science – Top 5 Videos to Learn Bayes’ Theorem

This article presents the top 5 videos that I found to be great when I was trying to understand Bayes’ theorem from YouTube channels. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the top 5 videos that I found quite useful for understanding Bayes’ theorem: Bayes’ Theorem Formula: This is the one I liked most. It is a very short and sweet video that explains Bayes’ theorem with a very nice example of the economy and stock values in just 6 minutes. For beginners, I would recommend this as the first video to get started with Bayes’ theorem. Bayes Theorem with …

Posted in Big Data.

Learn R – Hello World with R – Code Example

This article covers some of the basic concepts one needs to understand to write and execute Hello World using the R programming language. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the key points described later in this article: basic concepts to write a Hello World function in R, and a Hello World code example.

Basic Concepts to Write Hello World Function in R

Following are some key points to pay attention to while working through the Hello World example: R code is written as a set of one or more functions. In R, one could assign a function …

Posted in Big Data.

Learn R – How to Get Started with GGPlot – Code Example

This article gives a quick introduction to GGPlot along with key concepts and code examples using the R programming language. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the key points described later in this article: a quick introduction to GGPlot, installation and loading of GGPlot, and GGPlot key concepts.

Quick Introduction to GGPlot

ggplot is one of the statistical packages that facilitate the easy creation of different plots. One of the key concepts related to ggplot is that a plot is built up layer by layer. This means that one could start by initializing the plot using ggplot(data) …

Posted in Big Data.

Learn R – When to use Histogram, Scatterplot & Boxplot – Code Example

This article presents some facts on when to use which kind of plot, with code examples and plots, when working with the R programming language. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the key plots described later in this article: histogram, scatterplot, and boxplot. Following is the description of the above-mentioned plots along with code examples based on the base R package. Note that each of these plots could be produced using different commands when using the ggplot2 package.

Histogram: A histogram is one of the best forms of visualization when working with a single continuous variable. It plots the relative …

Posted in Big Data.

Architecture – Top 10 Traits of a Software Architect

This article presents my opinion on the top 10 traits of a software architect. Recently, a colleague asked me to take a shot at defining what it would take for someone to become an architect. The list below could be applied to architects in any technology. Note that it does not take into account a few traits of an enterprise architect; I purposely did not include them. Please feel free to comment or suggest if I have missed one or more important points. Also, sorry for the typos. Following are the top 10 traits described below: knowledge of relevant technologies; should demonstrate strong …

Posted in Architecture, Enterprise Architecture.