Big Data Analytics (001) - Kapil Sharma

Big Data deals with huge volume of data. The core purpose of BD is streamline the quantity and quality. So that useful data analytics can be drive from clusters of data sets.

The purpose use for that is known as "ETL" (Extract >> Transform >> Load)

Where data change its shape and usability.

In opensource arena Apache PIG is used for ETL process, this Apache PIG software is not having pretty GUI to easy ETL the process.

After PIG ETL process the Apache HIVE comes in the picture for data analytics.

Apache HIVE used for data warehousing and query the big data sets drive from ETL through Apache PIG system.

I will show you all how to setup Apache PIG and HIVE on your machine in the next write-up. Till than happy coding :)

Map/Reduce Framework Basic - Kapil Sharma


Yii2, MVC, php web application framework resources - Kapil Sharma


A Data Engineer's Guide To Non-Traditional Data Storages


The Zen of devRant

Highly annoying clients: Non-technical family and friends. Clueless recruiters. You all know who I’m talking about – the client who wants you to code a website in GitHub; the partner who thinks your code looks like a bunch of sad winky faces; and the recruiters who want five years Swift experience when Swift is only two years old. For years, designers have been able to vent about Clients From Hell. Now, it’s developers’ turns to get the frustration off their chests on devRant. For those of you who’ve been living under a rock for the past year, devRant is where developers can, well, [anonymously] rant about all of the above. Some posts will make you laugh. Others will make you laugh so hard you cry. And just about all of them will make you empathize with the poster. This post is a culmination of our favorite devRants. We hope you enjoy them as much as we do. Work So you almost showed up to work with a positive attitude, but then PITA clients and bosses stepped in and turned that right …