Big data

Following topics will be covered:

  1. BigData stack installation
  2. Business case
  3. Development

1. Installation: Best way to go for this step is to have Ubuntu. I found this OS to be very friendly. you can go ahead with 14 Long term license for installation. Use Ubuntu site to download OR
Note: you can try before actually installing. you can also make bootable USBfrom windows and try this USB -OS first before going for actual installation.  

1.1. Which file system ? : HDFS - 

1.2 Which data management Layer ? - YARN

2. Business Use case: It is very important to understand that bid data world or hadoop world is for problems which have 5 V's . volume, variety, velocity, value and veracity.
More technically speaking if you are not going to have more than 5 datanodes no point using hdfs or less than millions/billions of of records thinks twice to use Hbase.
Also check the data generation activity e.g. is it machine or human generated.

3. Development: Eclipse could be used to have your first simple big data project.

Comments

Popular posts from this blog

Cloud Architecture Notes

Qlik Sense Important Links