Big data

Following topics will be covered:

  1. BigData stack installation
  2. Business case
  3. Development

1. Installation: Best way to go for this step is to have Ubuntu. I found this OS to be very friendly. you can go ahead with 14 Long term license for installation. Use Ubuntu site to download OR
Note: you can try before actually installing. you can also make bootable USBfrom windows and try this USB -OS first before going for actual installation.  

1.1. Which file system ? : HDFS - 

1.2 Which data management Layer ? - YARN

2. Business Use case: It is very important to understand that bid data world or hadoop world is for problems which have 5 V's . volume, variety, velocity, value and veracity.
More technically speaking if you are not going to have more than 5 datanodes no point using hdfs or less than millions/billions of of records thinks twice to use Hbase.
Also check the data generation activity e.g. is it machine or human generated.

3. Development: Eclipse could be used to have your first simple big data project.

Comments

Popular posts from this blog

Qlik Sense Important Links

Cloud Architecture Notes

AWS Rout53 NS records do not match with whois dns records OR Your site NOT working with registered domain name? Check this...