Big data
Following topics will be covered:
- BigData stack installation
- Business case
- Development
1. Installation: Best way to go for this step is to have Ubuntu. I found this OS to be very friendly. you can go ahead with 14 Long term license for installation. Use Ubuntu site to download OR
Note: you can try before actually installing. you can also make bootable USBfrom windows and try this USB -OS first before going for actual installation.
1.1. Which file system ? : HDFS -
1.2 Which data management Layer ? - YARN
2. Business Use case: It is very important to understand that bid data world or hadoop world is for problems which have 5 V's . volume, variety, velocity, value and veracity.
More technically speaking if you are not going to have more than 5 datanodes no point using hdfs or less than millions/billions of of records thinks twice to use Hbase.
Also check the data generation activity e.g. is it machine or human generated.
3. Development: Eclipse could be used to have your first simple big data project.
Comments
Post a Comment