My environment: Windows 8*
Step 1:
Download and install VMware Player 5.1.
Step 2:
Download and install the HortonWorks Sandbox. It is pre-configured and the fastest way to get started with Hadoop. Cloudera has a similar offering, but I came across HortonWorks while finding out about .NET support on Hadoop. It seems HortonWorks is a training provider that also has its own Hadoop offerings.
http://hortonworks.com/products/hortonworks-sandbox/
Step 3:
Load the HortonWorks image in VMware Player. It took a minute or so and I was up and running (I was glad to see an IP address on the console, ready for my browser). This is my first exposure to Hadoop as well as CentOS.
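If you would rather confirm from a script that the sandbox is answering before opening the browser, here is a minimal Python sketch. The function names `sandbox_url` and `is_reachable` are my own, and the default port 80 is only an assumption (the plain HTTP default); use whatever address the VM console actually shows you.

```python
import urllib.error
import urllib.request


def sandbox_url(ip, port=80):
    """Build the HTTP URL for the sandbox VM.

    The port is an assumption (HTTP default); use the one shown
    on the VM console if it differs.
    """
    return f"http://{ip}:{port}/"


def is_reachable(url, timeout=5.0):
    """Return True if anything answers over HTTP at the given URL."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just with an error status, so the VM is up.
        return True
    except OSError:
        # Connection refused, timed out, or host not found.
        return False
```

For example, `is_reachable(sandbox_url("192.168.56.101"))` would check a VM at that address; the IP here is a placeholder, not the one your sandbox will get.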
That’s all. Really. Only 3 simple steps. I may try a fresh single-node install in the coming days, but for now I want to grasp the concepts and get a first-hand feel.
I am planning to use the following course outline as a guideline for learning Hadoop, and I may well attend a training course in the future (from either Cloudera or HortonWorks).
http://hortonworks.com/hadoop-training/hadoop-training-for-developers/
At a high level, we can compartmentalize Hadoop’s offerings into the following 4 areas. This will help me understand the big picture without getting lost in the details.
*The installation steps should work just fine in a Windows 7 environment, though I haven’t tried. If anyone tries and finds anything different, let me know and I will include it in the notes.
In the next post, I will show you how to install HDInsight Server, an offering from Microsoft. Stay tuned!