This class will provide a brief overview of what Hadoop is and the various components that are involved in the Hadoop ecosystem. There will be a hands on showcase for the users on how to use the dumbo(Hadoop) cluster to run basic map-reduce jobs. Various hands on exercises have been incorporated for the users to get a better understanding.
The pre-requisites of this class:
1. HPC user account is mandatory.
2. The user needs to have a basic knowledge of Unix and Java/python.
Thursday, February 1, 2018
2:00pm - 4:00pm
Bobst Library, Rm. 617, 6th Floor