Processing big amount of data and processing it at large scale has always been a task for the system, this is when engineers or Hadoop masters are required. Hadoop is a complete eco-system of open source project that provide us the frame work to deal with big data. Designed in a manner to manage from single server to thousands of machines, with local computation and storage the only challenge which is faced is the enormous investment and excessive time taken.