Install Hadoop in Stand-Alone Mode on Ubuntu 16.04
Big Data, Technology & Science / January 24, 2018

Hadoop is a framework written in Java for running applications on large clusters of commodity hardware. It incorporates features comparable to those of the Google File System (GFS) and of the MapReduce computing paradigm. Hadoop’s HDFS is a highly fault-tolerant distributed file system and, like Hadoop in general, is designed to be deployed on low-cost hardware. It provides high-throughput access to application data and is suitable for applications that have large data sets.

Hadoop is a project of the Apache Software Foundation and is, in effect, a cluster data management project: it manages large data sets across a group of clustered machines. Configuring a full Hadoop cluster is involved, but Hadoop can also be installed on a single machine, in stand-alone mode, to perform some basic operations.

Hadoop may appear to be a single piece of software, but it is made up of several components:

Hadoop Common: A large library of utilities and supporting code used by the other Hadoop modules.
HDFS: The Hadoop Distributed File System is responsible for storing the data on disk.
YARN: YARN stands for Yet Another Resource Negotiator. It is the open-source resource management and job scheduling layer for Hadoop's distributed processing.
MapReduce: Map reduce…
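To make the MapReduce paradigm mentioned above a little more concrete, here is a minimal sketch in plain Python of the classic word count, split into map, shuffle, and reduce steps. It does not use Hadoop or any Hadoop API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample input are hypothetical and chosen only for illustration. On a real cluster, the framework would run the map and reduce steps in parallel across machines and handle the grouping in between.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group emitted values by key (done by the framework in a real job)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, not taken from the article.
    sample = ["Hadoop stores data", "Hadoop processes data"]
    print(word_count(sample))
```

Everything here runs in a single process, which is roughly the situation in stand-alone mode as well: the same logic applies, just without distribution across a cluster.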
