
Starting HDFS
Follow these steps to start HDFS (the NameNode and DataNode daemons):
- Format the filesystem:
$ ./bin/hdfs namenode -format
- Start the NameNode daemon and the DataNode daemon:
$ ./sbin/start-dfs.sh
The Hadoop daemon log output is written to the $HADOOP_LOG_DIR directory (defaults to $HADOOP_HOME/logs).
- Browse the web interface for the NameNode; by default it is available at http://localhost:9870/.
- Make the HDFS directories required to execute MapReduce jobs:
$ ./bin/hdfs dfs -mkdir /user
$ ./bin/hdfs dfs -mkdir /user/<username>
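With the user directory in place, a quick sanity check is to copy a few files into HDFS and list them back. The input directory name here is only an illustration (the Hadoop XML config files just make convenient sample input):

```shell
# Create a working directory and copy Hadoop's own XML configs
# into it as sample input files.
./bin/hdfs dfs -mkdir /user/<username>/input
./bin/hdfs dfs -put etc/hadoop/*.xml /user/<username>/input

# List the files to confirm they landed in HDFS.
./bin/hdfs dfs -ls /user/<username>/input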
- When you're done, stop the daemons with the following:
$ ./sbin/stop-dfs.sh
- Open a browser and go to http://localhost:9870/ to check your local Hadoop installation. The following is what the HDFS installation looks like:

- Clicking on the Datanodes tab shows the nodes as shown in the following screenshot:

Figure: Screenshot showing the nodes in the Datanodes tab
- Clicking on the logs will show the various logs in your cluster, as shown in the following screenshot:

- As shown in the following screenshot, you can also look at the various JVM metrics of your cluster components:

- As shown in the following screenshot, you can also check the configuration. This is a good place to look at the entire configuration and all the default settings:

- You can also browse the filesystem of your newly installed cluster, as shown in the following screenshot:

Figure: Screenshot showing the Browse Directory view and how you can browse the filesystem in your newly installed cluster
At this point, we should be able to see and use a basic HDFS cluster. However, this is just an HDFS filesystem with some directories and files; we also need a job/task scheduling service to actually use the cluster for computation rather than just storage.