The Hadoop Distributed File System (HDFS) is a sub-project of the Apache Hadoop project. It is highly fault-tolerant and designed to be deployed on low-cost hardware. It also provides high-throughput access to application data, which makes it suitable for applications with large data sets.
This tutorial walks through commonly used commands for managing files from the command-line interface (CLI) and from the web-based interface (Files View).
- Downloaded and installed the latest Hortonworks Data Platform (HDP) Sandbox
- Familiarized yourself with the HDP Sandbox