Create an end-user Linux account named with your GitHub handle
Make sure this Linux account is added to all cluster nodes
Create an HDFS home directory for this account under /user
useradd marianomier
passwd marianomier
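The HDFS side of this step can be sketched as below. This is a minimal sketch, not the lab's official answer: it assumes `hdfs` is the HDFS superuser account on your cluster, and it only prints the commands so you can review them before running them.

```shell
#!/usr/bin/env bash
# Sketch: print the commands that create an HDFS home directory for a user.
# ASSUMPTION (not from the lab text): "hdfs" is the HDFS superuser account.
LAB_USER="${LAB_USER:-marianomier}"

hdfs_home_cmds() {
  local user="$1"
  # Create the directory as the superuser, then hand ownership to the user.
  echo "sudo -u hdfs hdfs dfs -mkdir -p /user/${user}"
  echo "sudo -u hdfs hdfs dfs -chown ${user} /user/${user}"
}

hdfs_home_cmds "$LAB_USER"
```

Paste the printed lines into a shell on a node with the HDFS client configured. The `useradd` step itself still has to be repeated on every cluster node.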
Create a 10 GB file using teragen
Set the number of mappers to four
Limit the block size to 32 MB
Land the output in your user's home directory
Use the time command to report the job's duration
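The teragen settings above could be assembled as in the following sketch. Several things here are assumptions rather than part of the lab text: the examples JAR path (a common CDH parcel location), the output directory name `teragen`, and the reading of "10 GB" as 10^10 bytes. teragen takes a count of 100-byte rows, so the sketch converts sizes before printing the command.

```shell
#!/usr/bin/env bash
# Sketch: build the teragen command line for 10 GB, 4 mappers, 32 MB blocks.
# ASSUMPTIONS: the JAR path and the output directory name are hypothetical.
LAB_USER="${LAB_USER:-marianomier}"
EXAMPLES_JAR="${EXAMPLES_JAR:-/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar}"

# teragen counts 100-byte rows, so convert decimal gigabytes to a row count.
teragen_rows() { echo $(( $1 * 1000000000 / 100 )); }

# dfs.blocksize is passed in bytes here; convert from MiB.
block_bytes() { echo $(( $1 * 1024 * 1024 )); }

teragen_cmd() {
  echo "time hadoop jar ${EXAMPLES_JAR} teragen" \
       "-Dmapreduce.job.maps=4" \
       "-Ddfs.blocksize=$(block_bytes 32)" \
       "$(teragen_rows 10) /user/${LAB_USER}/teragen"
}

teragen_cmd
```

Run the printed line as the lab user in bash. The `time` prefix reports the job's duration once the job finishes.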
Run the terasort command on this file
Use the time command to report the job's duration
Land the result under your user's home directory
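A matching terasort invocation might look like the sketch below. The input and output directory names (`teragen`, `terasort`) and the JAR path are hypothetical; substitute whatever you used for the teragen step.

```shell
#!/usr/bin/env bash
# Sketch: build the terasort command line for the file produced by teragen.
# ASSUMPTIONS: JAR path and both directory names are hypothetical.
LAB_USER="${LAB_USER:-marianomier}"
EXAMPLES_JAR="${EXAMPLES_JAR:-/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar}"

terasort_cmd() {
  local in_dir="/user/${LAB_USER}/teragen"    # where teragen wrote its output
  local out_dir="/user/${LAB_USER}/terasort"  # must not exist before the job runs
  echo "time hadoop jar ${EXAMPLES_JAR} terasort ${in_dir} ${out_dir}"
}

terasort_cmd
```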
HDFS Lab: Test HDFS Snapshots
List the commands and output for each step below in storage/labs/2_snapshot_test.md.
Create a precious directory in HDFS; copy the ZIP course file into it.
Enable snapshots for precious
Create a snapshot called sebc-hdfs-test
Delete the directory
Delete the ZIP file
Restore the deleted file
Capture the NameNode web UI screen that lists snapshots in storage/labs/2_snapshot_list.png.
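The snapshot steps above can be sketched as the command sequence below. This is one plausible reading, not the reference solution: it assumes `precious` lives under your HDFS home directory, that `hdfs` is the superuser account, and it uses `course.zip` as a placeholder for the ZIP file name, which the lab does not give.

```shell
#!/usr/bin/env bash
# Sketch: print the HDFS commands for the snapshot lab, one per step.
# ASSUMPTIONS: directory location, superuser name, and ZIP name are hypothetical.
LAB_USER="${LAB_USER:-marianomier}"

snapshot_lab_cmds() {
  local dir="$1" zip="$2" zip_name
  zip_name="$(basename "$zip")"
  echo "hdfs dfs -mkdir -p ${dir}"
  echo "hdfs dfs -put ${zip} ${dir}/"
  # Allowing snapshots is an administrative action, hence the superuser.
  echo "sudo -u hdfs hdfs dfsadmin -allowSnapshot ${dir}"
  echo "hdfs dfs -createSnapshot ${dir} sebc-hdfs-test"
  # A direct delete of a snapshottable directory that still has snapshots
  # is normally refused; record whatever HDFS actually reports.
  echo "hdfs dfs -rm -r ${dir}"
  echo "hdfs dfs -rm ${dir}/${zip_name}"
  # Restore by copying the file back out of the read-only .snapshot path.
  echo "hdfs dfs -cp ${dir}/.snapshot/sebc-hdfs-test/${zip_name} ${dir}/"
}

snapshot_lab_cmds "/user/${LAB_USER}/precious" "course.zip"
```

Run the printed commands one at a time so that the output of each step can be pasted into storage/labs/2_snapshot_test.md.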
HDFS Lab: Enable HDFS HA
Use the Cloudera Manager wizard to enable HA
Once configured, get a screenshot of the HDFS Instances tab
Hint: Follow the Enabling HDFS HA Using Cloudera Manager instructions closely. Running the wizard alone is not enough; the instructions list further steps to complete afterwards.
Name the file storage/3_HDFS_HA.png
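As an optional cross-check alongside the screenshot, the active/standby state of each NameNode can be queried from the command line. The sketch below only prints the `hdfs haadmin` commands; the NameNode IDs `nn1` and `nn2` are hypothetical, since Cloudera Manager generates its own IDs, and the commented lines show how they might be looked up on a single-nameservice cluster.

```shell
#!/usr/bin/env bash
# Sketch: print one "hdfs haadmin -getServiceState" command per NameNode ID.
ha_state_cmds() {
  local ids="$1" id
  local IFS=','            # IDs arrive as a comma-separated list
  for id in $ids; do
    echo "hdfs haadmin -getServiceState ${id}"
  done
}

# On a live cluster the IDs could be read from the client configuration:
#   ns="$(hdfs getconf -confKey dfs.nameservices)"
#   ids="$(hdfs getconf -confKey "dfs.ha.namenodes.${ns}")"
ha_state_cmds "nn1,nn2"    # hypothetical IDs
```

With HA working, one of the resulting commands should report `active` and the other `standby`.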
Add a CM user and name it with your GitHub handle
Assign the Full Administrator role to this user
Assign the password cloudera to this user
Re-assign the admin user to the Limited Operator role
Take a screenshot of your users page; save it to storage/labs/4_CM_users.png
In an Issue comment, post the URL to your Cloudera Manager instance