marianomier / SEBC

Cloudera Boot Camp

Storage Lab #4

Open marianomier opened 6 years ago

marianomier commented 6 years ago

- Create an end-user Linux account named with your GitHub handle
- Make sure this Linux account is added to all cluster nodes
- Create an HDFS directory under /user

useradd marianomier
passwd marianomier
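The HDFS part of this step is only visible in the screenshots, so the following is a minimal sketch of what it might look like, not the exact commands that were run. It assumes the `hdfs` superuser is reachable through `sudo`, and the `run` helper is a hypothetical dry-run wrapper that prints each command unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Dry run by default: print each command instead of executing it.
DRY_RUN=${DRY_RUN:-1}
run() { if [ "$DRY_RUN" = "1" ]; then echo "+ $*"; else "$@"; fi; }

LAB_USER=marianomier            # GitHub handle from this issue
HOME_DIR="/user/$LAB_USER"      # HDFS home directory for that user

# The Linux account has to exist on every cluster node; repeat this per node.
run sudo useradd "$LAB_USER"

# Create the HDFS home directory as the hdfs superuser (assumed available via sudo),
# then hand ownership to the new user.
run sudo -u hdfs hdfs dfs -mkdir -p "$HOME_DIR"
run sudo -u hdfs hdfs dfs -chown "$LAB_USER" "$HOME_DIR"
```

Running the script as-is only prints the commands; `DRY_RUN=0 sh script.sh` on a cluster node would execute them.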


Create a 10 GB file using teragen
- Set the number of mappers to four
- Limit the block size to 32 MB
- Land the output in your user's home directory
- Use the time command to report the job's duration
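The actual invocation is only in the screenshots, so here is a hedged sketch of how the teragen requirements could translate into a command line. teragen writes 100-byte rows, so 10 GB is taken here as 10^10 bytes, or 100,000,000 rows. The examples jar path is an assumption based on a typical CDH parcel install, and the output directory name is hypothetical.

```shell
#!/bin/bash
# Sketch only: the jar path and output directory are assumptions, not taken from the screenshots.
GB_TARGET=10
ROW_BYTES=100                                             # teragen rows are 100 bytes each
ROWS=$(( GB_TARGET * 1000 * 1000 * 1000 / ROW_BYTES ))    # 10 GB read as 10^10 bytes
BLOCK_BYTES=$(( 32 * 1024 * 1024 ))                       # 32 MB block size in bytes

EXAMPLES_JAR=/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar
OUT_DIR=/user/marianomier/teragen-10g                     # hypothetical directory name

# Generic -D options go after the program name and before the positional arguments.
TERAGEN_CMD="hadoop jar $EXAMPLES_JAR teragen -Dmapreduce.job.maps=4 -Ddfs.blocksize=$BLOCK_BYTES $ROWS $OUT_DIR"

# Print the command for review; on a cluster node, run it as the end user with:
#   time $TERAGEN_CMD
echo "time $TERAGEN_CMD"
```

With four mappers and 32 MB blocks, each mapper would write roughly 2.5 GB spread across about 75 blocks.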


Run the terasort command on this file
- Use the time command to report the job's duration
- Land the result under your user's home directory
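A matching sketch for the sort step, under the same assumptions: the examples jar path reflects a typical CDH parcel layout, and both directory names are hypothetical placeholders for whatever was used in the screenshots.

```shell
#!/bin/bash
# Sketch only: paths below are assumptions, not the ones shown in the screenshots.
EXAMPLES_JAR=/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar
IN_DIR=/user/marianomier/teragen-10g     # hypothetical teragen output directory
OUT_DIR=/user/marianomier/terasort-10g   # hypothetical result directory; must not exist yet

TERASORT_CMD="hadoop jar $EXAMPLES_JAR terasort $IN_DIR $OUT_DIR"

# Print the command for review; on a cluster node, run it as the end user with:
#   time $TERASORT_CMD
echo "time $TERASORT_CMD"
```

The `real` line reported by `time` is the wall-clock duration the lab asks for.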


HDFS Lab: Test HDFS Snapshots

List the commands and output for each step below in storage/labs/2_snapshot_test.md.
- Create a precious directory in HDFS; copy the ZIP course file into it.
- Enable snapshots for precious
- Create a snapshot called sebc-hdfs-test
- Delete the directory
- Delete the ZIP file
- Restore the deleted file

Capture the NameNode web UI screen that lists snapshots in storage/labs/2_snapshot_list.png.
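The snapshot steps above could be sketched roughly as follows. The directory location and ZIP file name are assumptions, since the real ones appear only in the screenshots, and `run` is a hypothetical dry-run wrapper that prints each command unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Dry run by default: print each command instead of executing it.
DRY_RUN=${DRY_RUN:-1}
run() { if [ "$DRY_RUN" = "1" ]; then echo "+ $*"; else "$@"; fi; }

SNAP_DIR=/user/marianomier/precious   # assumed location of the precious directory
ZIP_FILE=SEBC-master.zip              # hypothetical name for the course ZIP file
SNAP_NAME=sebc-hdfs-test

# Create the directory and copy the ZIP file into it.
run hdfs dfs -mkdir -p "$SNAP_DIR"
run hdfs dfs -put "$ZIP_FILE" "$SNAP_DIR/"

# Enabling snapshots needs HDFS superuser rights (assumed available via sudo).
run sudo -u hdfs hdfs dfsadmin -allowSnapshot "$SNAP_DIR"
run hdfs dfs -createSnapshot "$SNAP_DIR" "$SNAP_NAME"

# Deleting a snapshottable directory that still has snapshots should be refused by HDFS.
run hdfs dfs -rm -r "$SNAP_DIR"

# Deleting the file itself succeeds; the snapshot keeps a read-only copy.
run hdfs dfs -rm -skipTrash "$SNAP_DIR/$ZIP_FILE"

# Restore by copying the file back out of the .snapshot directory.
run hdfs dfs -cp "$SNAP_DIR/.snapshot/$SNAP_NAME/$ZIP_FILE" "$SNAP_DIR/"

# List snapshottable directories visible to the current user.
run hdfs lsSnapshottableDir
```

The script does not use `set -e`, so the expected failure of the directory delete would not stop the remaining steps when run for real.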


HDFS Lab: Enable HDFS HA

- Use the Cloudera Manager wizard to enable HA
- Once configured, get a screenshot of the HDFS Instances tab
- Hint: Follow closely the Enabling HDFS HA Using Cloudera Manager instructions. There's more work that needs to be done besides running the wizard.
- Name the file storage/3_HDFS_HA.png
- Add a CM user and name it with your GitHub handle
- Assign the Full Administrator role to this user
- Assign the password cloudera to this user
- Re-assign the admin user to the Limited Operator role
- Take a screenshot of your users page; save it to storage/labs/4_CM_users.png
- In an Issue comment, post the URL to your Cloudera Manager instance
