sheikhshack / bigdatabases-aws-50.043

The project for 50.043
1 stars 0 forks source link

Checkoff 2 #18

Open sheikhshack opened 3 years ago

sheikhshack commented 3 years ago

Hey guys, can comment whatever questions here to be asked to prof. Thanks!

sheikhshack commented 3 years ago

Production

  1. Do we need to use elastic IPs? Can we use placement groups for increasing throughput ?
  2. Are we graded based on speed of deployment? Or will just a deployment script that works be sufficient
  3. For 'take in credentials as input', are we assuming user feeds via .\aws\creds or must we let user specify each argument via CLI?
  4. For teardown, is terminating instances sufficient

HDFS

  1. Can we use libraries like this flintrock? Or others like in this article

Spark

  1. Can we just install mongo on the namenode and do an import direct via mongo cli?
  2. Or can we import mongo db data via spark?

General

  1. Any performance metric/benchmark we should compare with to evaluate how good our script is?
  2. What is meant by code quality?
sheikhshack commented 3 years ago

Prof Reply

Checkpoint 2 DB Creds