Closed coatk1 closed 3 years ago
https://www.tutorialspoint.com/hive/hive_quick_guide.htm
Spark runs onto of Hadoop to make Hadoop faster
Data mining is to look for patterns and make predictions
Data-warehouse uses ETL
Data-lake uses ELT (all raw data)
Kafka is used for data streaming
https://www.tutorialspoint.com/hive/hive_quick_guide.htm