amplab / training

Training materials for Strata, AMP Camp, etc

"Data exploration with Spark SQL" asks you to query the "text" field without having ever mentioned the "text" field #205

Open gostevehoward opened 8 years ago

gostevehoward commented 8 years ago

http://www.cs.berkeley.edu/~jey/ampcamp6/training/data-exploration-using-spark-sql.html

The exercise at the end ("How many articles contain the word “california”?") requires you to use the "text" field of wikiData. Unless I missed an earlier explanation of the schema, the exercise is confusing because the "text" field is never mentioned anywhere before it. Introducing the field explicitly would fix this; it would also suffice to mention some way to explore the schema (e.g., wikiData.schema.fields).