Open ccasimiro9444 opened 7 years ago
GCP == Google Cloud Platform ? Cool : What is GCP?
Yes, Google Cloud Platform. There are some large public datasets on Google's BigQuery that can be imported into Google Cloud Datalab (similar to Jupyter Notebooks), and I want to use Python to visualize the data. I come from a stats background, so Python and GCP are kind of new to me. I saw this in action and want to try to replicate some of it. But I am already stuck importing the BigQuery data into Datalab :) Hope you guys can help me figure it out.
My question on Stack Overflow: https://stackoverflow.com/questions/44172105/load-bigquery-data-to-datalab
SELECT
pickup_datetime,
dropoff_datetime
FROM
bigquery-public-data.new_york.tlc_yellow_trips_20*
Go here https://cloud.google.com/bigquery/public-data/nyc-tlc-trips
and click on the "goto the new york city dataset" button
"""
SELECT pickup_datetime, dropoff_datetime FROM `bigquery-public-data.new_york.tlc_yellow_trips_20*` """)
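For anyone following along, here is a minimal sketch of how a query like the one above could be pulled into a pandas dataframe. It assumes the `google-cloud-bigquery` client library (plus pandas) is installed and that credentials are already set up. The helper names `build_query` and `load_dataframe` and the `LIMIT` value are made up for illustration, and this is not necessarily what the linked notebook does.

```python
def build_query(columns, table, limit=None):
    """Build a standard-SQL SELECT; the wildcard table name must be backtick-quoted."""
    sql = "SELECT {} FROM `{}`".format(", ".join(columns), table)
    if limit is not None:
        sql += " LIMIT {}".format(int(limit))
    return sql


def load_dataframe(sql):
    """Run the query and return a pandas DataFrame (needs GCP credentials)."""
    # Imported lazily so the query builder works without the client library installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    return client.query(sql).to_dataframe()


QUERY = build_query(
    ["pickup_datetime", "dropoff_datetime"],
    "bigquery-public-data.new_york.tlc_yellow_trips_20*",
    limit=1000,  # hypothetical cap to keep a first test run small
)
# df = load_dataframe(QUERY)  # uncomment in Datalab or any authenticated environment
```

Starting with a small `LIMIT` is just a way to check that the import works before asking for the full table.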
@grandpotato @ccasimiro9444 here is the notebook: https://github.com/danfowler/bkkhack-stuff/blob/master/nyc-taxi-data.ipynb
Got it to run on Datalab; I just used Dan's code.
Excellent! 💯
By the way, I finally got datalab working on my Google account 😉
Maybe, if you haven't gotten around to it by then, we can work on the visualization piece at the next bkkhack.
Nice!
Unfortunately I've had 0 success so far. So I'm going to just have to leave it here for now. :(
Next time I'll just clear everything and start from scratch and see if that helps.
Sounds good. We need to get the whole dataset into a dataframe first, and the direct import from BigQuery to a dataframe takes too long. Maybe loading the table and then transforming it will be faster. Let's try that and the visualization at the next hack.
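One possible way around the slow import is to let BigQuery do the transformation and pull only the summary into the dataframe. Note that this is a different approach from loading the table first, and it is only a sketch under assumptions: that the two datetime columns are TIMESTAMPs, and that trips per pickup hour with average duration is something worth plotting. `hourly_summary_sql` is a made-up helper name.

```python
def hourly_summary_sql(table):
    """Standard-SQL query that returns roughly one row per pickup hour."""
    return (
        "SELECT\n"
        "  EXTRACT(HOUR FROM pickup_datetime) AS pickup_hour,\n"
        "  COUNT(*) AS trips,\n"
        "  AVG(TIMESTAMP_DIFF(dropoff_datetime, pickup_datetime, MINUTE)) AS avg_minutes\n"
        "FROM `{}`\n"
        "GROUP BY pickup_hour\n"
        "ORDER BY pickup_hour"
    ).format(table)


SUMMARY_SQL = hourly_summary_sql("bigquery-public-data.new_york.tlc_yellow_trips_20*")
print(SUMMARY_SQL)
```

The resulting dataframe would have a couple of dozen rows instead of the full trip table, which should be quick to import and easy to plot.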
Using NYC yellow taxi dataset