BCCN-Prog / weather_2016

For the BCCN 2016 advanced programming project
3 stars 1 forks source link

Validate timeanddate data #89

Closed erensezener closed 8 years ago

erensezener commented 8 years ago

Please download the data with _dt suffix here: https://drive.google.com/folderview?id=0BwQc_CC3arWWMTNYaEpCOHlKZmc&usp=sharing

And look at nanmax(), unique() etc of columns and the number of entries to see if it makes sense.

It works like this:

import h5py
import numpy as np

>>> h5 = h5py.File('hourly_database.hdf5', 'r'); data = h5['weather_data'][:]
>>> np.unique(data[:,2])
array([  0.00000000e+00,   1.00000000e+00,   4.00000000e+00,
         2.01606212e+11])

Beware that the data is padded with rows of zero from the bottom.

akresnia commented 8 years ago

Checked, everything is ok, only 'temperature' and 'station_id' columns are full of nans (but these parameters exist only for historical data)

2016-07-15 11:30 GMT+02:00 C. Eren Sezener notifications@github.com:

Assigned #89 https://github.com/BCCN-Prog/weather_2016/issues/89 to @akresnia https://github.com/akresnia.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/BCCN-Prog/weather_2016/issues/89#event-724126310, or mute the thread https://github.com/notifications/unsubscribe-auth/ARq9KVhlgivm_fo34Pg1ap9gVmIyiSZcks5qV1M5gaJpZM4JNOh2 .