I have run into a problem with edx2bigquery where there is a failure to process a tracklog for a day; the error in the BigQuery interface is:
gs://tracklog-2017-11-17.json.gz: Error while reading data, error message: JSON parsing error in row starting at position 12345: No such field: event_struct.duration. (error code: invalid)
The portion in the file that triggers this is:
"event_struct": {"duration": 123.45
This will only occur for files that contain data with that specific JSON schema.
As a result of this, the job fails to complete, and no data for this tracklog file is uploaded to BigQuery. The edx2bigquery library itself fails silently in its execution.
Hi,
I have run into a problem with edx2bigquery where there is a failure to process a tracklog for a day; the error in the BigQuery interface is:
gs://tracklog-2017-11-17.json.gz: Error while reading data, error message: JSON parsing error in row starting at position 12345: No such field: event_struct.duration. (error code: invalid)
The portion in the file that triggers this is:
"event_struct": {"duration": 123.45
This will only occur for files that contain data with that specific JSON schema.
As a result of this, the job fails to complete, and no data for this tracklog file is uploaded to BigQuery. The edx2bigquery library itself fails silently in its execution.
Would someone please be able to look into this? I did some searching and this looks to be a similar situation, with possibly a solution: https://stackoverflow.com/questions/25279116/cannot-insert-new-value-to-bigquery-table-after-updating-with-new-column-using-s
Thank you!