Open VelizarVESSELINOV opened 8 years ago
I stopped using datalab API for storage and bigquery, now using only google-cloud packages for more portable code.
@VelizarVESSELINOV That's interesting! Are you custom building the VM instances and creating key-value stores from scratch, and applying your custom relational algebra on top of that? I'm just curious the level of granularity and the cloud packages in approaching portability.
Any updates about this issue? I'm facing the same problem
@pgrosu there are a lot custom file formats that need to be parsed, and for this reading StringIO is required. After parsing the output will be a Pandas dataframe optimal for data engineering/analytics/AI. The VM was custom server side defined.
When we try to read_from bucket storage file with size not far from 2 GB the system is going down even if the VM has 30 GB RAM.
From "Serial console output":
With Message dialog: