googledatalab / datalab

Interactive tools and developer experiences for Big Data on Google Cloud Platform.
Apache License 2.0
974 stars 249 forks source link

ZIP using CPU - not a user process #1912

Open VikramTiwari opened 6 years ago

VikramTiwari commented 6 years ago

screen shot 2018-01-16 at 10 29 10

It spawns up on its own every hour or so and tries to create a zip of content in /tmp/<random> directory. What is it and why does it do so?

If it's doing it for backup purposes or autosaving the context, it should skip binaries and csv/tsv files.

screen shot 2018-01-16 at 12 25 07

Seems like it's happening because of GCSBackup script https://github.com/googledatalab/datalab/blob/00c6cd4d3cd0c67a64c36f45599981c3d431b1d6/containers/datalab/content/GCSbackup.sh#L144-L147

Can I pass these as settings? I don't want backups to run except weekly. https://github.com/googledatalab/datalab/blob/dcedb1ef801ef0be4571c2afa7c7ffcae7eb28c4/sources/web/datalab/backupUtility.ts#L80

VikramTiwari commented 6 years ago

I was able to simply stop backups altogether by launching a new instance and passing flags to stop backups.