GeoNode / geonode

GeoNode is an open source platform that facilitates the creation, sharing, and collaborative use of geospatial data.
https://geonode.org/

Enable upload of huge (50gb) documents #11636

Open · gannebamm opened this issue 1 year ago

gannebamm commented 1 year ago

We need to upload massive (50 GB) documents to one of our GeoNode instances.

Expected Behavior

After raising the Upload Size Limits via the Django admin (https://docs.geonode.org/en/master/admin/upload-size-limits/index.html#upload-size-limits) and increasing the harakiri value in uwsgi.ini (https://github.com/GeoNode/geonode/blob/master/uwsgi.ini#L22), you should be able to upload huge files.
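For reference, a minimal sketch of the uwsgi.ini part (the value is only an example; it should exceed the time your longest upload can take):

# uwsgi.ini
harakiri = 3600            ; max seconds a single request may run before the worker is recycled
harakiri-verbose = true    ; log details about requests that hit the limit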

Actual Behavior

The upload triggers an error stating that it was not successful and that we should check the file's integrity. A document object is created nonetheless, but when we download the file from the frontend it is capped at 1 GB and will not unzip.

Steps to Reproduce the Problem

  1. Tweak the uwsgi.ini file for higher harakiri values
  2. Raise the Upload Size Limits in the Django admin (see the sketch after this list)
  3. Upload a huge document (zip)
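A minimal sketch of step 2 done from the Django shell instead of the admin UI. The model path geonode.upload.models.UploadSizeLimit, the field max_size and the slug document_upload_size are assumptions based on current GeoNode versions; verify them against the linked documentation for your release:

# run inside the GeoNode Django environment, e.g. ./manage.py shell
from geonode.upload.models import UploadSizeLimit

# 50 GB expressed in bytes: 50 * 1024**3 = 53687091200
UploadSizeLimit.objects.update_or_create(
    slug="document_upload_size",
    defaults={"max_size": 50 * 1024**3},
)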

Specifications

Did I miss something?

Some error messages I spotted in geonode.log:

uwsgi_response_write_body_do() TIMEOUT !!!
OSError: write error
---
worker 9 lifetime reached, it was running for 6601 second(s)
worker 10 lifetime reached, it was running for 6601 second(s)
worker 11 lifetime reached, it was running for 6601 second(s)
worker 12 lifetime reached, it was running for 6601 second(s)
worker 13 lifetime reached, it was running for 6601 second(s)
worker 14 lifetime reached, it was running for 6601 second(s)
worker 15 lifetime reached, it was running for 6601 second(s)
Respawned uWSGI worker 9 (new pid: 9472)
Respawned uWSGI worker 10 (new pid: 9473)
Respawned uWSGI worker 11 (new pid: 9474)
Respawned uWSGI worker 12 (new pid: 9475)
Respawned uWSGI worker 13 (new pid: 9476)
Respawned uWSGI worker 14 (new pid: 9477)
Respawned uWSGI worker 15 (new pid: 9478)
worker 16 lifetime reached, it was running for 6601 second(s)
Respawned uWSGI worker 16 (new pid: 9479)
t-book commented 1 year ago

@gannebamm what is the client_max_body_size in your NGINX conf?

gannebamm commented 1 year ago

Good idea. I haven't changed anything in that regard in geonode.conf (https://github.com/GeoNode/geonode-docker/blob/master/docker/nginx/geonode.conf.envsubst).

Therefore it is using these values:

# max upload size
client_max_body_size 100G;
client_body_buffer_size 256K;
client_body_timeout 600s;
large_client_header_buffers 4 64k;

proxy_connect_timeout       600;
proxy_send_timeout          600;
proxy_read_timeout          600;
uwsgi_read_timeout          600;
send_timeout                600;

The client_max_body_size 100G; seems fine, but the timeouts could all be hit.

uwsgi_response_write_body_do() TIMEOUT !!!

is likely caused by uwsgi_read_timeout 600; being too small.
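A sketch of the corresponding geonode.conf change (the 3600 value is only an example and should roughly match the uwsgi harakiri, not a value recommended anywhere in this thread):

# geonode.conf (nginx) -- raise the proxy/uwsgi timeouts together
proxy_connect_timeout       3600;
proxy_send_timeout          3600;
proxy_read_timeout          3600;
uwsgi_read_timeout          3600;
send_timeout                3600;
client_body_timeout         3600s;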

t-book commented 1 year ago

that could be the case, yes!

t-book commented 1 year ago

In case you change it, be sure that it really changed: in my case it was somehow set back to the old value on container restart.
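A likely explanation in the docker setup (an assumption based on the geonode-docker layout linked above, not something confirmed in this thread): the running geonode.conf is rendered from an envsubst template when the nginx container starts, so edits made to the rendered file inside the container do not survive a restart.

# edit the template in the geonode-docker checkout instead of the rendered file,
# then recreate the nginx container so the config is regenerated from it
docker/nginx/geonode.conf.envsubst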

gannebamm commented 1 year ago

OK, I almost have it. Remaining error:

geonode.log

Wed Oct 25 17:56:22 2023 - worker 6 (pid: 344) is taking too much time to die...NO MERCY !!!
[busyness] 1s average busyness is at 0%, cheap one of 9 running workers
worker 6 killed successfully (pid: 344)
uWSGI worker 6 cheaped.
Wed Oct 25 17:56:52 2023 - worker 7 (pid: 345) is taking too much time to die...NO MERCY !!!
worker 7 killed successfully (pid: 345)
uWSGI worker 7 cheaped.

Will check for this tomorrow. Time to leave :wave:

t-book commented 1 year ago

Wed Oct 25 17:56:22 2023 - worker 6 (pid: 344) is taking too much time to die...NO MERCY !!!

I guess harakiri is still too small. In other words, the process did not finish in time and is getting killed with ... no mercy :)
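As a rough sanity check (the throughput figure is an assumption, not a measurement from this thread): at an effective ~50 MB/s, transferring 50 GB takes about 50 * 1024 / 50 ≈ 1024 seconds, so a 600 s harakiri (or 600 s nginx timeouts) cannot be enough for the transfer alone, and any post-processing of the zip adds to that.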

giohappy commented 1 year ago

We had to tweak the same values for a client with a huge download: uwsgi_read_timeout inside nginx.conf and harakiri inside uwsgi.ini

mkrueger-dev commented 2 months ago

Hi, I am facing the same problem when downloading datasets larger than 1 GB from the frontend: the download is capped at 1 GB. I have already tweaked the values mentioned above and also raised GeoServer's WPS limits accordingly. These are my current configurations:

uwsgi.ini:

harakiri = 600       ; also tried 1200

geonode.conf:

client_max_body_size 100G;
client_body_buffer_size 256K;
client_body_timeout 600s; 
large_client_header_buffers 4 64k;

proxy_connect_timeout       600;       # also tried 1200 for all of the following
proxy_send_timeout          600;
proxy_read_timeout          600;
uwsgi_read_timeout          600;
send_timeout                600;

This is the error I get:

nginx4mygeonode      | 2024/07/23 07:21:52 [error] 18#18: *133059 readv() failed (104: Connection reset by peer) while reading upstream, client: 999.999.999.999, server: mygeonode.de, request: "GET /datasets/geonode:Lot2_Sidescan_g/dataset_download HTTP/1.1", upstream: "http://172.18.0.6:8000/datasets/geonode:Lot2_Sidescan_g/dataset_download", host: "mygeonode.de", referrer: "https://mygeonode.de/"
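A hedged reading of that line, following the diagnosis earlier in this thread rather than anything confirmed for this setup: readv() failed (104: Connection reset by peer) while reading upstream means the upstream (the uwsgi/Django container) closed the connection mid-response, which is what happens when a worker is recycled by harakiri or a lifetime limit during a long transfer. One way to check is to correlate the timestamp with the uwsgi side:

# the container name is an assumption based on the nginx4<project> pattern above
docker logs django4mygeonode 2>&1 | grep -iE "harakiri|lifetime reached|killed"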