Closed mmiller-max closed 1 year ago
Hi @mmiller-max ,
Any more info? Is this reproducible? If so, can you share a small code snippet that creates a task which exhibits this behavior when deleted using the UI?
For me just this creates the error:
from clearml import Task
task = Task.init()
task.upload_artifact("artifact", {"1":1})
Then Ctrl+C
and delete in UI.
And yep it's reproducible. Could it be a file permissions thing perhaps?
Well, seems like an obvious but - we'll take a look, I'll update!
Cheers @jkhenning !
Trying to do a bit of debugging on this but can't see anything in the fileserver logs. Do I need to set something in logging.conf
in either the file server or the api server?
I think you should see console logs. The issue might be in the WebApp...
I think this is the corresponding web app log but can't see any errors with it:
35.191.10.5 - - [29/Mar/2022:13:52:30 +0000] "POST /api/v2.16/tasks.delete_many HTTP/1.1" 200 395 "https://app.{domain}/projects/c28adf12db964d169a645f5351a669de/experiments?columns=selected&columns=type&columns=name&columns=tags&columns=status&columns=project.name&columns=users&columns=started&columns=last_update&columns=last_iteration&columns=parent.name&order=-last_update&filter=&archive=true" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/14.1.2 Safari/605.1.15" "195.224.76.82,34.149.227.91"
Another bit of information, I only get undefined
in the error message if using the fileserver subdomain for the files (e.g. https://files.domain.com
. If I use a different URL (e.g. GCP bucket) it displays the URL (but still fails to delete)
One further comment, I'm pretty sure I never saw this error with ClearML Server v1.1.1
After updating to the latest server (1.9.1) I'm no longer seeing these errors so going to close this 👏
Appreciate the update @mmiller-max
When I delete a task using the Web GUI, I see the following message:
When I check the fileserver in the VM on which it's running, I can see that the artifacts are still there, for example at the location
/opt/clearml/data/fileserver/project/task/artifacts/...
So there seems to be two issues, one with artifacts not being deleted (which I wasn't aware happened with previous version of server) and one with the error message not showing what hasn't been deleted.
Server version is
1.2.0
running on GCP. Cheers!