Open CrosbyMonk opened 3 years ago
Thanks a lot @CrosbyMonk for reporting this issue. It appears that the directory that is tried to be deleted is a directory created by the pynisher for storing the output of a subprocess. Therefore, it is to be expected that Auto-sklearn tries to delete it.
However, there's now the question why the temporary directory is not empty? Are you still able to see the content of that directory and the files in there? Maybe the cleanup and the join need to be switched (https://github.com/automl/pynisher/blob/master/pynisher/limit_function_call.py#L281)?
The directory was empty from the time the process died. See the above output from doing an ls
on /data/shared/tmp/tmpg00q7u62
.
Hey, are you able to constantly reproduce this or did this only happen a single time?
Apparently missed your comment. 100% reproducible for me. Data science notebook base is jupyter/datascience-notebook:7e07b801d92b
with the following additional packages installed.
docker
RUN apt-get update \
&& DEBIAN_FRONTEND=noninteractive apt-get install --no-install-recommends -y \
less \
apt-transport-https \
apt-utils \
build-essential \
curl \
freeglut3-dev \
gdebi-core \
git \
graphviz \
krb5-config \
krb5-user \
libclang-dev \
libcurl4-openssl-dev \
libedit2 \
libnlopt-dev \
libsasl2-dev \
libsasl2-modules-gssapi-mit \
libspatialindex-dev \
libkrb5-dev \
libssl1.1 \
libssl-dev \
libxml2-dev \
netcat \
net-tools \
openssh-server \
psmisc \
rsync \
sf-dpl \
vim \
tesseract-ocr-all \
xvfb \
&& apt upgrade -y \
&& apt-get autoclean \
&& apt-get clean \
&& apt-get autoremove -y
and
docker
RUN python3 -m pip --no-cache-dir install --upgrade \
bs4 \
cloudpickle \
configparser \
cython \
flask \
graphviz \
impyla \
ipywidgets \
kerberos \
matplotlib \
numpy \
pandas \
pandasql \
pytest \
sasl \
scikit-learn \
scipy \
setuptools \
thrift \
thrift_sasl==0.2.1
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs for the next 7 days. Thank you for your contributions.
Describe the bug
It appears auto-sklearn want's to delete the python temp dir versus using the provided directories.
Code Snippet
Error:
Contents of
/data/shared/tmp/tmpg00q7u62
:Contents of
/users/jihh/automl/auto-sklearn/temp_housing/
:To Reproduce
Running the notebook in a jupyterlab environment.
Expected behavior
Expect it wouldn't try to manage directories that it doesn't need to.
Actual behavior, stacktrace or logfile
auto_sklearn.log
Environment and installation:
Please give details about your installation:
Jupyterlab running a version of the DataScience Notebook image. See auto_sklearn.log for version information.