UOB-AI / UOB-AI.github.io

A repository to host our documentations website.
https://UOB-AI.github.io
1 stars 3 forks source link

DynAE model , Is the keras version changed?! #53

Closed Amalsalem closed 2 months ago

Amalsalem commented 6 months ago

I used to run my model in: https://hayrat.uob.edu.bh/node/gpu02/26344/lab/tree/DynAE_Amal/OK_dynAE_bloodmnist-ablation_p3.ipynb

but now i run it and give me this error:


ModuleNotFoundError Traceback (most recent call last) Cell In[1], line 16 14 import numpy as np 15 from time import time ---> 16 from DynAE import DynAE 17 from datasets import load_data, load_data_conv 18 import metrics

File ~/DynAE_Amal/DynAE.py:41 39 import tensorflow.compat.v2 as tf 40 from keras import backend ---> 41 from keras.distribute import distributed_file_utils 42 from keras.distribute import worker_training_state 43 #from keras.optimizers import optimizer_experimental

ModuleNotFoundError: No module named 'keras.distribute'

asubah commented 5 months ago

We do update the libraries in the base environment from time to time. If you need a specific environment, you can request it or create it yourself using Python venv under /data/datasets/.

Amalsalem commented 5 months ago

I tried to fix it , bur i couldnot .

Could you please check it Notebook : DynAE_Amal__AP / Fmnist_65%-Figure.ipynb

FailedPreconditionError: Graph execution error:

Detected at node StatefulPartitionedCall defined at (most recent call last): File "/data/software/miniconda3/lib/python3.9/runpy.py", line 197, in _run_module_as_main

File "/data/software/miniconda3/lib/python3.9/runpy.py", line 87, in _run_code

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel_launcher.py", line 17, in

File "/data/software/miniconda3/lib/python3.9/site-packages/traitlets/config/application.py", line 1043, in launch_instance

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel/kernelapp.py", line 711, in start

File "/data/software/miniconda3/lib/python3.9/site-packages/tornado/platform/asyncio.py", line 195, in start

File "/data/software/miniconda3/lib/python3.9/asyncio/base_events.py", line 601, in run_forever

File "/data/software/miniconda3/lib/python3.9/asyncio/base_events.py", line 1905, in _run_once

File "/data/software/miniconda3/lib/python3.9/asyncio/events.py", line 80, in _run

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel/kernelbase.py", line 510, in dispatch_queue

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel/kernelbase.py", line 499, in process_one

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel/kernelbase.py", line 406, in dispatch_shell

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel/kernelbase.py", line 729, in execute_request

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel/ipkernel.py", line 411, in do_execute

File "/data/software/miniconda3/lib/python3.9/site-packages/ipykernel/zmqshell.py", line 531, in run_cell

File "/data/software/miniconda3/lib/python3.9/site-packages/IPython/core/interactiveshell.py", line 3009, in run_cell

File "/data/software/miniconda3/lib/python3.9/site-packages/IPython/core/interactiveshell.py", line 3064, in _run_cell

File "/data/software/miniconda3/lib/python3.9/site-packages/IPython/core/async_helpers.py", line 129, in _pseudo_sync_runner

File "/data/software/miniconda3/lib/python3.9/site-packages/IPython/core/interactiveshell.py", line 3269, in run_cell_async

File "/data/software/miniconda3/lib/python3.9/site-packages/IPython/core/interactiveshell.py", line 3448, in run_ast_nodes

File "/data/software/miniconda3/lib/python3.9/site-packages/IPython/core/interactiveshell.py", line 3508, in run_code

File "/tmp/ipykernel_4156538/999318313.py", line 2, in

File "/home/nfs/20015279/DynAE_Amal_Ap/DynAE_fmnist.py", line 2131, in compute_cm

File "/home/nfs/20015279/DynAE_Amal_Ap/DynAE_fmnist.py", line 356, in predict_encoder

File "/home/nfs/20015279/.local/lib/python3.9/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler

File "/home/nfs/20015279/.local/lib/python3.9/site-packages/keras/src/backend/tensorflow/trainer.py", line 515, in predict

File "/home/nfs/20015279/.local/lib/python3.9/site-packages/keras/src/backend/tensorflow/trainer.py", line 213, in one_step_on_data_distributed

DNN library initialization failed. Look at the errors above for more details. [[{{node StatefulPartitionedCall}}]] [Op:__inference_one_step_on_data_distributed_455]

Amalsalem commented 5 months ago

I tried to fix it , but i couldn't .

Could you please check it Notebook : DynAE_Amal__AP / Fmnist_65%-Figure.ipynb

On Wed, Apr 17, 2024 at 12:16 PM Abdulla Subah @.***> wrote:

We do update the libraries in the base environment from time to time. If you need a specific environment, you can request it or create it yourself using Python venv under /data/datasets/.

— Reply to this email directly, view it on GitHub https://github.com/UOB-AI/UOB-AI.github.io/issues/53#issuecomment-2060791476, or unsubscribe https://github.com/notifications/unsubscribe-auth/AY3B76GSX25Z432ZWTXZP3TY5Y4VTAVCNFSM6AAAAABFYLIGFSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANRQG44TCNBXGY . You are receiving this because you authored the thread.Message ID: @.***>

Amalsalem commented 5 months ago

The current TensorFlow version: 2.17.0-dev20240320 not compatible with CuDNN library , we need to upgrade CuDNN to 8.9.6

this may solve the issue . I donot have privileges to do so.

thanks

On Wed, Apr 17, 2024 at 6:06 PM Amal Shaheen @.***> wrote:

I tried to fix it , but i couldn't .

Could you please check it Notebook : DynAE_Amal__AP / Fmnist_65%-Figure.ipynb

On Wed, Apr 17, 2024 at 12:16 PM Abdulla Subah @.***> wrote:

We do update the libraries in the base environment from time to time. If you need a specific environment, you can request it or create it yourself using Python venv under /data/datasets/.

— Reply to this email directly, view it on GitHub https://github.com/UOB-AI/UOB-AI.github.io/issues/53#issuecomment-2060791476, or unsubscribe https://github.com/notifications/unsubscribe-auth/AY3B76GSX25Z432ZWTXZP3TY5Y4VTAVCNFSM6AAAAABFYLIGFSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANRQG44TCNBXGY . You are receiving this because you authored the thread.Message ID: @.***>