Hi, thanks for this great tutorial. Please help: I get an error when running the training process.
Commands:
# building the image (CPU)
docker build -t colemurray/age-gender-estimation-tutorial:cpu .
# running the train process (CPU)
docker run -v $PWD:/opt/app \
  -e PYTHONPATH=$PYTHONPATH:/opt/app \
  -it colemurray/age-gender-estimation-tutorial:cpu \
  python3 /opt/app/bin/train.py \
  --img-dir /opt/app/var/crop \
  --train-csv /opt/app/var/train.csv \
  --val-csv /opt/app/var/val.csv \
  --model-dir /opt/app/var/cnn-model \
  --img-size 224 \
  --num-steps 200000
# error output
INFO:tensorflow:Using config: {'_keep_checkpoint_max': 5, '_save_checkpoints_secs': None, '_global_id_in_cluster': 0, '_keep_checkpoint_every_n_hours': 10000, '_task_id': 0, '_cluster_spec': <tensorflow.python.training.server_lib.ClusterSpec object at 0x7f59660a6e80>, '_eval_distribute': None, '_is_chief': True, '_evaluation_master': '', '_log_step_count_steps': 100, '_protocol': None, '_device_fn': None, '_num_ps_replicas': 0, '_num_worker_replicas': 1, '_service': None, '_tf_random_seed': None, '_save_summary_steps': 100, '_master': '', '_session_config': allow_soft_placement: true
graph_options {
rewrite_options {
meta_optimizer_iterations: ONE
}
}
, '_save_checkpoints_steps': 1500, '_task_type': 'worker', '_model_dir': '/opt/app/var/cnn-model', '_experimental_distribute': None, '_train_distribute': None}
INFO:tensorflow:Not using Distribute Coordinator.
INFO:tensorflow:Running training and evaluation locally (non-distributed).
INFO:tensorflow:Start train and evaluate loop. The evaluate will happen after every checkpoint. Checkpoint frequency is determined based on RunConfig arguments: save_checkpoints_steps 1500 or save_checkpoints_secs None.
INFO:tensorflow:Calling model_fn.
INFO:tensorflow:Done calling model_fn.
INFO:tensorflow:Create CheckpointSaverHook.
INFO:tensorflow:Graph was finalized.
2020-01-23 02:16:32.502323: I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2020-01-23 02:16:32.583359: W tensorflow/core/framework/allocator.cc:122] Allocation of 411041792 exceeds 10% of system memory.
2020-01-23 02:16:32.846042: W tensorflow/core/framework/allocator.cc:122] Allocation of 411041792 exceeds 10% of system memory.
2020-01-23 02:16:33.128939: W tensorflow/core/framework/allocator.cc:122] Allocation of 411041792 exceeds 10% of system memory.
2020-01-23 02:16:33.366611: W tensorflow/core/framework/allocator.cc:122] Allocation of 411041792 exceeds 10% of system memory.
2020-01-23 02:16:33.714329: W tensorflow/core/framework/allocator.cc:122] Allocation of 411041792 exceeds 10% of system memory.
INFO:tensorflow:Running local_init_op.
INFO:tensorflow:Done running local_init_op.
INFO:tensorflow:Saving checkpoints for 0 into /opt/app/var/cnn-model/model.ckpt.
Traceback (most recent call last):
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/client/session.py", line 1334, in _do_call
    return fn(*args)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/client/session.py", line 1319, in _run_fn
    options, feed_dict, fetch_list, target_list, run_metadata)
  File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/client/session.py", line 1407, in _call_tf_sessionrun
    run_metadata)
**tensorflow.python.framework.errors_impl.NotFoundError: /opt/app/var/train.csv; No such file or directory**
[[{{node IteratorGetNext}} = IteratorGetNext[output_shapes=[[?,224,224,3], [?], [?]], output_types=[DT_FLOAT, DT_INT64, DT_INT64], _device="/job:localhost/replica:0/task:0/device:CPU:0"](OneShotIterator)]]
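The NotFoundError above says that /opt/app/var/train.csv is not visible inside the container. My guess is that either the CSV was never generated, or $PWD was not the project root when I ran docker run, but I have not confirmed either. Below is a minimal sketch of a check that could be run inside the container before training; the helper name missing_paths is hypothetical and not part of the tutorial, and the paths are simply the ones from my command.

```python
import os


def missing_paths(paths):
    """Return the paths from the given list that do not exist on disk."""
    return [p for p in paths if not os.path.exists(p)]


if __name__ == "__main__":
    # Paths taken from the docker run command above (as seen inside the container).
    required = [
        "/opt/app/var/crop",
        "/opt/app/var/train.csv",
        "/opt/app/var/val.csv",
    ]
    for path in missing_paths(required):
        print("missing:", path)
```

If this prints any path, the training script will presumably fail the same way until that file or directory exists under the mounted folder.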
If you don't mind, please help, and consider adding more documentation about this step. Thank you so much.