Open CSteele97 opened 4 years ago
hey @CSteele97,
unet
is for segmentation task. Please take a look at this section for more info about unet
and second level model. In general, you simply load trained model and use it for your own task.prediction_path
is the path where results will be stored as json file.Hope this helps.
For the simple case of predicting on some new data, prepare sources and environment, then follow this section: https://github.com/neptune-ai/open-solution-mapping-challenge/blob/master/REPRODUCE_RESULTS.md#predict-on-new-data
Hi @kamil-kaczmarek thank you for your reply.
In the case of the REPRODUCE_RESULTS section for predict on new data, would the pipeline_name therefore be unet, as this is the trained model?
Thank you
Hey @CSteele97,
There is a full command provided in the aforementioned section. It looks like this:
python main.py predict_on_dir \
--pipeline_name unet_tta_scoring_model \
--chunk_size 1000 \
--dir_path path/to/inference_directory \
--prediction_path path/to/predictions.json
There is a pipeline name provided: unet_tta_scoring_model
Cheers, Kamil
Thanks @kamil-kaczmarek
I have been trying to run the command you mentioned, but I get an error 'no module named neptune'. I have followed all the previous steps (without a Neptune registration) and am not sure why I am getting this error or how to resolve it.
I appreciate your time in helping me figure all of this out!
Thank you
did you install neptune?
It will be simplest workaround
I have managed to solve the neptune issue using pip install neptune-cli, thanks
I have tried to run the above command however I am now receiving 'Error: No such command 'predict_on_dir'
I see that you installed neptune-cli
. This will very likely not work as neptune-cli
is our heritage library that we no longer support.
The best solution here is to create an environment using conda. Here is full specification of the conda environment: https://github.com/neptune-ai/open-solution-mapping-challenge/blob/master/environment.yml Conda docs about managing environments: https://docs.conda.io/projects/conda/en/latest/user-guide/tasks/manage-environments.html
Regarding Error with predict_on_dir
. Please make sure that you run this command from the repo root. I think that it will solve the problem. This method is defined in the main file: https://github.com/neptune-ai/open-solution-mapping-challenge/blob/master/main.py#L51
Hope this helps :)
Thanks Kamil,
I have updated my environment which seems to now be working.
I have been running the command from the open-solution-mapping-challenge directory - is this correct?
Thank you
Hey @CSteele97,
Yep, it should work.
Thanks Kamil,
I've tried running the command again from the aforementioned directory but it's still giving the predict_on_dir
error - any idea why this might be?
Hey,
Can you paste full error massage?
/anaconda3/envs/mapping/lib/python3.6/site-packages/sklearn/externals/joblib/init.py:15: FutureWarning: sklearn.externals.joblib is deprecated in 0.21 and will be removed in 0.23. Please import this functionality directly from joblib, which can be installed with: pip install joblib. If this warning is raised when loading pickled models, you may need to re-serialize those models with scikit-learn 0.21+.
warnings.warn(msg, category=FutureWarning)
/Users/open-solution-mapping-challenge/src/utils.py:132: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details.
config = yaml.load(f)
/anaconda3/envs/mapping/lib/python3.6/site-packages/lightgbm/init.py:46: UserWarning: Starting from version 2.2.1, the library file in distribution wheels for macOS is built by the Apple Clang (Xcode_9.4.1) compiler.
This means that in case of installing LightGBM from PyPI via the pip install lightgbm
command, you don't need to install the gcc compiler anymore.
Instead of that, you need to install the OpenMP library, which is required for running LightGBM on the system with the Apple Clang compiler.
You can install the OpenMP library by the following command: brew install libomp
.
"You can install the OpenMP library by the following command: brew install libomp
.", UserWarning)
/Users/open-solution-mapping-challenge/src/utils.py:132: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details.
config = yaml.load(f)
Usage: main.py [OPTIONS] COMMAND [ARGS]...
Try 'main.py --help' for help.
Error: No such command 'predict_on_dir'.
Great thanks,
Can you also paste full command that you use?
python main.py predict_on_dir \ --pipeline_name unet_tta_scoring_model \ --chunk_size 1000 \ --dir_path /test_images \ --prediction_path /data/experiments/predictions.json
Hi @CSteele97
I have just successfully run:
python main.py predict_on_dir \
--pipeline_name unet_tta_scoring_model \
--chunk_size 100 \
--dir_path data/paper_images \
--prediction_path data/paper_images_predictions.json
perhaps you didn't use the \
?
Hi, I got a different error here, when I ran the above command. Any idea?
ValueError: No transformer cached unet
I'm actually not sure where I should put the released checkpoint. Currently I've put them as /data/experiments/mapping_challenge_baseline/checkpoints/scoring_model
, /data/experiments/mapping_challenge_baseline/checkpoints/unet
.
I'm wondering that you've released checkpoints for scoring_model
and unet
, but to run the inference, it seems like we need transformer
to produce inference based on those released checkpoints. How those can be generated?
Hi @asahi417 those transformers that don't have any state are created on the fly so you only need unet
and scoring_model
.
Both of those trained models should be placed in the transformers
subfolder of your /data/experiments/mapping_challenge_baseline/transformers
so if you just put them there it should run an inference with no problems.
I tried to explain it in the Reproduce Results but I am not sure if it is clear:
project
|-- README.md
|-- ...
|-- data
|-- raw
|-- train
|-- images
|-- annotation.json
|-- val
|-- images
|-- annotation.json
|-- test_images
|-- img1.jpg
|-- img2.jpg
|-- ...
|-- meta
|-- masks_overlayed_eroded_{}_dilated_{} # it is generated automatically
|-- train
|-- distances
|-- masks
|-- sizes
|-- val
|-- distances
|-- masks
|-- sizes
|-- experiments
|-- mapping_challenge_baseline # this is where your experiment files will be dumped
|-- checkpoints # neural network checkpoints
|-- transformers # serialized transformers after fitting
|-- outputs # outputs of transformers if you specified save_output=True anywhere
|-- prediction.json # prediction on valid
I hope this helps.
Thanks, and I finally managed to run an inference with the released checkpoints, which is a huge progress! However, the inference is very random... Do you have any sense why it produces such a poor predictions?
Also, I'm wondering if it possible to finetune the released checkpoint to own dataset.
Hi there,
I think there may be something wrong with the indices of your images in the prediction file. It seems that those predictions belong to different images right? A simple way to debug is to run predict on folder with just one image in it. I had this problem in the past but I haven't encountered it in a while.
You can easily fine-tune by overriding (or simply pasting) a snippet that loads weights when you train in steps/pytorch.models.py.
@jakubczakon Hi, thanks for your feedback. I've tried to export segmentation over single image, but still attained similar results... Could you take a look my code where I export segmentation map from coco-formatted prediction file, which was produced by your python main.py predict_on_dir
script.
https://github.com/asahi417/open-solution-mapping-challenge-script
/anaconda3/envs/mapping/lib/python3.6/site-packages/sklearn/externals/joblib/init.py:15: FutureWarning: sklearn.externals.joblib is deprecated in 0.21 and will be removed in 0.23. Please import this functionality directly from joblib, which can be installed with: pip install joblib. If this warning is raised when loading pickled models, you may need to re-serialize those models with scikit-learn 0.21+. warnings.warn(msg, category=FutureWarning) /Users/open-solution-mapping-challenge/src/utils.py:132: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details. config = yaml.load(f) /anaconda3/envs/mapping/lib/python3.6/site-packages/lightgbm/init.py:46: UserWarning: Starting from version 2.2.1, the library file in distribution wheels for macOS is built by the Apple Clang (Xcode_9.4.1) compiler. This means that in case of installing LightGBM from PyPI via the
pip install lightgbm
command, you don't need to install the gcc compiler anymore. Instead of that, you need to install the OpenMP library, which is required for running LightGBM on the system with the Apple Clang compiler. You can install the OpenMP library by the following command:brew install libomp
. "You can install the OpenMP library by the following command:brew install libomp
.", UserWarning) /Users/open-solution-mapping-challenge/src/utils.py:132: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details. config = yaml.load(f) Usage: main.py [OPTIONS] COMMAND [ARGS]... Try 'main.py --help' for help.Error: No such command 'predict_on_dir'.
I solved this error in a different way. Inside the main.py script you will find a line before the function definition as @main.command()
. You actually need to provide a string as the argument to this click method. The string should be the one that you use in the command line, i.e., predict_on_dir
here. So the line before the predict_on_dir
method should be @main.command('predict_on_dir')
. Do the same for all other methods to run it from the command line using click.
Hi, @jakubczakon
I would like to use the model weights to detect buildings from my own imagery, but I'm not entirely sure how to do this. I notice there are two files on the following website (https://ui.neptune.ai/neptune-ai/Mapping-Challenge/e/MC-1057/artifacts) - but I am not sure which file is the model weights and how to implement it on my own imagery. I have also seen the 'Predict on new data' section of REPRODUCE_RESULTS but I do not know what the pipeline_name would be or the prediction_path.
I hope this makes sense, I am very new to machine learning so do not yet understand a lot of things.
I would really appreciate it if you could provide some instructions on how I can achieve this. Thank you.