Julian-Wyatt / AnoDDPM

CVPR Workshop paper - AnoDDPM: Anomaly Detection with Denoising Diffusion Probabilistic Models using Simplex Noise
https://julianwyatt.co.uk/anoddpm
MIT License
154 stars 27 forks source link

Datasets Question #12

Closed WilliamJudge94 closed 1 year ago

WilliamJudge94 commented 1 year ago

Hello,

I am trying to run this repo with the same data as presented in the paper in order to gain a baseline understanding of this diffusion model. However, I am having trouble setting up the datasets. For the datasets:

  1. Are these the links for the training and test sets?
  1. There are a lot of .tar files within the Testing dataset (if that link is correct). Did you decompress the files, open the individual datasets and save them as a .npy file?

  2. Will the files have to be read and saved to .npy files within these directories?

  3. Do you have a detailed folder structure and file format we need to use in order to get your code working?

Julian-Wyatt commented 1 year ago
  1. Yeah exactly those however I think the testing data is now closed or locked as others have since reported.
  2. Again, yes exactly, it was more efficient to open the file with the library in python (can't remember the name of it, I must import it in my datasets.py file) and then save the processed images as npy files.
  3. If you want to handle this the same as I did, I wrote a directory scraper to loop over all the files and save them - although don't believe I uploaded that with this repository.
  4. I've uploaded a screenshot of a minimal case. The datasets.py file should loop through the contents of those directories as the training, test and anomalous data sources. Screenshot 2023-03-26 at 16 01 30

If you have any further questions, please let me know