dk-liang / CLTR

[ECCV 2022] An End-to-End Transformer Model for Crowd Localization
MIT License

Training with NWPU #11

Closed (henvh closed this issue 1 year ago)

henvh commented 1 year ago

Hello, I want to train the model on the NWPU dataset. Since I am located in Germany, I can't download the resized NWPU dataset from Baidu, so I took the 'preprocessing' from the JHU dataset and applied it to the original NWPU dataset. I started the training with the parameters from your provided log file, but the model didn't learn anything (MAE and MSE stayed constant). I then tried a larger learning rate, but that raised an error about the cost matrix ("matrix contains invalid numeric entries") when calculating the linear sum assignment (matcher.py line 80, `linear_sum_assignment` from `scipy.optimize`). Does the resized NWPU dataset you provided on Baidu have characteristics that differ from applying the JHU preprocessing to the original NWPU? What else could cause this behavior? Thanks.
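For anyone hitting the same message: SciPy's `linear_sum_assignment` raises this `ValueError` when the cost matrix contains NaN entries, which is what one would expect if the loss diverges after raising the learning rate. Below is a minimal sketch of a check that could be run on the cost matrix before the assignment. The helper names `check_cost_matrix` and `safe_assignment` are hypothetical and not part of the CLTR code; the sketch assumes the cost matrix is available as a NumPy array (or something convertible to one).

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def check_cost_matrix(cost):
    """Count NaN and infinite entries in a cost matrix (hypothetical helper)."""
    cost = np.asarray(cost, dtype=float)
    return {
        "nan": int(np.isnan(cost).sum()),
        "inf": int(np.isinf(cost).sum()),
    }


def safe_assignment(cost):
    """Run the assignment only if every entry is finite (hypothetical helper).

    Raising early with a count of bad entries makes it easier to tell
    whether the problem comes from the data or from diverging predictions.
    """
    report = check_cost_matrix(cost)
    if report["nan"] or report["inf"]:
        raise ValueError(f"non-finite cost entries: {report}")
    return linear_sum_assignment(np.asarray(cost, dtype=float))


if __name__ == "__main__":
    good = np.array([[1.0, 2.0], [2.0, 1.0]])
    rows, cols = safe_assignment(good)
    print(rows, cols)

    bad = np.array([[1.0, np.nan], [2.0, 1.0]])
    print(check_cost_matrix(bad))
```

If the check reports NaN entries only after a few iterations, the predictions or loss have likely diverged (a learning rate or annotation scaling issue) rather than the matcher itself being at fault.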

dk-liang commented 1 year ago

I will upload the datasets to OneDrive as soon as possible.

dk-liang commented 1 year ago

You can download them from https://1drv.ms/u/s!Ak_WZsh5Fl0lhF0V7sxTVv1Vs0Aq?e=drd48k

henvh commented 1 year ago

Thanks!