Open whencar opened 1 year ago
I find that some Scholars study crowd counting on basis of PVT (Pyramid Vision Transformer). So can I use iTPN to study crowd counting?
In my opinion, iTPN serves as a basic vision model, it can do what other networks can do.
I find that some Scholars study crowd counting on basis of PVT (Pyramid Vision Transformer). So can I use iTPN to study crowd counting?