shackenberg / Minimal-Bag-of-Visual-Words-Image-Classifier

Implementation of a content based image classifier using the bag of visual words approach in Python together with Lowe's SIFT and Libsvm.
236 stars 94 forks source link

Increasing accuracy #15

Closed IshaanGupta5 closed 8 years ago

IshaanGupta5 commented 8 years ago

Hi.Hope you're doing fine.I need some help.

I've been running your code for classifying food images(banana,strawberry,pizza,donuts etc).So far the accuracy comes out to be 72% and I was hoping if there were any ways to increase it.

I read up on the net and found that normalizing the features in k-means clustering gives better results.Also,I read up on how SIFT features are computed and found that the images are scaled before hand.Do they provide the same functionality?

Do you know any way to increase the accuracy?Also, how did you decide that the number of clusters will be the square root of the number of features?

Any insight is appreciated.Thanks.

shackenberg commented 8 years ago

Dear Ishaan Gupta,

please have a look at https://github.com/shackenberg/phow_caltech101.py. As written in the readme the phow_caltech code is very simliar but should provide much better results. And if you really need much better or state of the art performance you need to look into deep learning.

Ludwig

IshaanGupta5 commented 8 years ago

I found the link https://github.com/shackenberg/phow_caltech101.py but thought that it would only provide faster computation, not accuracy.I'll surely give this a try then. Thank you again for the deep learning link and for the help.Really appreciate it.

shackenberg commented 8 years ago

but thought that it would only provide faster computation, not accuracy. Dear Ishaan Gupta, that is a good point. I will need to make that clearer in the readme.