Closed favetelinguis closed 8 years ago
Sorry about the trouble, Henrik. I just re-ran this example and it works fine for me. I prepared a simpler example for debugging, using only 100 samples from the movie dataset. Maybe you could try execute them on your system so that we could maybe get a better idea of what's going on. Attached are 4 files, the small movie dataset, the vectorizer.py script, and the 2 scripts to execute. The first one, ch08_pickle-dump-test.py, creates the classifier and stopword pickle files, and the second one, python ch08_pickle-load-test.py, loads the vectorizer and the classifier to make a prediction. The files should be all in the same directory, e.g., just put them on your desktop or so.
When I execute the two files, I get the following ...
~/Desktop$ python ch08_pickle-dump-test.py
~/Desktop$ python ch08_pickle-load-test.py
Prediction: positive
Probability: 85.71%
Would be nice if you could check whether they also throw this AttributeError: Can't get attribute 'tokenizer' on <module '__main__'>
so that we know more!
PS: Sorry, had to ZIP the files since GitHub complained about the attachment via a "Unfortunately, we don’t support that file type. Try again with a PNG, GIF, JPG, DOCX, PPTX, XLSX, TXT, PDF, or ZIP."). pickle-debugging-1.zip
Best, Sebastian
Thanks for a fast reply, trying your new example works. And after changes to a new conda virtual env i could also get my code working, must have messed something up in the env.
Glad to here that it was such an "easy" fix and not a deeper problem with the code itself :)
When i try to read back the classifier on page 254 i get the following error. I have done like in the book the whole way and things have worked find until now. Any idea what has gone wrong?
Im using ipython 4.2.0