skasim / gutenberg_scifi

2 stars 1 forks source link

updates for 10/1 session #6

Closed skasim closed 9 years ago

skasim commented 9 years ago

Hi Nathan,

I apologize, but, I merged my work with the master instead of creating a separate branch and a pull request, and only realized later that I can't create a pull request unless there are differences between the master and the branch, so I'll be sure to correct that next time.

I was on the scifi bookshelf on Project Gutenberg and saw the warning:

   Note on the Project Gutenberg HTML page
   DON'T USE THIS PAGE FOR SCRAPING.

   Seriously. You'll only get your IP blocked.

   Download http://www.gutenberg.org/feeds/catalog.rdf.bz2 instead,
   which contains *all* Project Gutenberg metadata in one RDF/XML file.

But I found a solution. It involves pip installing gutenberg. Here are the instructions: https://github.com/c-w/Gutenberg. It doesn't work perfectly on my computer because of the issue with the bsddb (documented here: http://stackoverflow.com/questions/814041/how-to-fix-the-broken-bsddb-install-in-the-default-python-package-on-mac-os-x-10). It looked like I would have to change something in a python file and I didn't want to do that. However, It turns out I can still access the book text, which is what I wanted.

I made good progress this week:

ndanielsen commented 9 years ago

first thought: You might be using the global path in mac instead of the local path in your virtualenv.

ndanielsen commented 9 years ago

Taking a deeper look at your code, and I'm really proud of the progress that you've made and some of the great patterns that you've developed. Now that you've got a good grasp on functions, basic data structures and file i/o, I think that it's time to work on classes (or objects) in python. I'll create a separate ticket.