ptwobrussell / Mining-the-Social-Web-2nd-Edition

The official online compendium for Mining the Social Web, 2nd Edition (O'Reilly, 2013)
http://bit.ly/135dHfs

Example 6.2: cannot build the mbox file correctly #219

Closed alafanechere closed 10 years ago

alafanechere commented 10 years ago

I successfully ran the script to download and extract the Enron data (the last thing it prints is 'Decompression complete - Data is ready'). My problem is that when I run Example 6.2, the script creates the mbox file, but the file is empty. I tried to open it with Apple Mail, which says it's empty. And when I run the next example, the JSON file it creates is also empty. Do you have any idea why this happens? I got no errors from either script...
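A quick way to confirm that the mbox really is empty, independent of Apple Mail, is to count its messages with Python's standard `mailbox` module. The following is a minimal sketch, not part of the book's code; the helper name `inspect_mbox` is hypothetical, and the path you pass in is whatever file Example 6.2 wrote on your machine.

```python
import mailbox
import os


def inspect_mbox(path):
    """Return (size_in_bytes, message_count) for an mbox file.

    Hypothetical helper for diagnosis only. create=False makes
    mailbox.mbox raise NoSuchMailboxError instead of silently
    creating a new, empty file when the path is wrong.
    """
    size = os.path.getsize(path)
    box = mailbox.mbox(path, create=False)
    try:
        # len() scans the file for lines starting with "From ",
        # which is how mbox separates one message from the next.
        count = len(box)
    finally:
        box.close()
    return size, count
```

A size of zero bytes suggests that nothing was written at all (for example, because the script walked a directory that does not exist or contains no mail). A non-zero size with a count of zero suggests that text was written but without the "From " separator lines that mbox readers expect.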

ptwobrussell commented 10 years ago

Sorry to hear that this is causing you problems. I'd be glad to try and help diagnose what the problem may be, but I do want to just double check that you really do want to go through the process as opposed to jumping ahead a couple of cells and using the pre-fabricated JSON that I provide you. (Generally, you'd only want to wade through Examples 6-1/2 if you have a genuine interest in that kind of data munging as opposed to the analysis.)


alafanechere commented 10 years ago

Thanks for this amazingly quick answer! I think jumping ahead is the best option for both of us. The code itself is a bit beyond my understanding of Python, and I have no clue where the problem could be coming from.

I decompressed the JSON archive you provide and am on my way to the rest of the chapter!

Thanks from France.

ptwobrussell commented 10 years ago

I think you've made a good decision. It's probably best to move on to the fun analysis.

Glad to help, and don't hesitate to reach back out if anything else is a problem.


jlm2239 commented 7 years ago

Hi, I am trying to take the Enron data and convert it to a JSON file so that I can run another program I have working that uses k-means to cluster the emails. I tried the code from Example 6.2, but I am getting the error "unknown string format" for _date=asctime(parse(_date).timetuple()). I tried print _date and I get the entire message (so maybe re.search is not working?). Any suggestions? The mbox is created, but it is empty too.
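If `_date` holds the whole message, the date parser is being handed far more than a date, which would explain "unknown string format". One possible way to guard against that is to match only the `Date:` header line and to treat a failed parse as "no date" instead of letting it raise. The sketch below is an illustration under assumptions, not the code from Example 6.2: it swaps in the standard library's `email.utils.parsedate_to_datetime` in place of the `parse` call, and `DATE_RE`, `extract_date`, and `SAMPLE` are hypothetical names and data.

```python
import re
from email.utils import parsedate_to_datetime
from time import asctime

# Hypothetical sample message; real Enron files carry more headers.
SAMPLE = (
    "Message-ID: <123@example>\n"
    "Date: Mon, 14 May 2001 16:39:00 -0700 (PDT)\n"
    "From: someone@example.com\n"
    "\n"
    "Body text\n"
)

# MULTILINE anchors ^ and $ at line boundaries, and "." does not match
# newlines, so the captured group can never run past the Date line.
DATE_RE = re.compile(r"^Date:[ \t]*(.+)$", re.MULTILINE)


def extract_date(raw_message):
    """Return an asctime-style string for the Date header, or None."""
    match = DATE_RE.search(raw_message)
    if match is None:
        return None  # no Date header found
    value = match.group(1).strip()
    try:
        parsed = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        # Older Python versions raise TypeError, newer ones ValueError,
        # when the header is not a recognizable RFC 2822 date.
        return None
    return asctime(parsed.timetuple())


if __name__ == "__main__":
    print(extract_date(SAMPLE))
```

Printing the matched header value for the first few messages is a cheap way to see whether the pattern or the input file is at fault. Messages for which `extract_date` returns `None` can be skipped or logged rather than aborting the whole conversion.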